I've worked jointly between Google New York City and Tel Aviv since January 2016 to improve query understanding
and help lead the Cricket experience on Search.
I graduated with a Bachelor's and a Master's in
Computer Science from Cornell University in 2014, after which I worked on locations and places at Facebook for a year.
I finished high school in Calcutta, India in 2011, prior to which I spent 7 years during elementary school in the Bay Area, California.
I maintain a list of the stuff I enjoy reading, watching, or learning from, called where the knowledge is, and occasionally write on my blog, Writes.
I'm helping organize the world's information in the New York City and Tel Aviv offices, on the Search cricket experience. Google "cricket scores" on mobile.
I also work on machine learning and infrastructure for the knowledge graph, query understanding, and personalization for Search.
I interned on the YouTube Captions team in Mountain View under Loretta Guarino Reid. I shipped the Captions editor and creator to production and met some really smart people.
I contributed to this open-source project as part of a Facebook initiative, met the legendary Evan Priestley and, amongst harder things, created the meme generator.
I went to school here, in Kolkata, for 8 years. It was founded in 1836, so it's pretty old. It turned 175 when I graduated. I was taught Math by David Royan.
In June '13, I published Hacking into the Indian Education System about some code I wrote to expose glaring anomalies in one of India's premier education boards. It amassed a lot of media coverage.
For 5 years now, since 2013, I've been mining and analyzing data and working with media organizations to spread the word about these severe grading issues affecting over a million Indian high school students annually, culminating in a
Delhi High Court order to stop it in May '17. It made headlines multiple times.
In April '14, I open-sourced a XeLaTeX resume template that went viral and has over 1200 stars and 350 forks on the GitHub repo after trending on Reddit and Hacker News. It features on many popular online TeXing platforms and archives.
It is one of the top 10 most popular LaTeX templates on Overleaf, with over 85,000 views.
It initially peaked at number 5 on Hacker News.
The open-source community has made a Resume Generator based on it that peaked at number 2 on HN, and a version in Jade.
I've given several talks at some of India's and the USA's best engineering colleges
about software, technology, my projects and my past experiences.
In February '17 and September '17, I wrote a two-part blog series about my body transformation and the data-driven approach I took to lose 66 lbs, or 30 kg, in the span of 8 months and gain a 6-pack, and keep it off for another 8 while gaining strength. This featured on the front page of Hacker News and in GQ India.
My blog posts on Writes have been featured on Reddit, been widely retweeted, and have front-paged Hacker News five times.
Delhi University is one of India's biggest universities, offering over 50 subjects across more than 50 colleges. Admissions to DU are based on complicated cut-off calculations and innumerable quotas, and one needs to look at multiple cut-off list PDFs being served through websites stuck in the 90s. I scraped and structured the data from DU admissions and provided a clean, user-friendly form so you can check which colleges and subjects you're eligible to be admitted to.
GradCafe has 372 thousand graduate school admissions results. I scraped 93% of that data, including 80 thousand GRE scores and 75 thousand undergraduate GPAs. I also extensively deduplicated the user-provided university names to over 98% cleanliness, and compiled a hyperclean Computer Science-specific set of 28 thousand results. Let's dive into these never-before-seen statistics on graduate school admissions! Retweeted and shared by many professors.
Hacker News, the Y Combinator tech forum, has exploded from its humble beginnings in 2006 to one of the primary sources of news in the tech community. Looking through the data, we can see how HN grew, the things people talked about, the most shared domains, the most influential contributors and more. Received over 100k hits.
Inspired by King James Programming, I decided to write my own generic Hidden Markov Model that generates funny one-liners by doing a random walk between two (or more) works of text. Here is a sample from the mix between the King James Bible and Artificial Intelligence: A Modern Approach. And here's one with the latter and the English translation of the Mahabharata. They're hilarious.
A computer is mostly limes with chords of the terrified, herd of deer.
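The random walk between texts can be sketched roughly as follows. This is not my original code: it is a minimal illustration using a plain word-level Markov chain (a simpler cousin of a hidden Markov model), and the function names `build_chain` and `generate`, the `order` parameter, and the toy sentences in the usage note are all assumptions made up for the example.

```python
import random
from collections import defaultdict


def build_chain(texts, order=2):
    """Map each run of `order` words to the words that follow it.

    Transitions from every text are pooled into a single table, which is
    what lets a walk drift from one source into another whenever the two
    sources share a state.
    """
    chain = defaultdict(list)
    for text in texts:
        words = text.split()
        for i in range(len(words) - order):
            state = tuple(words[i:i + order])
            chain[state].append(words[i + order])
    return chain


def generate(chain, max_words=20, seed=None):
    """Random-walk the chain from a random starting state."""
    if not chain:
        return ""
    rng = random.Random(seed)
    state = rng.choice(sorted(chain))  # sorted so a given seed is repeatable
    order = len(state)
    output = list(state)
    while len(output) < max_words:
        followers = chain.get(tuple(output[-order:]))
        if not followers:  # dead end: no source continues from this state
            break
        output.append(rng.choice(followers))
    return " ".join(output)
```

For instance, `generate(build_chain([bible_text, textbook_text]), seed=42)` would produce one mixed line, where `bible_text` and `textbook_text` are hypothetical strings holding the two source works. A smaller `order` mixes the sources more aggressively at the cost of grammar.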
Torrage is a super secure peer-to-peer cloud storage platform written in C, where your password never leaves your machine and your files never leave your machine in one piece or unencrypted. It's based on the BitTorrent protocol. Read the paper here. I'm going to open source the code soon.
Worked with tGELF to aggregate and analyze over one and a half million tweets related to the 2014 Indian General Elections. We visualized interesting trends with time, location, and sentiment.
I developed a player trading model for the Fantasy Indian Premier League, the world's biggest cricket league. It was the first such model to compete in the actual fantasy league. It used Binary Integer Programming, computationally outperforming previous research in the field by 33x, and reached the 99.54th percentile out of over 400,000 participants. Read the paper here.
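To give a flavor of what a binary integer program looks like here, the sketch below picks a fixed-size squad that maximizes expected points under a budget, with one 0/1 decision per player. It is only an illustration, not the model from the paper: the player data, the single budget constraint, and the function name `pick_team` are hypothetical, and it solves the program by exhaustive enumeration rather than with a real integer programming solver.

```python
from itertools import combinations


def pick_team(players, budget, team_size):
    """Solve a tiny 0/1 selection problem by brute force.

    players: list of (name, cost, expected_points) tuples.
    Maximizes total expected points subject to
    total cost <= budget and exactly `team_size` players chosen.
    Returns (names, points), or (None, None) if nothing is feasible.
    """
    best_team, best_points = None, None
    for team in combinations(players, team_size):
        cost = sum(p[1] for p in team)
        points = sum(p[2] for p in team)
        if cost > budget:
            continue  # violates the budget constraint
        if best_points is None or points > best_points:
            best_team, best_points = team, points
    if best_team is None:
        return None, None
    return [p[0] for p in best_team], best_points


# Hypothetical players: (name, cost, expected points)
players = [("A", 9, 50), ("B", 8, 45), ("C", 6, 30), ("D", 5, 28), ("E", 4, 20)]
team, points = pick_team(players, budget=20, team_size=3)
```

Enumeration only works for toy sizes like this; a league-scale problem needs a proper solver, since the number of candidate squads grows combinatorially.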
Marauder lets players create and join maps that continuously but efficiently log location data from their phones and display live locations of all players on the map with very low latency, much like the Marauder's Map. We used Tornado, WebSocket, and Redis amongst other technologies, and built a web app and iOS app.
Finally ported my obsolete Word resume into LaTeX and open-sourced it! There aren't many half-decent resume templates out there. I also compiled a few of my favorite LaTeX templates. The template has over 700 stars and is the 4th most popular English resume creating system on GitHub.
I developed two trading models: a volatility-sensitive Markowitz model for small investors, and a Support Vector Machine model limited to trading only on SPDR ETFs. Both managed to outperform baselines and, in the first case, to do so with half the risk. Read the paper here. This paper was tweeted by Carl Carrie, former Global Head of Algorithmic Products at JP Morgan and Head of Commodity Technology at Barclays.
I worked in a team of 4 to create a compiler from scratch in Java for Cubex. We used ANTLR, and implemented some optimizations. Ross Tate taught us and he's awesome.
Wrote a GUI and an orbit analysis tool, and did some code refactoring on a simulation for a nanosatellite that will actually launch into space.
I've also created a fully rendered, realistic and interactive 3D Game with a random terrain generator, a Ray Tracer renderer, a multithreaded SMTP Server, my own malloc, a fully functional filesystem with a multithreaded garbage collector, a MapReduce in OCaml, a Pokémon playing bot, a MIPS Processor, and Breakout.
I've also created a simple algorithmic trading simulator, tools to scrape the Cornell University student database, a Typeracer hack, a Cornell instant course search tool called Instudy, classic games like Pong, Tetris and Minesweeper, and a 3D Graphing utility.
I enjoy reasoning critically about and debating controversial topics. Topics that particularly interest me are India, education, politics, technology, and economics. I tweet my thoughts at @debarghya_das.
My Myers-Briggs Type Indicator (MBTI) is INTP [Introverted iNtuitive Thinking Perceiving]. My favorite people I supposedly share my personality type with are Albert Einstein, Larry Page and Sergey Brin. You too should take the test here. As of late '17, I test as an ENTP.
I love Hindi music and have a large Spotify playlist going, cleverly marketed as the Best Hindi Playlist on Spotify. I also like Michael Jackson, music from different countries, and my throwback high school Rock and Metal playlist.