Probabilistic Reasoning

March 16, 2017

It’s Monday morning, and you just got up. You look at the clock, and it tells you the time is now 7.45am. You have a presentation that starts at 8.30am, so that gives you exactly 45 minutes to get to the office. Now, you have a few decisions to make here. Breakfast, yes or no? Mode of transportation: bus, metro, or bike? Clothing preparation: throw on last week’s shirt or iron fresh clothes? ... Read more

Kaggle: Titanic

September 4, 2016

I made an attempt at building a predictor for the Kaggle Titanic data challenge. You can check out the Jupyter notebook (Python) on GitHub.

Pokémon Go: A Killer Catch

August 6, 2016

Pokémon Go was released roughly six weeks ago, and it’s already a phenomenon that is moving people. Literally. Whether they’re trying to hatch an egg, searching for new Pokémon to capture, or looking for the next gym battle to conquer new turf, people are out and about with their Pokédex in hand, walking the Earth. It’s a must if you want to become the world’s premier Pokémon master; and judging by the number of users of the app, almost every young adult and child with a smartphone does. ... Read more

JsonURI: json serialization and deserialization for logging

May 28, 2016

A while back, while working on the infrastructure of an e-commerce recommendations service provider, we ran into problems handling traffic from our clients in real time. As a simple solution, we decided to send data to logs through AWS S3 by appending HTTP URL parameters to GET requests for a tiny image file; something that should only be done when you’re not dealing with sensitive data. We had one minor issue, though: our JSON objects were complex and nested, with objects inside fields, while the JavaScript and jQuery standard libraries only supported serialization of flat JSON objects. ... Read more
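To make the nested-versus-flat problem concrete, here is a minimal Python sketch of one way to turn a nested object into flat URL parameters and back. It is not the JsonURI implementation or its API; the dotted-path key scheme and the function names (`flatten`, `to_query_string`, `from_query_string`) are assumptions chosen for illustration.

```python
from urllib.parse import urlencode, parse_qsl


def flatten(obj, prefix=""):
    """Flatten nested dicts/lists into a flat {dotted.path: scalar} dict."""
    items = {}
    if isinstance(obj, dict):
        for key, value in obj.items():
            path = f"{prefix}.{key}" if prefix else str(key)
            items.update(flatten(value, path))
    elif isinstance(obj, list):
        for index, value in enumerate(obj):
            path = f"{prefix}.{index}" if prefix else str(index)
            items.update(flatten(value, path))
    else:
        # Scalar leaf: store it under the accumulated path.
        items[prefix] = obj
    return items


def to_query_string(obj):
    """Serialize a nested object as URL query parameters."""
    return urlencode(flatten(obj))


def from_query_string(query):
    """Rebuild the nesting from dotted keys.

    Assumption of this sketch: all values come back as strings, and
    list indexes come back as dict keys ("0", "1", ...).
    """
    result = {}
    for path, value in parse_qsl(query):
        *parents, leaf = path.split(".")
        node = result
        for part in parents:
            node = node.setdefault(part, {})
        node[leaf] = value
    return result


# Hypothetical event payload, for illustration only.
event = {"user": {"id": 42, "tags": ["a", "b"]}, "page": "home"}
query = to_query_string(event)
restored = from_query_string(query)
```

The sketch has known limits: keys that themselves contain a dot become ambiguous, empty objects and lists are dropped, and type information is lost on the way back. A real serializer would need an escaping rule and a type convention to round-trip faithfully.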

YelpSí: Visualizing Yelpers Daily Activity

January 29, 2016

Find out which places Yelpers check in to most with YelpSí, a visualization tool that lets you explore Yelpers’ past daily check-in activity across different cities in the US and Europe. It was built with Shiny, using the Yelp Dataset Challenge academic dataset.

Indexing UIMA Annotated Docs, with Solr

January 7, 2016

In this post, I’m going to walk you through the process of indexing UIMA annotated documents with Solr. In a previous post, Finding Movie Starts, I demonstrated how we can use UIMA to find and tag structured information in unstructured data. In most scenarios, once we have extracted that data, we want to be able to query it. To do this, we can put our data into a data store, be it an RDBMS, a document store, a graph database, or something else. ... Read more

Falcon 9

December 31, 2015

Being up late at night, days before the Christmas holidays, cramming for school projects was a long-forgotten memory for me. Being back at university after a period of work, though, I found it a less dreaded experience. Still, I was ready to welcome any distraction. Having been told that this was the night SpaceX would attempt a miraculous landing, I welcomed it with open arms. After all, watching a live rocket launch beat the heck out of figuring out seasonal components for time series at 2 in the morning. ... Read more

Programming

December 26, 2015

The art of writing was invented, I suppose, so that we could communicate with the future, i.e. record the past and present, and in the process create history. Software programs, on the other hand, are written for the purpose of defining the future. One that is meant to be interpreted by machines. The tapping of a keypad turns a blank page into a blueprint. It starts with one file, and can quickly grow larger. ... Read more