Archive
66 posts
2025
2017
- Sep 24 Provision AWS EC2 cluster with Spark version 2.x
- Mar 12 Streaming with Apache Storm
- Mar 1 Data ingestion and loading: Flume, Sqoop, Hive, and HBase
- Feb 26 Streaming processing (III): Best Spark Practice
- Feb 25 Streaming processing (II): Best Kafka Practice
- Jan 6 Streaming processing (I): Kafka, Spark, Avro Integration
2016
- Dec 25 Deep Sentiment Prediction as Web Service
- Jun 19 Track my sports
- Mar 17 Deploy ELK stack on Amazon AWS
- Feb 17 Build a simple web application with Amazon AWS
- Feb 1 Spark on time series preference data
- Jan 31 GPU computation on Amazon EC2
- Jan 21 Cool stuff at the NIPS 2015 conference - Neural Style
- Jan 5 Cool stuff in NIPS 2015 (symposium) - Neural Style
- Jan 1 A super fabulous beginning of a super great year 2016
2015
- Dec 31 Data science in the next 50 years - are machine learning and statistics complementary?
- Dec 26 Cool stuff in NIPS 2015 (workshop) - Non-convex optimization in machine learning
- Dec 25 Cool stuff in NIPS 2015 (workshop) - Time series
- Dec 21 A rich and dynamic December
- Dec 20 My research on machine learning and AI
- Dec 16 NIPS conference 2015
- Dec 15 Me
- Nov 19 Build web applications with Flask+Heroku
- Nov 11 Calendar view of data in Jekyll with D3.js
- Nov 9 Xplanner in Junction Hackathon 2015
- Nov 2 Documentation and test modules for Python
- Oct 29 Teaser solution
- Oct 20 Pabulo, my lovely cat
- Oct 19 Chinese National Day celebration at the Chinese embassy in Helsinki
- Oct 19 Spark regression models
- Oct 18 Spark classification models
- Oct 13 Spark with Python: collaborative filtering
- Oct 12 Feature extraction, selection and predictive modeling with Scikit
- Oct 10 Novelty detection and outlier detection with Scikit
- Aug 28 One class classification with Scikit
- Aug 25 Predicting transporter proteins
- Aug 24 Searching Algorithm
- Aug 20 BFS and DFS
- Aug 18 SQL related
- Aug 16 Compute TF-IDF with Hadoop Python
- Aug 15 MapReduce with Hadoop via Python with examples
- Aug 13 Scikit: A machine learning package for Python
- Aug 12 Get Emoji support for Jekyll pages
- Aug 12 Outstanding doctoral candidate award of 2014
- Aug 3 Heap
- Jul 30 Stack and Queue
- Jul 29 Dynamic programming related problems
- Jul 29 Recursion
- Jul 27 Set up Hadoop on macOS
- Jul 26 Spark via Python: basic setup, count lines, and word counts
- Jul 22 Palindrome problems
- Jul 19 SQL refreshment
- Jul 17 Sorting algorithms
- Jul 12 Bit integer for operating large numbers
- Jun 17 Feature extraction for protein sequences via InterProScan
- Jun 16 Sequence alignment with NCBI-BLAST search
- Jun 10 Tiny little bit of Python Pandas
- Jun 9 Facebook challenge of detecting robots
- May 22 A projected Newton method for optimizing structured output model
- May 21 Spark with Python: optimization algorithms
- May 17 Spark with Python: linear models in MLlib
- May 15 Some useful coding techniques
- May 12 Spark with Python: configuration and a simple Python script
- May 11 The quickest way to blog, GitHub + Jekyll
2011
- Dec 29 Untitled