Machine Learning (2017, Fall) @ National Taiwan University
PM2.5 prediction
Income prediction
Using the Adult dataset from:
https://archive.ics.uci.edu/ml/datasets/adult
Image sentiment
Tweets sentiment
Using the dataset from:
http://thinknook.com/twitter-sentiment-analysis-training-corpus-dataset-2012-09-22/
Movie rating
Using the MovieLens 100K dataset from:
https://grouplens.org/datasets/movielens/100k/
There are three independent parts:
-
Implementing eigenface using SVD
-
Chinese word embedding visualization (word2vec, t-SNE)
-
Dataset separation using unsupervised learning
Chinese QA based on a chinese version of the SQuAD dataset
The model is a modified version of an implementation of Microsoft R-NET:
https://github.com/matthew-z/R-net
Another model is a naïve sliding window method