My first impression of RL comes from AlphaGo: http://www.nature.com/nature/journal/v529/n7587/full/nature16961.html where policy and value networks are used. I realized this is a valuable idea since long term memory could be phrased and guide the behaviors, even at the sacrifice of suffering in the short term. A typical formulation to train the policy network is given in AlphaGp […]
I have a naive understanding of things so far. For example, suppose we’re talking about classification problems. For typical SVM, the classification problem turns into looking for a linear boundary between the data (image from wiki): But usually the boundary is not linear, so that’s why kernel methods (kernel svm, logistic regressions) are introduced, in […]
Here’s a list of different gradient descent optimizers: http://sebastianruder.com/optimizing-gradient-descent The most common one is SDG, which is also the most basic one in TF: https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/training/gradient_descent_test.py Other versions are listed in the training folder: https://github.com/tensorflow/tensorflow/tree/master/tensorflow/python/training Although mentioned in the list, the learning ratio of SGD should be gradually decreasing as iteration # goes up, but there’s no […]
I think it’s time to move on to TensorFlow and embrace the DL world now…. So first, why TensorFlow? It’s mainly due to the following comparison from Dr. Matt Rubashkin: A concise summary of deep learning frameworks I chose TF because it supports multi-GPUs and it’s supposed to be easy to setup — and yes […]
a good discussion about cpu, memory, cache and latency: http://www.csie.ntu.edu.tw/~r89004/hive/cache/page_1.html
1. error: X11/StringDefs.h: No such file or directory Solution: sudo apt-get install libxt-dev re-run cmake Resource:http://adrienbailly.wordpress.com/2010/03/27/error-x11stringdefs-h-no-such-file-or-directory/ 2. installation coptions: tick: “BUILD EXAMPLES” do not tick: “BUILD SHARED LIBS” T.B.A.
I was reading something on algorithm and a question occurred: What’s the difference between Linear Programming and Dynamic Programming? —————————————————————————————————————- After digging on the internet for a while, I found people already had the Q&A 50 years ago. ( A short bio and a list of publication of the author could be found on wiki […]
Last night I was studying tutorial 7, trying to run the codes in VS2010 environment, a problem emerged: “LCDRange.obj : error LNK2001: unresolved external symbol “public: virtual struct QMetaObject const * __thiscall LCDRange::metaObject(void)const “” Tutorial 7 is different from tutorial 6 only by adding a user defined signal and slot, which means the signal/slot suite […]