#Semantic Search engine – NLP lab
###17/05/2016 Task
Use the above blog to scrap the following information and show in terminal (ubuntu) or in a file in windows.
1.Heading of each blog article.
2.Date of posting each article.
3.Comments (if any) from each article.
4. Image url for each article.
5. On each page there are multiple blog snippets present, we want them all.
Hints : python-bs4, python-urllib2 search these you will get the idea
Please refer this image
https://drive.google.com/open?id=0B0Mf1CuHV_44N0pPWERMSGhHNk0
sudo apt-get install lamp-server^
sudo apt-get update
sudo apt-get install phpmyadmin
sudo php5enmod mcrypt
sudo service apache2 restart
You can now access the web interface by visiting your server's domain name or public IP address followed by /phpmyadmin:
http://localhost/phpmyadmin
For installing phpmyadmin Visit https://www.digitalocean.com/community/tutorials/how-to-install-and-secure-phpmyadmin-on-ubuntu-14-04
http://www.aossama.com/search-engine-with-apache-nutch-mongodb-and-elasticsearch/
https://www.ntu.edu.sg/home/ehchua/programming/sql/MySQL_Beginner.html
http://www.sitepoint.com/sql-vs-nosql-differences/SQL%20Schema%20vs%20NoSQL%20Schemaless
https://www.elastic.co/guide/en/elasticsearch/reference/current/index.html
https://github.com/anmolsachan/NLPWorkshop/archive/master.zip
https://university.mongodb.com/?jmp=docs%2F&_ga=1.226181715.87059799.1463421535
#Hadoop
http://bradhedlund.com/2011/09/10/understanding-hadoop-clusters-and-the-network/
http://www.plottingsuccess.com/hadoop-101-important-terms-explained-0314/
http://www.edupristine.com/blog/hadoop-installation-using-ambari
https://blog.cloudera.com/blog/2013/04/how-scaling-really-works-in-apache-hbase/