Yusef Mohamadi yuseferi

Detecting Similar News

The following gist is an extract of the article Detecting Similar News. It uses data retrieved by a crawler to detect similar articles across different domains.
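The gist excerpt does not show how similarity is computed, so the following is only a minimal sketch of one common approach: Jaccard similarity over word shingles, restricted to pairs from different domains. The function names, the `articles` structure (dicts with `domain`, `url`, and `text` keys), the shingle size, and the threshold are all assumptions made for illustration, not the article's actual method.

```python
import re
from itertools import combinations


def shingles(text, k=3):
    """Return the set of k-word shingles found in the lowercased text."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    if len(words) < k:
        # Too short for a full shingle: treat the whole text as one shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def similar_pairs(articles, threshold=0.3):
    """Yield (url_a, url_b, score) for cross-domain pairs at or above threshold.

    `articles` is a hypothetical list of dicts with 'domain', 'url', 'text'.
    """
    shingled = [(article, shingles(article["text"])) for article in articles]
    for (a, set_a), (b, set_b) in combinations(shingled, 2):
        if a["domain"] == b["domain"]:
            continue  # only compare articles from different domains
        score = jaccard(set_a, set_b)
        if score >= threshold:
            yield a["url"], b["url"], score
```

Comparing every pair is quadratic in the number of articles, which is fine for a small crawl; a larger corpus would likely need an indexing scheme such as MinHash, which is outside the scope of this sketch.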

Usage

Start by running the crawler to retrieve the data. The first run takes about 50 minutes to retrieve all the data.

$ python run.py

retrieving url... [techcrunch.com] /

	# 2017 - @leonjza
	#
	# Wordpress 4.7.0/4.7.1 Unauthenticated Content Injection PoC
	# Full bug description: https://blog.sucuri.net/2017/02/content-injection-vulnerability-wordpress-rest-api.html

	# Usage example:
	#
	# List available posts:
	#
	# $ python inject.py http://localhost:8070/

	<?php
	/*
	PHP script to import a MySQL database dump into Heroku ClearDB
	By Shayan Eskandari 2015

	Export your MySQL dump from phpMyAdmin or with the mysqldump command line -> database_schema.sql
	You have to modify the dump and remove the CREATE DATABASE statement and schema details.
	Here is one example:

	//database_schema.sql