Skip to content

Instantly share code, notes, and snippets.

[ 37.8708393 -122.2728639] 240.04118850187228
[ 37.86244859 -122.26466766] 239.68142099137842
[ 37.853691 -122.25534538] 239.29099560290098
[ 37.8440049 -122.2462379] 238.88792442436258
[ 37.83574267 -122.23624132] 238.4975920293918
[ 37.82748868 -122.22757974] 238.13829762342573
[ 37.81749266 -122.21934141] 237.7536954896114
[ 37.80977947 -122.20958085] 237.3856572182171
[ 37.80205436 -122.2005225 ] 237.0341329338361
[ 37.79307632 -122.19075968] 236.6436680470766
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

Data Sets

The data sets are a result of three crawls of mobile web, as documented in our ACM CCS paper, in May 2018. Two of crawls were performed from the University of Illinois (US1 and US2) and a third was from a data center in Frankfurt (EU1). The crawls visited 100,000 websites as taken from Alexa top sites. The list of sites used and their corresponding ranks is included as site-list.csv.

File Contents

Raw OpenWPM files

The files <crawl>-crawl-data.sqlite.xz and <crawl>-javascript.ldb.tar.xz (e.g., US1-crawl-data.sqlite.xz) contain the raw data generated by OpenWPM, as described in https://github.com/citp/OpenWPM#output-format. The crawl data file contains an sqlite3 database (compressed using xz) with instrumentation data from each web page load, and the javascript database contains all of the scripts fetched while loading a site, stored using LevelDB instance (and then archived using tar and xz).

Feature files

@nikitaborisov
nikitaborisov / combinations.py
Created May 27, 2018 20:48
count number of item combinations
import functools
MENU = (
('cake', 99),
('cupcake', 20),
('donut', 10),
('muffin', 25),
('cookie', 5)
)
PRICE = 1035
@nikitaborisov
nikitaborisov / ls.dot
Created September 6, 2017 03:05
IDA-generated call graph for ls
digraph G {
"0" [label="_main" ];
"1" [label="_set_program_name" ];
"2" [label="_setlocale" ];
"3" [label="_close_stdout" ];
"4" [label="_atexit" ];
"5" [label="_isatty" ];
"6" [label="_set_quoting_style" ];
"7" [label="_getenv" ];
"8" [label="_argmatch" ];