Name | DL | Torrents | Total Size | Social Networking [edit] | 8 | 3.93GB | 58 | 0 | Cyber Security [edit] | 2 | 3.25GB | 20 | 0 | ML LD Prime Datasets [edit] | 6 | 3.26TB | 17 | 0 | The Cybersecurity Academic's Seedbox [edit] | 24 | 1.29TB | 127 | 0 |
fbnames (11 files)
facebook-urls.txt.bz2 | 1.39GB |
facebook-names-withcount.txt.bz2 | 477.27MB |
facebook-names-unique.txt.bz2 | 479.33MB |
facebook-names-original.txt.bz2 | 504.06MB |
facebook-lastnames-withcount.txt.bz2 | 21.17MB |
facebook-firstnames-withcount.txt.bz2 | 16.38MB |
README | 1.71kB |
facebook-f.last-withcount.txt.bz2 | 66.35MB |
facebook-first.l-withcount.txt.bz2 | 36.93MB |
facebook.rb | 1.22kB |
facebook.nse | 3.91kB |
Type: Dataset
Tags:
Bibtex:
Tags:
Bibtex:
@article{, title= {Facebook Names Dataset}, keywords= {}, journal= {}, author= {Ron Bowes (Skull Security)}, year= {2010}, url= {https://blog.skullsecurity.org/2010/return-of-the-facebook-snatchers}, license= {}, abstract= {171 million names (100 million unique) This torrent contains: The URL of every searchable Facebook user's profile The name of every searchable Facebook user, both unique and by count (perfect for post-processing, datamining, etc) Processed lists, including first names with count, last names with count, potential usernames with count, etc The programs I used to generate everything So, there you have it: lots of awesome data from Facebook. Now, I just have to find one more problem with Facebook so I can write "Revenge of the Facebook Snatchers" and complete the trilogy. Any suggestions? >:-) Limitations So far, I have only indexed the searchable users, not their friends. Getting their friends will be significantly more data to process, and I don't have those capabilities right now. I'd like to tackle that in the future, though, so if anybody has any bandwidth they'd like to donate, all I need is an ssh account and Nmap installed. An additional limitation is that these are only users whose first characters are from the latin charset. I plan to add non-Latin names in future releases.}, tos= {}, superseded= {}, terms= {} }