
Academic Torrents by ACADEMICTORRENTS.COM. Type: collection. Items: 2,264. Views: 175,213.

Welcome to Academic Torrents! Making 14.15TB of research data available. We've designed a distributed system for sharing enormous datasets - for researchers, by researchers. The result is a scalable, secure, and fault-tolerant repository for data, with blazing fast download speeds.



Complete Public Reddit Comments Corpus. Collection: The Dataset Collection. Type: data. Views: 49,698. Favorites: 8. Comments: 3.

(Here is the original Reddit comment announcing this collection of data and what the processes were.) This is an archive of Reddit comments from October of 2007 until May of 2015 (complete month). This reflects 14 months of work and a lot of API calls. This dataset includes nearly every publicly available Reddit comment. Approximately 350,000 comments out of ~1.65 billion were unavailable due to Reddit API issues. Q: How are the files structured? Each file is compressed with bzip2 compression....

Rating: 5 out of 5 stars (3 reviews)
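As a minimal sketch of how one of the bzip2-compressed files described above might be read without decompressing it to disk first: the code below assumes each file holds one JSON object per line, which the truncated description does not confirm. The function name `count_by_field`, the default field `subreddit`, and the file name in the usage note are hypothetical.

```python
import bz2
import json
from collections import Counter


def count_by_field(path, field="subreddit", limit=None):
    """Stream a bzip2-compressed file of JSON records and tally one field.

    Assumes one JSON object per line (an assumption, not stated in the source).
    `limit` caps the number of lines read, which helps when sampling a large file.
    """
    counts = Counter()
    # bz2.open in text mode decompresses incrementally, so memory use stays small.
    with bz2.open(path, mode="rt", encoding="utf-8") as handle:
        for line_number, line in enumerate(handle):
            if limit is not None and line_number >= limit:
                break
            line = line.strip()
            if not line:
                continue  # skip blank lines rather than failing on them
            record = json.loads(line)
            counts[record.get(field)] += 1
    return counts
```

A call such as `count_by_field("RC_2015-05.bz2", limit=100000)` (file name hypothetical) would sample the first 100,000 lines of one monthly file.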



Dark Net Market archives, 2011-2015 by Gwern Branwen. Collection: The Dataset Collection. Type: data. Views: 32,353. Favorites: 10. Comments: 1.

Dark Net Markets (DNM) are online markets typically hosted as Tor hidden services whose users transact in Bitcoin or other cryptocoins, usually for drugs or other illegal/regulated goods; the most famous DNM was Silk Road 1, which pioneered the business model. From 2013-2015, I scraped/mirrored on a weekly or daily basis all existing English-language DNMs as part of my research into their usage, lifetimes/characteristics, & legal riskiness; in addition, I made or obtained copies of as many...

Rating: 5 out of 5 stars (1 review)

Topics: Tor, Bitcoin, drugs, Silk Road, Evolution, Agora, black-markets, dark net markets



One Million Audio Cover Images for Research by Internet Archive. Collection: The Dataset Collection. Type: data. Views: 21,328. Favorites: 8. Comments: 1.

Culled from various sources, this collection includes over one million JPG, PNG and GIF album covers. The resolution ranges from "thumbnail" through to very large sizes. Filenames are variant in usefulness, although a good number indicate at least the name of the original album. This dataset is for experimentation and image processing research only. At 148gb, the collection is large but not unmanageable (there is a torrent available) and allows a developer or artist to work with the...

Rating: 5 out of 5 stars (1 review)

Topics: dataset, big data, album covers, covers, cover art, cover photos



NYC Taxi Trip Data 2013 (FOIA/FOIL) by NYC Taxi and Limousine Commission. Collection: The Dataset Collection. Type: data. Views: 15,696. Favorites: 3. Comments: 0.

FOIA/FOILed Taxi Trip Data from the NYC Taxi and Limousine Commission 2013. Released by http://chriswhong.com/open-data/foil_nyc_taxi/ . trip_data.7z and trip_fare.7z are more efficiently compressed versions of the data; you probably want these files. The data is in CSV format. The data files include the fields: medallion, hack_license, vendor_id, rate_code, store_and_fwd_flag, pickup_datetime, dropoff_datetime, passenger_count, trip_time_in_secs, trip_distance, pickup_longitude,...

Topics: data, nyc, taxi, fare, csv, FOIA, FOIL

Source: torrent:urn:sha1:6c594866904494b06aae51ad97ec7f985059b135
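A minimal sketch of how the taxi trip CSV described above might be summarized once extracted from the archive. It assumes the first row of each file is a header naming the fields listed in the description (only `trip_time_in_secs` and `trip_distance` are used here); the function name `summarize_trips` and the file name in the usage note are hypothetical.

```python
import csv


def summarize_trips(path):
    """Return (trip count, mean distance, mean duration in seconds) for one CSV.

    Assumes a header row containing `trip_distance` and `trip_time_in_secs`.
    Rows whose values cannot be parsed as numbers are skipped.
    """
    trips = 0
    total_distance = 0.0
    total_seconds = 0
    with open(path, newline="", encoding="utf-8") as handle:
        # skipinitialspace tolerates "a, b, c" style separators in header and rows.
        reader = csv.DictReader(handle, skipinitialspace=True)
        for row in reader:
            try:
                distance = float(row["trip_distance"])
                seconds = int(row["trip_time_in_secs"])
            except (TypeError, ValueError):
                continue  # malformed or short row
            trips += 1
            total_distance += distance
            total_seconds += seconds
    if trips == 0:
        return 0, 0.0, 0.0
    return trips, total_distance / trips, total_seconds / trips
```

For example, `summarize_trips("trip_data_1.csv")` (file name hypothetical) streams the file row by row, so it does not need to fit in memory.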



Melody datasets generated by All the Music, LLC by All the Music, LLC. Collection: The Dataset Collection. Type: audio. Views: 15,217. Favorites: 22. Comments: 9.

From: https://www.vice.com/en_uk/article/wxepzw/musicians-algorithmically-generate-every-possible-melody-release-them-to-public-domain : Musicians Algorithmically Generate Every Possible Melody, Release Them to Public Domain Damien Riehl and Noah Rubin generated and saved every possible melody to a hard drive, then turned it back around to the commons. From: https://www.dailymail.co.uk/sciencetech/article-8042979/Musician-uses-computer-algorithm-compose-melody-thats-possible-key-C.html :...

Rating: 3 out of 5 stars (9 reviews)



[Coursera] Compilers by Alex Aiken (Stanford University). Collection: Academic Torrents. Type: movies. Views: 6,841. Favorites: 9. Comments: 0.

This course will discuss the major ideas used today in the implementation of programming language compilers. You will learn how a program written in a high-level language designed for humans is systematically translated into a program written in low-level assembly more suited to machines!

Source: http://academictorrents.com/details/e31e54905c7b2669c81fe164de2859be4697013a



CAT Dataset by Weiwei Zhang, Jian Sun, and Xiaoou Tang. Collection: The Dataset Collection. Type: data. Views: 5,933. Favorites: 2. Comments: 0.

This dataset was mirrored from http://137.189.35.203/WebUI/CatDatabase/catData.html, which circa May 2017 is a dead link. The original page is available in Wayback: https://web.archive.org/web/20150520175645/http://137.189.35.203/WebUI/CatDatabase/catData.html The CAT dataset includes 10,000 cat images. For each image, we annotate the head of the cat with nine points: two for the eyes, one for the mouth, and six for the ears. The detailed configuration of the annotation is shown in Figure 6 of the original paper:...

Topics: cats, datasets, computer vision



Topics: Coursera, dsp

Source: http://academictorrents.com/details/43d881e5128841876104742314ccd9851901f460



MusicBrainz Data Dumps. Type: collection. Items: 684. Views: 4,611.

The MusicBrainz Database is built on the PostgreSQL relational database engine and contains all of MusicBrainz' music metadata. This data includes information about artists, release groups, releases, recordings, works, and labels, as well as the many relationships between them. The database also contains a full history of all the changes that the MusicBrainz community has made to the data. Core data: Artists: name, sort name, IPI, aliases, type, begin and end dates, disambiguation comment, MBID...



Source: http://academictorrents.com/details/1448261dd6932e549ba4a86b5d6750aae858d003



I took the Reddit comment archive and converted all the JSON into one SQLite database using this program that I wrote: https://gist.github.com/ers35/3b615a75fa0ed5e6d5cc I ran a few tests to make sure the number of database rows matches the number of JSON records. "SELECT MAX(rowid) FROM comment" and "SELECT COUNT(id) FROM comment" both return 1659361605. This gives me some confidence as to the integrity of the dataset, but I cannot be 100% sure. The compressed size is 163G....
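The row-count check quoted above can be sketched with Python's standard sqlite3 module. The table name `comment` and the column `id` come from the quoted queries; the function name `row_counts_match` and the database path in the usage note are hypothetical. Note that matching values only suggest there are no gaps, assuming rowids start at 1 and no `id` is NULL.

```python
import sqlite3


def row_counts_match(db_path):
    """Run the two queries from the description and compare their results.

    Returns (max_rowid, id_count, whether the two values are equal).
    """
    connection = sqlite3.connect(db_path)
    try:
        # Highest rowid assigned in the table.
        max_rowid = connection.execute(
            "SELECT MAX(rowid) FROM comment"
        ).fetchone()[0]
        # Number of rows with a non-NULL id.
        id_count = connection.execute(
            "SELECT COUNT(id) FROM comment"
        ).fetchone()[0]
    finally:
        connection.close()
    return max_rowid, id_count, max_rowid == id_count
```

Calling `row_counts_match("reddit_comments.sqlite")` (path hypothetical) on the full database would be expected to report the same figure for both queries if the import is complete; on a table this size both queries may take a while.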



[Coursera] Probabilistic Graphical Models by Stanford University. Collection: Academic Torrents. Type: movies. Views: 3,917. Favorites: 0. Comments: 0.

In this class, you will learn the basics of the PGM representation and how to construct them, using both human knowledge and machine learning techniques. Uncertainty is unavoidable in real-world applications: we can almost never predict with certainty what will happen in the future, and even in the present and the past, many important aspects of the world are not observed with certainty. Probability theory gives us the basic foundation to model our beliefs about the different possible states of...

Source: http://academictorrents.com/details/e74f08f0fc699e84a9eb046309727d07d80171c5



When you take a digital photo with your phone or transform the image in Photoshop, when you play a video game or watch a movie with digital effects, when you do a web search or make a phone call, you are using technologies that build upon linear algebra. Linear algebra provides concepts that are crucial to many areas of computer science, including graphics, image processing, cryptography, machine learning, computer vision, optimization, graph algorithms, quantum computation, computational...

Source: http://academictorrents.com/details/54cd86f3038dfd446b037891406ba4e0b1200d5a



Source: http://academictorrents.com/details/0342ad0bd7ef06eb1500b5e7c8ef398060827c4e



Artificial Intelligence(EDX) by ColumbiaX. Collection: Academic Torrents. Type: movies. Views: 3,629. Favorites: 2. Comments: 0.

Seed

Topics: AI, edx, artifical, intelligence, course, deep, learning, natural, language, processing, ColumbiaX,...

Source: http://academictorrents.com/details/5daa22057577521a378b71e0f0de6a934bd5c2ea



Source: http://academictorrents.com/details/f07203f2eedb4792c351ba0e28406dab9ab54d7d



[Coursera] Introduction to Mathematical Thinking by Dr. Keith Devlin (Stanford University). Collection: Academic Torrents. Type: movies. Views: 3,518. Favorites: 3. Comments: 0.

About this course: Learn how to think the way mathematicians do - a powerful cognitive process developed over thousands of years. The goal of the course is to help you develop a valuable mental ability – a powerful way of thinking that our ancestors have developed over three thousand years. Mathematical thinking is not the same as doing mathematics – at least not as mathematics is typically presented in our school system. School math typically focuses on learning procedures to solve highly...

Source: http://academictorrents.com/details/2b5e5cc8c7414bc3b0f6974190065bc8c2f629dc



Source: http://academictorrents.com/details/7bfcfbaf2c53588b23ba1ebccae47a2b9c5197b7



Large sets of malware examples for the purposes of research, comparison, and history. This is the Various set, which is a volume of specific smaller sets of malware.



Source: http://academictorrents.com/details/e8b1f9c5bf555fe58bc73addb83457dd6da69630



Course Description: This course provides a broad introduction to machine learning and statistical pattern recognition. Topics include: supervised learning (generative/discriminative learning, parametric/non-parametric learning, neural networks, support vector machines); unsupervised learning (clustering, dimensionality reduction, kernel methods); learning theory (bias/variance tradeoffs; VC theory; large margins); reinforcement learning and adaptive control. The course will also discuss recent...

Topics: machine learning, statistics, Regression

Source: http://academictorrents.com/details/da90dedfb78190e5c62af1ad40a2413cb918457f



Source: http://academictorrents.com/details/cb91a3d7a4c4c086be240b54e83ed8d587b31ff5



Course Description from Professor Plous: Each of us is dealt a different hand in life, but we all face similar questions when it comes to human behavior: What leads us to like one person and dislike another? How do conflicts and prejudices develop, and how can they be reduced? Can psychological research help protect the environment, and if so, how? This course offers an introduction to classic and contemporary social psychology, covering topics such as decision making, persuasion, group...

Topics: coursera, social, psychology, mooc

Source: http://academictorrents.com/details/3374eb064817a8edd12167b6e9e1300b13d9f08a



[Coursera] Natural Language Processing by Michael Collins (Columbia University). Collection: Academic Torrents. Type: movies. Views: 3,114. Favorites: 4. Comments: 0.

Course Description: COMS W4705 is a graduate introduction to natural language processing, the study of human language from a computational perspective. We will cover syntactic, semantic and discourse processing models. The emphasis will be on machine learning or corpus-based methods and algorithms. We will describe the use of these methods and models in applications including syntactic parsing, information extraction, statistical machine translation, dialogue systems, and summarization....

Source: http://academictorrents.com/details/f99e7184fca947ee8f77901679e171fcadbf82e7



[Coursera] Heterogeneous Parallel Programming by Wen-mei W. Hwu (University of Illinois). Collection: Academic Torrents. Type: movies. Views: 3,071. Favorites: 2. Comments: 0.

This course introduces concepts, languages, techniques, and patterns for programming heterogeneous, massively parallel processors. Its contents and structure have been significantly revised based on the experience gained from its initial offering in 2012. It covers heterogeneous computing architectures, data-parallel programming models, techniques for memory bandwidth management, and parallel algorithm patterns. All computing systems, from mobile to supercomputers, are becoming heterogeneous,...

Source: http://academictorrents.com/details/8903d0871c652b96c7b29db738cea76902d65888



Source: http://academictorrents.com/details/d2c8f8f1651740520b7dfab23438d89bc8c0c0ab



"Demanding, but definitely doable. Social, but educational. A focused topic, but broadly applicable skills. CS50 is the quintessential Harvard (and Yale!) course." Hello, world! This is CS50 (aka CS50x through edX), Harvard University's introduction to the intellectual enterprises of computer science and the art of programming. This course teaches students how to think algorithmically and solve...

Topics: Science, course, introduction, computer, cs50x, harvard, yale

Source: http://academictorrents.com/details/52da574b6412862e199abeaea63e51bf8cea2140



Source: http://academictorrents.com/details/b7579be97c2f01e4efadb0b6b06f0d071afeaac9



Source: http://academictorrents.com/details/05a8fe3f7e3420df6b83f40b0cbccd05e591d9f4



Source: http://academictorrents.com/details/e24c15ce89cac9c380284595d1d8a475cb485e28



Dumps of DISCOGS.ORG Metadata (2008-Present) by DISCOGS.ORG. Type: collection. Items: 118. Views: 2,701.

This is an unofficial mirror of the DISCOGS.ORG data collection, which is located at http://www.discogs.com/data/ . Discogs, short for discographies, is a website and database of information about audio recordings, including commercial releases, promotional releases, and bootleg or off-label releases. The Discogs servers, currently hosted under the domain name discogs.com, are owned by Zink Media, Inc., and are located in Portland, Oregon, USA. Discogs is one of the largest online databases of...



Source: http://academictorrents.com/details/8dcd401c1b3db696fc2f04d3b49c850f0e5cc309



A 2018 Anonymous FTP Server Census by Ben. Collection: The Dataset Collection. Type: data. Views: 2,495. Favorites: 0. Comments: 0.

Ben's FTP List (May, 2018): This is a trimmed down list of all servers that are online and allow anonymous connections. There are 244,441 FTP servers in total. Please note: it is unknown whether these servers remained online after the scan or are behind dynamic IP addresses, so their availability after this list was compiled cannot be guaranteed. This census is provided as a series of bzip2 files, which can be read directly by utilities such as zmore and zless. It is intended to be used both for...



[Coursera] Exploring Quantum Physics by University of Maryl. Collection: Academic Torrents. Type: movies. Views: 2,481. Favorites: 0. Comments: 0.

An introduction to quantum physics with emphasis on topics at the frontiers of research, and developing understanding through exercise. Quantum physics is the foundation for much of modern technology, provides the framework for understanding light and matter from the subatomic to macroscopic domains, and makes possible the most precise measurements ever made. More than just a theory, it offers a way of looking at the world that grows richer with experience and practice. Our course will provide...

Source: http://academictorrents.com/details/f24122f15283757aa8a9bf9cb638db266273442d



Internet Census 2012 by Anonymous. Type: collection. Items: 15. Views: 2,437.

Abstract: While playing around with the Nmap Scripting Engine (NSE) we discovered an amazing number of open embedded devices on the Internet. Many of them are based on Linux and allow login to standard BusyBox with empty or default credentials. We used these devices to build a distributed port scanner to scan all IPv4 addresses. These scans include service probes for the most common ports, ICMP ping, reverse DNS and SYN scans. We analyzed some of the data to get an estimation of the IP address...



Source: http://academictorrents.com/details/91bc48e6c8341de198c970acccdc87199391ab46



Source: http://academictorrents.com/details/4281ef52a65d26489e686a0540d86abd4161b88e



Source: http://academictorrents.com/details/412d52b0bfcf2a8bf3201a28c2ba04b6dff5b290



The last three or four decades have seen a remarkable evolution in the institutions that comprise the modern monetary system. The financial crisis of 2007-2009 is a wakeup call that we need a similar evolution in the analytical apparatus and theories that we use to understand that system. Produced and sponsored by the Institute for New Economic Thinking, this course is an attempt to begin the process of new economic thinking by reviving and updating some forgotten traditions in monetary thought...

Source: http://academictorrents.com/details/970f4ee32d1a49168466a517b3dcd0442b043abc



Source: http://academictorrents.com/details/b63a566df824b39740eb9754e4fe4c0140306f4b



Topics: Coursera, qoptintro

Source: http://academictorrents.com/details/b1f4d8ccee24aa956f6226607612ce8867b235a3



Source: http://academictorrents.com/details/b02188bbb764f7f5fdd499c5144add35f56ed3e7



Source: http://academictorrents.com/details/be19083019ae3954680733d394e5e5b5b3572a15



Source: http://academictorrents.com/details/8033015783d2df3e8b33a343edb8cea9a0b8319a



Source: http://academictorrents.com/details/10d1bf7161a1b3ea70697cd61834ceea6c3d1f87



Source: http://academictorrents.com/details/066a55d231d3918ad3de994e6211bb99417bcdf0



Source: http://academictorrents.com/details/ec1c86afefda42f4b36c34ae7b235ef0bfd6b9d3



Source: http://academictorrents.com/details/de34574326abc4666c7ede41d0205a4a2129bf85



Outline: This is an introductory course in machine learning (ML) that covers the basic theory, algorithms, and applications. ML is a key technology in Big Data, and in many financial, medical, commercial, and scientific applications. It enables computational systems to adaptively improve their performance with experience accumulated from the observed data. ML has become one of the hottest fields of study today, taken up by undergraduate and graduate students from 15 different majors at...

Source: http://academictorrents.com/details/8190b5122515ab158cd29ccdb33ea946a3e529f4



[Coursera] Clinical Problem Solving by Catherine R. Lucey, MD (University of California San Francisco). Collection: Academic Torrents. Type: movies. Views: 1,697. Favorites: 0. Comments: 0.

Participants will learn how to move efficiently from patient signs and symptoms to a rational and prioritized set of diagnostic possibilities and will learn how to study and read to facilitate this process. Clinical problem solving or diagnostic reasoning is the skill that physicians use to understand a patient’s complaints and then to identify a short, prioritized list of possible diagnoses that could account for those complaints. This differential diagnosis then drives the choice of...

Source: http://academictorrents.com/details/dae02888e2fb6484a7b471cb7977eb859aba4831



Large sets of malware examples for the purposes of research, comparison, and history. This is the alphabetical set.



Source: http://academictorrents.com/details/d0262f08717ba584551357e4bf8c1945dc6d6935



Source: http://academictorrents.com/details/3e6f1876bbd46780602e72f4b122329fb668bd2c



[Coursera] The Hardware/Software Interface by Gaetano Borriello, Luis Ceze (University of Washington). Collection: Academic Torrents. Type: movies. Views: 1,600. Favorites: 1. Comments: 0.

Examines key computational abstraction levels below modern high-level languages. From Java/C to assembly programming, to basic processor and system organization. This course examines key computational abstraction levels below modern high-level languages; number representation, assembly language, introduction to C, memory management, the operating-system process model, high-level machine architecture including the memory hierarchy, and how high-level languages are implemented. We will develop...

Source: http://academictorrents.com/details/f1384286c8581bffba11e378fdb37608e649d82a



Source: http://academictorrents.com/details/78515f90de063ffc144be5e7e726c03849b4e0ed



Source: http://academictorrents.com/details/dfc1ddde962101f00ef9764b91181bd6bb5c9e93



Imagenet Full (Fall 2011 release) by Jia Deng; Wei Dong; Richard Socher; Li-Jia Li; Kai Li; Li Fei-Fei. Collection: Academic Torrents. Type: data. Views: 1,476. Favorites: 0. Comments: 0.

ImageNet is an image dataset organized according to the WordNet hierarchy. Each meaningful concept in WordNet, possibly described by multiple words or word phrases, is called a "synonym set" or "synset". There are more than 100,000 synsets in WordNet, the majority of them nouns (80,000+). In ImageNet, we aim to provide on average 1000 images to illustrate each synset. Images of each concept are quality-controlled and human-annotated. Upon its completion, we hope ImageNet will...

Topics: imagenet, deep learning

Source: http://academictorrents.com/details/564a77c1e1119da199ff32622a1609431b9f1c47



DOI URLs. Collection: The Dataset Collection. Type: software. Views: 1,453. Favorites: 1. Comments: 0.

All the "journal article" DOIs from CrossRef's OAI-PMH server; URLs of just under 50 million journal articles.

Topics: doi, dataset



Source: http://academictorrents.com/details/d180bcd510aeec3a20044a0946ac658b9ab30760



UPC Database 2007-06-01. Collection: The Dataset Collection. Type: data. Views: 1,418. Favorites: 0. Comments: 0.

Database of UPC product codes, as compiled by upcdatabase.com

Topics: UPC, Universal Product Code, barcode



FanFiction Collection (Repack). Collection: The Dataset Collection. Type: data. Views: 1,417. Favorites: 1. Comments: 0.

A collection of fanfiction stories from fanfiction.net, repacked for easier bulk collecting and archiving. Contains many tens of thousands of fan fiction stories.



[Coursera] Computer Architecture by David Wentzlaff (Princeton University). Collection: Academic Torrents. Type: movies. Views: 1,390. Favorites: 0. Comments: 0.

About this course: In this course, you will learn to design the computer architecture of complex modern microprocessors. Introduction, Instruction Set Architecture, and Microcode: This lecture will give you a broad overview of the course, as well as the description of architecture, micro-architecture and instruction set architectures. Pipelining Review: This lecture covers the basic concept of pipeline and two different types of hazards. Cache Review: This lecture covers control...

Source: http://academictorrents.com/details/53bae6d22f3b6e692673f9335e0a0198c1618426



[Coursera] Algorithms Part I by Kevin Wayne; Robert Sedgewick (Princeton University). Collection: Academic Torrents. Type: movies. Views: 1,377. Favorites: 2. Comments: 0.

About this course: This course covers the essential information that every serious programmer needs to know about algorithms and data structures, with emphasis on applications and scientific performance analysis of Java implementations. Part I covers elementary data structures, sorting, and searching algorithms. Part II focuses on graph- and string-processing algorithms. Union-Find: We illustrate our basic approach to developing and analyzing algorithms by considering the dynamic...

Source: http://academictorrents.com/details/a2934d859a14c07a80092ab03552310838f66590



This subject is aimed at students with little or no programming experience. It aims to provide students with an understanding of the role computation can play in solving problems. It also aims to help students, regardless of their major, to feel justifiably confident of their ability to write small programs that allow them to accomplish useful goals. The class will use the Python programming language.

Source: http://academictorrents.com/details/f7c9a9db5d0d9a1e0f2f383ec629fefffa475ae5



Statistical Machine Learning CMU Spring 2016 by Larry Wasserman. Collection: Academic Torrents. Type: movies. Views: 1,270. Favorites: 0. Comments: 0.

Statistical Machine Learning is a second graduate level course in advanced machine learning, assuming students have taken Machine Learning (10-715) and Intermediate Statistics (36-705). The course covers methodology and theoretical foundations. Topics: Function Spaces, Concentration of Measure, Linear Regression, Nonparametric Regression, Linear Classification, Nonparametric Classification, Minimax Theory, Density Estimation, Nonparametric Bayes, Clustering, Graphical Models, Dimension Reduction, Random Matrix...

Source: http://academictorrents.com/details/07f1555918ed051809f0075fedc0cd469a194c93



Source: http://academictorrents.com/details/560d07faaf09f640fea96b3650874e2903cbc639



Topics: text mining, analytics

Source: http://academictorrents.com/details/e2c129491a3841bfac5d7b08b41ad79387132a23



http://www.cs.cmu.edu/~tom/10601_fall2012/lectures.shtml

Topics: machine learning, Tom Mitchell

Source: http://academictorrents.com/details/35b6b8bf0c2931ba7ecd8a1a8e65fa32f3e7473f



Source: http://academictorrents.com/details/3ba301f087680f41a88224c49c01218b24de868b



Volume I - mainly mechanics, radiation, and heat
Volume II - mainly electromagnetism and matter
Volume III - quantum mechanics

Source: http://academictorrents.com/details/c5af268ec55cf2d3b439e7311ad43101ba8322eb



Source: http://academictorrents.com/details/459e24d28a6abce04cc9fd6e9a148c86dcaac19c



[Coursera] Statistics: Making Sense of Data by Alison Gibbs, Jeffrey Rosenthal (University of Toronto). Collection: Academic Torrents. Type: movies. Views: 1,012. Favorites: 1. Comments: 0.

This course is an introduction to the key ideas and principles of the collection, display, and analysis of data to guide you in making valid and appropriate conclusions about the world. We live in a world where data are increasingly available, in ever larger quantities, and are increasingly expected to form the basis for decisions by governments, businesses, and other organizations, as well as by individuals in their daily lives. To cope effectively, every informed citizen must be statistically...

Source: http://academictorrents.com/details/a0cbaf3e03e0893085b6fbdc97cb6220896dddf2



Source: http://academictorrents.com/details/ed196d080a2208727a225ab5e7a5630e5bf53be4



Coursera - Economics of Money and Banking Part Two by Perry G Mehrling (Columbia University). Collection: Academic Torrents. Type: movies. Views: 982. Favorites: 0. Comments: 0.