Book: Mining of Massive Datasets (free download)


Anand Rajaraman
Anand Rajaraman

Jeffrey Ullman
Jeffrey Ullman
This book evolved from material developed over several years by Anand Rajaraman and Jeff Ullman for a one-quarter course at Stanford. The course CS345A, titled "Web Mining," was designed as an advanced graduate course, although it has become accessible and interesting to advanced undergraduates. At the highest level of description, this book is about data mining. However, it focuses on data mining of very large amounts of data, that is, data so large it does not fit in main memory. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web. Further, the book takes an algorithmic point of view: data mining is about applying algorithms to data, rather than using data to "train" a machine-learning engine of some sort.
Download Mining of Massive Datasets, (PDF, 340 pages, 2MB)
You can find materials from past offerings of CS345A at:
infolab.stanford.edu/~ullman/mining/mining.html
There, you will find slides, homework assignments, project requirements etc
The book canbe downloaded in full form or chapter by chapter.

0 comments:

Post a Comment

Translate this blog:

Copyrights

Disclaimer

No responsibility is taken for any potential inaccuracies and/or errors in the text, and any damages that are incurred through the use of this material.Most of the material is sourced from internet and taken from other scientific blogs(text or idea;as well as pictures)& if you find any of your copyright material which you donot wish to appear on this blog,kindly inform at the e-mail id:ooogyx@gmail.com and it will be promptly removed.All the opinions expressed on this blog are solely of the author and not of any organization or institution.

  © MAD_HELIX

Design by OogYx