Meta-Data of Books
If you are using our datasets, please cite the following papers:
- Mengting Wan, Julian McAuley, "Item Recommendation on Monotonic Behavior Chains", in RecSys'18. [bibtex]
- Mengting Wan, Rishabh Misra, Ndapa Nakashole, Julian McAuley, "Fine-Grained Spoiler Detection from Large-Scale Review Corpora", in ACL'19. [bibtex]
Detailed book graph
Detailed book graph
- ~2gb, about 2.3m books
- Download link
- Example:
{'isbn': '',
'text_reviews_count': '7',
'series': ['189911'],
'country_code': 'US',
'language_code': 'eng',
# top user-generated shelves for a book, used to define genres by goodreads 'popular_shelves': [{'count': '58', 'name': 'to-read'},
{'count': '15', 'name': 'fantasy'},
{'count': '6', 'name': 'fiction'},
{'count': '5', 'name': 'owned'}, ...],
'asin': 'B00071IKUY',
'is_ebook': 'false',
'average_rating': '4.03',
'kindle_asin': '',
# a list of books that users who like the current book also like 'similar_books': ['19997', '828466', '1569323', '425389', '1176674', '262740', '3743837',
'880461', '2292726', '1883810', '1808197', '625150', '1988046', '390170',
'2620131', '383106', '1597281'],
'description': 'Omnibus book club edition containing the Ladies of Madrigyn and the Witches of Wenshar.',
'format': 'Hardcover',
'link': 'https://www.goodreads.com/book/show/7327624-the-unschooled-wizard',
'authors': [{'author_id': '10333', 'role': ''}],
'publisher': 'Nelson Doubleday, Inc.',
'num_pages': '600',
'publication_day': '',
'isbn13': '',
'publication_month': '',
'edition_information': 'Book Club Edition',
'publication_year': '1987',
'url': 'https://www.goodreads.com/book/show/7327624-the-unschooled-wizard',
'image_url': 'https://images.gr-assets.com/books/1304100136m/7327624.jpg',
'book_id': '7327624',
'ratings_count': '140',
'work_id': '8948723',
'title': 'The Unschooled Wizard (Sun Wolf and Starhawk, #1-2)',
'title_without_series': 'The Unschooled Wizard (Sun Wolf and Starhawk, #1-2)'}
Detailed information of authors
Detailed information of authors
- Download link:
- Example:
{'average_rating': '3.98',
'author_id': '604031',
'text_reviews_count': '7',
'name': 'Ronald J. Fields',
'ratings_count': '49'} .
Detailed information of works
Detailed information of works
- This is the abstract version of a book regardless any particular editions
- Download link:
- Example:
{'books_count': '1',
'reviews_count': '6',
'original_publication_month': '8',
'default_description_language_code': '',
'text_reviews_count': '1',
'best_book_id': '5333265',
'original_publication_year': '1984',
'original_title': 'W. C. Fields: A Life on Film',
'rating_dist': '5:1|4:1|3:1|2:0|1:0|total:3',
'default_chaptering_book_id': '',
'original_publication_day': '',
'original_language_id': '',
'ratings_count': '3',
'media_type': 'book',
'ratings_sum': '12',
'work_id': '5400751'}
Detailed information of book series
Detailed information of book series
- Download link:
- goodreads_book_series.json.gz
- Note: Unfortunately, the series id included here cannot be used for URL hack (We will see if things can be fixed in the future)
- goodreads_book_series.json.gz
- Example:
{'numbered': 'true',
'note': '',
'description': 'Plot-wise, "Crowner\'s Crusade" is a prequel to the series, but #15 in publication order.',
'title': 'Crowner John Mystery',
'series_works_count': '15',
'series_id': '169353',
'primary_work_count': '15'}
Extracted fuzzy book genres:
Extracted fuzzy book genres:
- This a very fuzzy version of book genres. These tags are extracted from users' popular shelves by a simple keyword matching process.
- Download link:
- Example:
{'book_id': '7327624',
'genres': {'fantasy, paranormal': 31,
'fiction': 8,
'mystery, thriller, crime': 1,
'poetry': 1}}