HN
Today

Project Gutenberg – keeps getting better

Project Gutenberg, the venerable digital library, continues its decades-long mission to provide free access to public domain literature, now with a refreshed website and improved e-reader compatibility. Hacker News celebrates this foundational internet project, praising its volunteer spirit and engaging directly with its active team members who dropped into the discussion. The conversation ranges from its impressive 1971 origins to modern formatting challenges and intriguing download anomalies, showcasing both reverence for its past and excitement for its future.

286
Score
96
Comments
#1
Highest Rank
6h
on Front Page
First Seen
May 15, 4:00 PM
Last Seen
May 15, 9:00 PM
Rank Over Time
211111

The Lowdown

Project Gutenberg is a pioneering digital library, offering over 75,000 free ebooks, primarily focusing on older works where U.S. copyright has expired. Founded in 1971 by Michael S. Hart, it predates the mainstream internet and represents one of the earliest efforts to democratize access to literature. The project is entirely volunteer-based, with thousands contributing to digitizing and proofreading texts.

  • It boasts a collection of over 75,000 free ebooks, accessible without fees, registration, or dedicated apps.
  • The project emphasizes its 50+ year history, pioneering free ebooks since 1971.
  • Volunteer efforts, particularly through Distributed Proofreaders, are central to its operation and growth.
  • Recent site improvements include better mobile styling, EPUB3 support for most books, and plans for PDF availability.
  • It provides machine-readable metadata in XML/RDF format and content feeds, encouraging developers to use these instead of crawling the site.
  • The team actively maintains the site, addressing bugs and responding to user feedback, as evidenced by staff engagement in the discussion.

Project Gutenberg remains a crucial public good, a testament to the power of volunteerism, and an essential resource for preserving and disseminating human knowledge. Its continuous evolution, while staying true to its core mission, ensures its relevance for future generations of readers.

The Gossip

Venerating Volunteers & Vibrant Vitals

Many commenters expressed deep appreciation for Project Gutenberg's enduring mission and the tireless work of its volunteers and staff. They highlighted its foundational role in the internet's history, with some surprised by its 1971 origins. The presence of Project Gutenberg staff (JSeiko, gluejar) in the comments, actively responding to feedback and sharing insights, further amplified the positive sentiment and showcased the project's vibrant, engaged community.

Formatting Finesse & Future Formats

A significant portion of the discussion revolved around ebook formats. While some remembered Project Gutenberg for its plaintext offerings, many praised the recent addition and improvement of EPUBs. There was a strong desire for PDF support, which staff confirmed is 'coming this year.' Commenters also frequently recommended `standardebooks.org` as a source for beautifully formatted versions of PG texts, illustrating a shared community goal for high-quality digital editions.

Accessing Archives & Agent Avenues

Users discussed the practicalities of accessing Project Gutenberg's vast library. Some lamented the lack of direct integration with popular e-readers like Kindle, leading to friction in downloading. There was also a strong emphasis on how developers and 'agents' should access the data, with the PG team clearly pointing to machine-readable metadata feeds and tarballs rather than site crawling, to manage traffic and ensure stability.

Bot Blips & Bizarre Books

A peculiar observation was the top downloaded book, 'Concrete Construction: Methods and Costs,' which significantly outranked classics like Moby Dick. This sparked speculation among commenters and even the Project Gutenberg staff that it was likely due to bot activity rather than a genuine surge in interest in early 20th-century concrete guides. This highlighted the challenges of distinguishing legitimate usage from automated traffic.

Geographical Gating & General Gratitude

A brief but notable theme involved past geo-blocking issues, particularly in Germany, which the Project Gutenberg team confirmed had been resolved. This, along with general expressions of thanks for the site's usability (even with JS disabled) and continuous improvements, underlined the global impact and appreciation for the project.