Publisher Digitization Service
Document Actions
Many publishers are rushing to digitize archived, legacy content from books, magazines and other backlisted titles. Hardly viewed as out-of-date, older articles, particularly those from scientific, technical and professional journals, are still in high-demand. A recent survey by the British Library pointed out that some 15% of articles read by scientists are older than five years. Trade and consumer magazines are also digitizing backfiles as they relaunch or rebrand Web sites to draw more traffic and increase subscription rates.
In fact, recent initiatives by Google, Simon & Schuster and the British Library are just a small sampling of large-scale digitization projects currently underway.
The British Library's Publisher Digitization Service, for example, is a "one-stop-shop" for publishers looking to digitize their archival material. The service addresses a major need for many publishers, finding and locating original content. As the library of record for the United Kingdom, the British Library holds vast serial collections, which enables publishers to save both time and money in locating materials.
Publisher Digitization Service Deep-tagging History
Innodata Isogen is helping a national U.S. newspaper publisher convert its historical archives to a PDF and html format that can be purchased online via subscription or pay-per-view. The scope is vast, encompassing more than 150 years worth of newspapers and millions of newspaper pages and individual articles. The Innodata Isogen team is improving the searchability of the back issues by deep tagging the articles with metadata, using the original newspaper index as their guide.
To speed up the project, we automated the indexing process via a tool, called "Concept Search," achieving an accuracy rate of close to 95 percent. Working closely with the automated process, our content architects validate the accuracy and enhance search results by inserting applicable terms and keywords.
Proven Partner of digitization services
But digitizing backfiles is just the start of the process of bringing content online. And that's why the British Library has forged a partnership with Innodata Isogen to convert digitized material into an XML-based format. Converting digitized content to XML enables it to be tagged with metadata, vastly enhancing its searchability. XML conversion also supports content repurposing and the ability to print on demand - important attributes of an on-line archive.
With its extensive experience converting content to XML, Innodata Isogen can deliver four key attributes for a large-scale data conversion projects:
Scalability: With its extensive production infrastructure, Innodata Isogen can easily ramp up production capacity to support higher throughputs.
Experience: Innodata Isogen has completed four of the world's five largest data conversion efforts, including production runs for The New York Times and Washington Post.
Cutting Edge Tools and Technology: The company has developed a number of automated indexing programs that dramatically expedite indexing and tagging data.
Quality: Established quality assurance protocols ensure that the accuracy of the work is carefully checked against the original version.
The British Library and Innodata Isogen are poised to offer a comprehensive array of digitization and conversion services that will vastly enhance the ability of readers to search for content from journal publishers.
For more information on how you can take advantage of our combined expertise, please contact us at innodata-isogen.com.




