Federal court documents are currently made available to the public through a crufty system called PACER. For eight cents per page, users can download filings and other relevant documents associated with individual cases. PACER is intended to open case law and court activity to broad public scrutiny, but the system's obfuscated design and its paywall significantly undermine its efficacy.

The content hosted on PACER can be freely redistributed by third parties because copyright is not applicable to court documents, but the access fees make it costly and difficult for data archivers to assemble their own comprehensive mirrors that would offload the hosting burden and make the content more easily accessible to the general public. Princeton's Center for Information Technology Policy (CITP) is launching a new project to tackle this problem.

A team led by CITP director Ed Felten has devised a novel means of boosting the availability of PACER documents outside of the paywall. They have created a new Firefox extension called RECAP that seamlessly replicates PACER content and uploads it to a mirror hosted by the Internet Archive. When RECAP users browse the PACER site, the content that they pay to view will be uploaded to the mirror by the Firefox extension. Users will get free access to the documents that are already hosted by the mirror.

Over time, free PACER content will accumulate at the Internet Archive's mirror, making it unnecessary for additional users to pay PACER for access to those files. The unrestricted availability of the mirrored legal documents will empower legal researchers and members of the public who can't simply pass the access costs along to clients as most lawyers do.

The Administrative Office of the United States Courts contends that access fees are needed in order to fund the bandwidth requirements and ongoing maintenance of the system. Crowdsourcing distribution and offloading the data to public mirrors like the Internet Archive presents a practical way to alleviate the strain on PACER while making the data more broadly available.

The RECAP project could also illuminate potential solutions to the problems that are blocking a more complete PACER overhaul. Despite growing pressure from Congress to reform the PACER system and make data available at no cost, the courts are still reluctant to make major changes to the system because of uncertainty about the cost and the technical challenges of hosting and distributing the data. The RECAP project's effort to mirror a portion of PACER could help answer those questions and provide a real-world model that demonstrates the viability of open access.

Our own Tim Lee, who wrote about the problems with PACER in article earlier this year, is one of the developers behind the RECAP project. Lee participated in the creation of the Firefox extension and some related backend infrastructure in collaboration with Harlan Yu and Steve Schultze. The Firefox extension is open source software and is distributed under the terms of GNU's General Public License. The extension is available for download from the RECAP website, which launched today. The website also has an introductory video and additional details about the project's goals and philosophy.

Yu offers additional commentary about RECAP at the CITP's Freedom to Tinker blog. He envisions a future where court documents are freely available to everyone and are annotated with software-friendly metadata that makes the information easy to process, index, and programmatically analyze.

"With today's technologies, government transparency means much more than the chance to read one document at a time. Citizens today expect to be able to download comprehensive government datasets that are machine-processable, open, and free," he wrote. "Today, we are excited to announce the public beta release of RECAP, a tool that will help bring an unprecedented level of transparency to the US federal court system. RECAP is a plug-in for the Firefox web browser that makes it easier for users to share documents they have purchased from PACER, the court's pay-to-play access system."

As Lee explained in his previous article, the revenue generated by PACER's paywall far exceeds the amount of money needed to run the system. The operating costs could be further reduced through much-needed consolidation and other changes. The RECAP system is an important vehicle for encouraging open access and moving the system forward. It also reflects the growth of an emerging movement that seeks to boost government transparency through data availability.

A number of similar projects have popped up recently with the goal of making the inner workings of the government visible to regular citizens via the Internet. An example is OpenRegs.com, a website established in June by Mercatus Center researcher Jerry Brito and programmer Peter Snyder to help people navigate federal regulations. The government itself is also pushing a number of important data transparency projects, such as the new Data.gov website that was launched in may by Federal CIO Vivek Kundra to aggregate government data sets in machine-readable formats.

Listing image by Brian Turner