Google is putting A.I. and machine learning technologies into the hands of journalists. The company this morning announced a suite of new tools, Journalist Studio, that will allow reporters to do their work more easily. At launch, the suite includes a host of existing tools as well as two new products aimed at helping reporters search across large documents and visualizing data.
The first tool is called Pinpoint and is designed to help reporters work with large file sets — like those that contain hundreds of thousands of documents.
Pinpoint will work as an alternative to using the “Ctrl + F” function to manually seek out specific keywords in the documents. Instead, the tool takes advantage of Google Search and its A.I.-powered Knowledge Graph, along with optical character recognition and speech-to-text technologies.
It’s capable of sorting through scanned PDFs, images, handwritten notes, and audio files to automatically identify the key people, organizations, and locations that are mentioned. Pinpoint will highlight these terms and even their synonyms across the files for easy access to the key data.
The tool has already been put to use by journalists at USA Today, for its report on 40,600 COVID-19-related deaths tied to nursing homes. Reveal also used Pinpoint look into the COVID-19 “testing disaster” in ICE detention centers. And The Washington Post used it for a piece about the opioid crisis.
Because it’s also useful for speeding up research, Google notes Pinpoint can be used for shorter-term projects, as well — like Philippines-based Rappler’s examination of CIA reports from the 1970s or Mexico-based Verificado MX’s fast fact checking of the government’s daily pandemic updates.
Pinpoint is available now to interested journalists, who can sign up to request access. The tool currently supports seven languages: English, French, German, Italian, Polish, Portuguese, and Spanish.
Google has also partnered with The Center for Public Integrity, Document Cloud, Stanford University’s Big Local News program and The Washington Post to create shared public collections that are available to all users.
The second new tool being introduced today is The Common Knowledge Project, still in beta.
The tool allows journalists to explore, visualize and share data about important issues in their local communities by creating their own interactive charts using thousands of data points in a matter minutes, the company says.
These charts can then be embedded in reporters’ stories on the web or published to social media.
This particular tool was built by the visual journalism team at Polygraph, supported by the Google News Initiative. The data for use in The Common Knowledge Project comes from Data Commons, which includes thousands of public datasets from organizations like the U.S. Census and the CDC.
At launch, the tool offers U.S. data on issues including demographics, economy, housing, education, and crime.
As it’s still in beta testing, Google is asking journalists to submit their ideas for how it can be improved.
Google will demonstrate and discuss these new tools in more detail during a series of upcoming virtual events, including the Online News Association’s conference on Thursday, October 15. The Google News Initiative training will also soon host a six-part series focused on tools for reporters in seven different languages across nine regions, starting the week of October 20.
The new programs are available on the Journalist Studio website, which also organizes other tools resources for reporters, including Google’s account security system, the Advanced Protection Program; direct access to the Data Commons; DataSet Search; a Fact Check Explorer; a tool for visualizing data using customizable templates, Flourish; the Google Data GIF Maker; Google Public Data Explorer; Google Trends; DIY VPN Outline; DDoS defense tool, Project Shield; and tiled cartogram maker Tilegrams.
The site additionally points to other services from Google, like Google Drive, Google Scholar, Google Earth, Google News, and others, as well as training resources.