We've recently expanded coverage of 'secForms' back to 1994, which predates the introduction of XBRL in 2005. Over the coming weeks and months, you will see these data flowing into organization profiles. To view examples of secForms already in the graph, search for organizations that have secForms: type:Organization has:secForms

type:Research

by Kris Negulescu

We've added type:Research to the graph. Each entity is a journal article or research publication from a validated source. Data will continue to flow for the next month or so as we integrate all of the available metadata from OpenAlex.org. Future phases will combine OpenAlex metadata with the full text of the publication extracted from HTML or PDF (when it is publicly accessible) and will include data provenance and fair use details. Try a search for journal articles that include AI in the title: type:Research title:"AI" sortBy:date

Convert JSON to CSV

by Kris Negulescu

We built a fast JSON to CSV converter that supports automatic list detection, a column selection UI, and offline mode. Try it out here.
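To give a rough idea of what automatic list detection and column selection can mean in practice, here is a minimal Python sketch. It is not the converter's actual implementation. The function names `find_record_list` and `json_to_csv` are hypothetical, and the detection rule (use the first list made up entirely of objects) is an assumption chosen for illustration.

```python
import csv
import io
import json


def find_record_list(node):
    """Return the first non-empty list of dicts found in a depth-first walk, or None."""
    if isinstance(node, list):
        if node and all(isinstance(item, dict) for item in node):
            return node
        for item in node:
            found = find_record_list(item)
            if found is not None:
                return found
    elif isinstance(node, dict):
        for value in node.values():
            found = find_record_list(value)
            if found is not None:
                return found
    return None


def json_to_csv(text, columns=None):
    """Convert the first detected list of objects in a JSON string to CSV text."""
    records = find_record_list(json.loads(text)) or []
    if columns is None:
        # Default to every key seen in any record, in first-seen order.
        columns = list(dict.fromkeys(key for row in records for key in row))
    out = io.StringIO()
    # Keys outside the selected columns are dropped; missing keys become empty cells.
    writer = csv.DictWriter(
        out, fieldnames=columns, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return out.getvalue()
```

Calling `json_to_csv(doc, columns=["name"])` keeps only the `name` column, which mirrors what a column selection UI would do behind the scenes.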

Diffbot ChatGPT Plug-in

by Kris Negulescu

Diffbot's ChatGPT plugin is available to everyone free of charge. The only requirement is a subscription to ChatGPT Plus for access to plugins. Get the Diffbot plugin by searching for "Diffbot" in the ChatGPT plugin store under the GPT-4 tab. Click here to find Diffbot on ChatGPT.

NL API Summarization

by Kris Negulescu

The NL API can now summarize text. Summarization uses the Diffbot LLM for fast inference, and there is no truncation: long documents are broken into chunks, each chunk is summarized individually, and a final summary is generated from these chunk summaries.

See the NL API summarization demo here.
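The chunk-then-combine flow can be sketched in a few lines of Python. This is an illustration of the general pattern only, not the NL API's internals. The `summarize` argument is a hypothetical caller-supplied function that maps text to a summary, the character-based chunk size is an assumption, and so is the shortcut of summarizing a single-chunk document directly.

```python
def split_into_chunks(text, max_chars):
    """Split text on whitespace into chunks of roughly max_chars.

    A single word longer than max_chars is kept whole in its own chunk.
    """
    chunks, current = [], ""
    for word in text.split():
        candidate = f"{current} {word}".strip()
        if len(candidate) > max_chars and current:
            chunks.append(current)
            current = word
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def summarize_long_text(text, summarize, max_chars=2000):
    """Summarize each chunk, then summarize the joined chunk summaries.

    `summarize` is a hypothetical callable: str -> str.
    """
    chunks = split_into_chunks(text, max_chars)
    if not chunks:
        return ""
    if len(chunks) == 1:
        # Assumption: a short document needs only one pass.
        return summarize(chunks[0])
    partial = [summarize(chunk) for chunk in chunks]
    return summarize("\n".join(partial))
```

Because no chunk is discarded, every part of the input contributes to the final summary, which is the point of avoiding truncation.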

Enhance now generates Coverage Reports in the background rather than during the execution of the bulkjob, which speeds up bulkjob completion. Previously, users could download the report immediately after the bulkjob completed. Users should now poll the bulkjob status to confirm that the coverage report has been generated. The report status is available under content.reports[].status; a value of COMPLETE indicates that the report has been generated.

The changes are described here: https://docs.diffbot.com/reference/bulkjobstatus
*Please note that this change DOES NOT affect the behavior of bulkjob results - only the coverage report.
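A polling loop for the coverage report might look like the Python sketch below. It assumes only what is stated above: the status payload exposes content.reports[].status, and COMPLETE means the report is ready. The `fetch_status` argument is a hypothetical caller-supplied function that returns the parsed JSON body of the bulkjob status response. The interval, the attempt limit, and treating an empty reports list as "not ready" are all assumptions.

```python
import time


def reports_complete(status):
    """True when content.reports[] is non-empty and every entry is COMPLETE."""
    reports = status.get("content", {}).get("reports", [])
    return bool(reports) and all(r.get("status") == "COMPLETE" for r in reports)


def wait_for_reports(fetch_status, interval=5.0, max_attempts=60, sleep=time.sleep):
    """Poll fetch_status() until the coverage reports are complete.

    fetch_status is hypothetical: it should return the bulkjob status
    response as a dict. Raises TimeoutError if the limit is reached.
    """
    for attempt in range(max_attempts):
        status = fetch_status()
        if reports_complete(status):
            return status
        if attempt < max_attempts - 1:
            sleep(interval)
    raise TimeoutError("coverage report not COMPLETE after polling")
```

Passing the fetch function in keeps the loop independent of any particular HTTP client or authentication scheme; consult the bulkjob status reference linked above for the actual request details.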

We have trained a new LLM to generate Organization.summary. We can generate about 5 million summaries per month using this new approach. We have prioritized private corporation entities to update first. These new summaries began appearing in the KG on July 7th. Please share your feedback and coverage requests with [email protected].