Convert JSON to CSV
by Kris Negulescu
We built a fast JSON to CSV converter that supports automatic list detection, a column selection UI, and offline mode. Try it out here.
Diffbot's ChatGPT plugin is available to everyone free of charge. The only requirement is a subscription to ChatGPT Plus for access to plugins. Get the Diffbot plugin by searching for "Diffbot" in the ChatGPT plugin store under the GPT-4 tab. Click here to find Diffbot on ChatGPT.
The NL API now has the ability to summarize text. Summarization in the NL API uses the Diffbot LLM for fast inference. There is no truncation. Long documents are broken into chunks and summarized individually. A final summary is generated from these chunk summaries.
See the NL API summarization demo here.
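The chunk-then-combine approach described above can be sketched roughly as follows. This is an illustration only, not Diffbot's implementation: the `summarize` callable stands in for whatever model or API call produces a summary, and the character-based chunk limit (`max_chars`) and both function names are assumptions made for the example.

```python
from typing import Callable, List


def split_into_chunks(text: str, max_chars: int) -> List[str]:
    """Split text on whitespace into chunks of at most max_chars characters.

    A single word longer than max_chars becomes its own chunk.
    """
    chunks: List[str] = []
    current: List[str] = []
    length = 0
    for word in text.split():
        added = len(word) + (1 if current else 0)  # +1 for the joining space
        if current and length + added > max_chars:
            chunks.append(" ".join(current))
            current, length = [word], len(word)
        else:
            current.append(word)
            length += added
    if current:
        chunks.append(" ".join(current))
    return chunks


def summarize_long_text(
    text: str,
    summarize: Callable[[str], str],  # hypothetical stand-in for the model call
    max_chars: int = 2000,            # assumed limit, not a documented value
) -> str:
    """Summarize each chunk, then summarize the concatenated chunk summaries."""
    chunks = split_into_chunks(text, max_chars)
    if not chunks:
        return ""
    if len(chunks) == 1:
        return summarize(chunks[0])
    partial_summaries = [summarize(chunk) for chunk in chunks]
    return summarize(" ".join(partial_summaries))
```

Because no chunk is dropped, the final summary draws on the whole document rather than only on the portion that fits in a single model call.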
Enhance now generates Coverage Reports in the background rather than during bulkjob execution, which speeds up bulkjob completion. Previously, users could download the report immediately after the bulkjob completed. Users should now poll the bulkjob status to confirm that the coverage report has been generated. The status is available under content.reports[].status and has the value COMPLETE once the report is ready.
The changes are described here: https://docs.diffbot.com/reference/bulkjobstatus
*Please note that this change DOES NOT affect the behavior of bulkjob results - only the coverage report.
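A polling loop for the coverage report might look like the sketch below. It makes several assumptions: `fetch_status` is a hypothetical callable that returns the parsed JSON of the bulkjob status response, `content` is taken to be an object holding a `reports` array (inferred from the content.reports[].status path above), and the interval and attempt limits are arbitrary. Check the linked bulkjob status reference for the actual response shape.

```python
import time
from typing import Any, Callable, Dict


def coverage_reports_complete(status_payload: Dict[str, Any]) -> bool:
    """Return True when at least one report exists and all have status COMPLETE.

    Assumes the payload looks like {"content": {"reports": [{"status": ...}]}}.
    """
    reports = status_payload.get("content", {}).get("reports", [])
    return bool(reports) and all(r.get("status") == "COMPLETE" for r in reports)


def wait_for_coverage_report(
    fetch_status: Callable[[], Dict[str, Any]],  # hypothetical: fetches bulkjob status
    interval_seconds: float = 5.0,               # assumed polling interval
    max_attempts: int = 60,                      # assumed upper bound on polls
) -> Dict[str, Any]:
    """Poll until the coverage report is ready, then return the last payload."""
    for attempt in range(max_attempts):
        payload = fetch_status()
        if coverage_reports_complete(payload):
            return payload
        if attempt < max_attempts - 1:
            time.sleep(interval_seconds)
    raise TimeoutError("Coverage report was not COMPLETE after polling.")
```

Passing the fetch function in as a parameter keeps the loop independent of any particular HTTP client or authentication scheme.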
We have trained a new LLM to generate Organization.summary. This approach lets us generate about 5 million summaries per month. We have prioritized private corporation entities for the first updates. These new summaries began appearing in the KG on July 7th. Please share your feedback and coverage requests with [email protected].
We've begun to populate open hours for local businesses in the Knowledge Graph. Try searching for hospitality companies in the United States with openHours.
Job API docs:
https://docs.diffbot.com/reference/job
We've updated Diffbot's Privacy Policy.
Effective Date: April 10, 2023
We care deeply about your privacy. Recently, we discontinued all advertising, cookie, and web beacon-based tracking on www.diffbot.com. Instead, we use Plausible Analytics for visitor analytics and website reporting. Please note: Trial Users and Subscribers who create an account with Diffbot are still required to accept cookies to use the Diffbot Dashboard or Leadgraph web applications.
While we've discontinued advertising, we do plan to continue to send the occasional marketing newsletter to update you on recent product launches and programs. Want to change how you receive these emails? You can update your preferences or unsubscribe from the list by contacting us at [email protected] or by clicking the 'unsubscribe' link in the footer of the newsletter.
Major additions to the Diffbot Privacy Policy include:
These additions expand the protections and rights of Users under the California Privacy Rights Act, Cal. Civ. Code §§ 1798.100 et seq. (“CPRA”), EU Data Protection Laws, other laws of the European Union or any Member State thereof, the UK GDPR, and the laws of any other country or state to which you may be subject.
Major edits made to and removals from the policy and additional terms include:
We pushed several breaking changes to Article data, Job Posts, Discussion Threads, and Product Offers in the KG.
FIRST:
The 'types' array was sorted with the most specific type first. We reversed the order to make the array consistent with 'types' in Organization, Person, and Place entities in the graph.
SECOND:
The 'types' array previously always contained GlobalIndexDiffbotEntity for Articles/Job Posts/Discussions/Product Offers. We have removed that unnecessary 'types' designation, i.e.
"types": [
  "Article",
  "GlobalIndexDiffbotEntity"
]
became
"types": [
  "Article"
]
similar to
"types": [
  "Organization",
  "Corporation",
  "Company"
]
FINALLY:
Some 'types' arrays previously contained GlobalIndexComplexTypeWrapper as well. We removed that unnecessary 'types' designation, i.e.
"types": [
  "Discussion",
  "GlobalIndexComplexTypeWrapper",
  "GlobalIndexDiffbotEntity"
]
became
"types": [
  "Discussion"
]
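Clients that still hold records exported before these changes may want to bring old 'types' arrays in line with the new format. The sketch below shows one way to drop the two removed designations. The function and constant names are hypothetical, and the sketch does not attempt to detect or undo the old ordering, since the array alone does not say which order it was written in.

```python
from typing import List

# The two designations removed from 'types' in this release.
LEGACY_INTERNAL_TYPES = {
    "GlobalIndexDiffbotEntity",
    "GlobalIndexComplexTypeWrapper",
}


def strip_internal_types(types: List[str]) -> List[str]:
    """Return the types list without the removed internal designations.

    The relative order of the remaining types is preserved as given.
    """
    return [t for t in types if t not in LEGACY_INTERNAL_TYPES]
```

For example, applying `strip_internal_types` to the old Discussion array shown above yields a list containing only "Discussion", and an array already in the new format passes through unchanged.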
In March, we rolled out two key service layer optimizations to the Diffbot Knowledge Graph architecture in production. Both radically reduce response times for queries and enrichment requests. The first is the KG Engine, a look-up service that improves the performance and scalability of both Enhance and DQL and lets us update Organization and Person data in the graph between builds. The second is a new search indexing layer that aggregates all Articles in the KG that were crawled and/or published in the last six months, which reduces response times when querying recent news.