Diffbot

Diffbot

Technology, Information and Internet

Menlo Park, California 4,724 followers

We Structure the World's Knowledge.

About us

We Structure the World's Knowledge. Diffbot is a world-class group of AI engineers building a universal database of structured information, to provide knowledge as a service to all intelligent applications. Whether you are building an app that uses web content, an enterprise business application, or a smart robotic assistant, we've got you covered. Thousands of leading companies rely on Diffbot data for their enterprise and consumer applications.

Website
https://www.diffbot.com/
Industry
Technology, Information and Internet
Company size
11-50 employees
Headquarters
Menlo Park, California
Type
Privately Held
Founded
2011
Specialties
machine learning, relation extraction, truth discovery, knowledge fusion, computer vision, web scraping, data extraction, information retrieval, artificial intelligence, and ecommerce

Locations

Employees at Diffbot

Updates

  • Diffbot reposted this

    View profile for Leann Chen, graphic

    Grounding LLMs with Knowledge Graphs @ Diffbot

    Graph RAG or vector-based RAG? There's some discussions (debate) around which is better. (but we're not continuing this conversation...) Thanks to Tomaz Bratanic, one of the thought leaders and trailblazers in the graph world with cool applications, we jointly explore a project that realizes and visualizes the idea of"bringing unstructured and structured data" together. You'll see side-by-side comparisons of AI-generated responses between vector-only searches and vector+knowledge graph in the video, showing how the latter can outperform the former. Neo4j can store both graph data and text embeddings for vector-based searches, and Diffbot provides verified data from the web — the exact opposite of hallucinated LLM-based sources, so AI applications don't suffer from unreliable outputs. To see the cool network visualization in the app, jump to 3:51 and check out Tomaz and Anej's work. You can always go back and see how we use Diffbot's Article, Natural Language Processing, and Enhance APIs to enrich knowledge graphs with accurate and reliable information. p.s. Who's also going to AI Engineer World's Fair the next 2 days in SF? DM if you're around! #neo4j #diffbot #graphrag

  • Diffbot reposted this

    View profile for Reza Ardestani, graphic

    Chatbot & Language Models developer | Software Engineer | Data Scientist | Python & Pytorch enthusiast

    Hi everyone. Have you ever had a pile of personal or work documents that you wanted to make sense of or find all relevant information from? In three well-documented projects, I have addressed this challenge. I implemented a Retrieval-Augmented Generation (RAG) system using a recent architecture, RAPTOR. However, this system could not answer all questions. To cover these limitations, I used a Knowledge Graph. Finally, I implemented everything locally to ensure the privacy of proprietary documents, which is of paramount importance. Visit my webpage to read more and watch the videos of the three projects: https://lnkd.in/gNnnH3Ne In the implementation of RAG, I did not sweep anything under the rug. All limitations are exposed, and I provide suggestions for improvements to set realistic expectations for the deployment of this AI system. Some technologies I used for implementation include: Neo4j Graph DBMS, Diffbot API, Cypher query language, LangChain, Mistral Language model, Nomic AI embeddings, Beautiful Soup for web scraping, Streamlit for UI, Vector Database, Credits: + Inventures 2024 website is the source of my training data. + YouTube thumbnails are made by Sombilon Studios + I am thankful for Javier A. Jaime Serrano, who answered my questions on Graph DBMS. + Soundtrack: Persian Version of Game Of Thrones Song Other references on my page ⚠️ Disclaimers: + This is an educational purpose porjece, in-line with Alberta Innovates privacy policies. Any further use of the sessions data should be checked with their latest privacy policy. + Language models may modify the text during summarization process. The only source of data for reference is their webpage at https://lnkd.in/gHk7k4r6

  • View organization page for Diffbot, graphic

    4,724 followers

    👋 Hi friends, we're going to be hanging out at the upcoming AI Engineer World's Fair in San Francisco the week of June 24-28. If you're in San Francisco, we'd love to meet you! Leave a comment to let us know. We may even have a t-shirt in your size! Here, I even modeled for you.

    • Jerome attempts to make a jumping star pose thing for the camera but fails and is caught mid jump.
  • Diffbot reposted this

    Neo4j + Diffbot is an incredible combination. Here's how you can learn more about it! This month on the new episode of GraphStuff.fm, the official Neo4j Podcast, my colleagues Jennifer Reif and Jason Koo welcome Leann Chen to talk about knowledge graphs and Diffbot. Leann is a developer advocate at Diffbot and focused on using knowledge graphs to improve LLM based applications. This is a must-listen if you are interested in the intersection of knowledge graphs and LLMs. You can find the full episode here: https://buff.ly/4e7RwB6 Follow me Daniel Bukowski for daily content about the intersection of graphs, data science, and GenAI. #neo4j #diffbot #genai #llm #knowledgegraphs

    GraphStuff.FM: The Neo4j Graph Database Developer Podcast

    GraphStuff.FM: The Neo4j Graph Database Developer Podcast

    graphstuff.fm

  • Diffbot reposted this

    View profile for Tomaz Bratanic, graphic

    Graph ML and GenAI research at Neo4j

    Looking good!

    View organization page for LangChain, graphic

    234,479 followers

    📄 Refreshed docs for LangChain v0.2 We've listened to your feedback and made major improvements to our docs. With the release of v0.2 today, we now have versioned docs, with clearer structure and consolidated content. Our docs are separated into: • Tutorials: step-by-step guides on how to build specific applications (e.g. a chatbot, RAG app, or agent) from start to finish • How-to-guides: detailed instruction guides on how to do particular tasks (more in-depth, advanced) • Conceptual guides: glossary of terminology and techniques for new concepts or general LangChain knowledge • API docs: detailed technical reference documentation We also provide instructions on how to upgrade & how to map previous concepts from old versions to new ones in the "LangChain over time" docs section. Thank you to everyone in the community for your feedback! ✍️Read our blog post: https://lnkd.in/gR4Jm-Dt ➡️ New Python docs: https://lnkd.in/gycrPZAH ➡️ New JavaScript docs: https://lnkd.in/g7xWnW9F 📽️ Video walkthrough of new docs: https://lnkd.in/g3H54HXb 🗣️ Give your feedback: https://lnkd.in/gVqxqhK8

  • View organization page for Diffbot, graphic

    4,724 followers

    It's here! The UI sandbox for Natural Language API is now generally available on the Diffbot Dashboard. This doesn't add any new NL API features. It's primary objective is to make it easy to test NL API without having to write an API call. - Click through disambiguated entities to access their entity profile in the Diffbot KG - Visualize and isolate extracted facts - Generate NL API requests in Python, Javascript, and cURL - View and copy the complete NL JSON response Up next, we're going to share some fresh open-source NL showcase projects like the sanctions tracker we built back in 2022. Stay tuned! 📣 🙌 Shout out to our fans — Leann Chen: Confidence scores are listed in the entity table! William Bui: IAB categories with confidence scores can be found in the JSON response. Tomaz Bratanic: Find the diffbotUris for each entity in the JSON or just click on them in the entity/facts table.

  • View organization page for Diffbot, graphic

    4,724 followers

    This has been on our roadmap for a minute, but I've finally got a scaffold in place for a Natural Language UI in the dashboard! Planned features: - Entity highlighting and linking - API request code generator for Python, Javascript & cURL - View complete NL API JSON response Anything else ya'll would like to see here? Maybe some #LangChain specific tools?

    • No alternative text description for this image

Affiliated pages

Similar pages

Browse jobs

Funding

Diffbot 3 total rounds

Last Round

Series A

US$ 10.0M

See more info on crunchbase