A Beginners Guide to Building a RAG App Using Open Source Milvus

•

1 like•219 views

We will showcase how you can build a RAG using Milvus. Retrieval-augmented generation (RAG) is a technique for enhancing the accuracy and reliability of generative AI models with facts fetched from external sources.

Similar to A Beginners Guide to Building a RAG App Using Open Source Milvus

Cloud Computing Without The Hype An Executive Guide (1.00 Slideshare)

Lustratus REPAMA

Author: Steve Craggs - Lustratus Research Limited. Defining Cloud Computing and identifying the current players This document offers a high-level summary of Cloud Computing, targeted at Executives who find themselves bombarded with Cloud Computing and need to cut through the hype to get a clear understanding of what cloud is all about. Cloud is defined in simple terms and the main categories of cloud are identified. A high level segmentation of the cloud marketplace is also offered, and includes a reasonably comprehensive index of suppliers in the Cloud Computing marketplace and the Cloud segments in which they operate.

Wcm777 Apresentação Armazenamento em nuvens

Paulo Morais

The document discusses WCM7 cloud services and provides an overview of current and upcoming offerings. It describes available cloud services like Cloud Space for storage, Music Cloud, and Books Library. Upcoming services mentioned include TV/Video Cloud, Game Cloud, and Social Cloud. A live demo is offered if internet connectivity allows. The document concludes by providing contact information for general inquiries and directing the reader to the WCM7 website for more information.

Wcm7 serviços de cloud

Mundo Novo Informatica

The document discusses WCM7 cloud services and provides an overview of current and upcoming offerings. It describes available cloud services like Cloud Space for storage, Music Cloud, and Books Library. Upcoming services mentioned include social cloud, TV/video cloud, and game cloud. A live demo is offered if internet connectivity allows. The document concludes by providing contact information for general inquiries and directing the reader to the WCM7 website for more information.

MODAClouds Decision Support System for Cloud Service Selection

Ioan Toma

The document discusses MODAClouds' Decision Support System (DSS) for cloud service selection. Some key points: - The DSS helps users select cloud services by considering multiple dimensions like cost, quality, risks, and technical/business constraints. - It allows multiple stakeholders like architects, operators, managers to provide input on tangible/intangible assets and risks. - The DSS performs risk analysis and generates requirements. It also considers issues around multi-cloud environments like interoperability and migration challenges. - Other features include automatic data gathering from various sources, and progressive learning over time from user inputs and service selections.

MODAClouds Decision Support System for Cloud Service Selection

LDBC council

06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...

Timothy Spann

06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...

Timothy Spann

06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI Discussion on Vector Databases, Unstructured Data and AI https://www.meetup.com/unstructured-data-meetup-new-york/ This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.

Introduction to Blockchain Business Models

Gokul Alex

Blockchain provides new business models that can transform existing models. Some key models include: - Token economies where tokens power functionality and enable value exchange within an ecosystem. Utility tokens exemplify this. - Blockchain as a service allows businesses to outsource technical blockchain aspects while focusing on front-end development. - Blockchain development platforms empower developers to build decentralized applications that require tokens to access network resources and provide value to users.

Cloud Technologies for Businesses

Ernesto Loya

The Growth Of Data Centers

Gina Buck

Data centers are growing to accommodate more internet-connected devices, with innovations helping achieve network coverage for billions of devices by 2020. As data centers grow, trends like software-driven infrastructure, microtechnology, and alternative energy use are making data centers more efficient by consolidating resources and reducing size. Hyperconvergence allows more efficient use of rack space by consolidating computer storage, networking, and virtualization in compact 2U systems from companies like Simplivity and Nutanix.

Situation Normal, FOWA Dublin

Simon Wardley

The document discusses the evolution of cloud computing from its early conceptualization to its current form. It explores how cloud computing has progressed from an undefined concept to widespread ubiquity due to increasing demand and continuous improvements by suppliers. Key factors that have driven this transition include the commoditization of infrastructure, the delivery of software and platforms as standardized services, and a shift towards viewing these resources as utilities rather than custom-built products.

Mitre ATT&CK by Mattias Almeflo Nixu

Nixu Corporation

IRJET- Blockchain Technology a Literature Survey

IRJET Journal

This document provides a literature survey of blockchain technology. It begins with an introduction to blockchain, describing it as a decentralized digital ledger that securely records data exchanges without a central authority. The document then reviews typical algorithms used in blockchains like proof-of-work and proof-of-stake. It discusses challenges of blockchain like scalability issues due to increasing transaction volumes. The document also summarizes potential applications of blockchain beyond cryptocurrencies in areas like smart contracts, supply chain management, healthcare records, and more. It concludes by noting ongoing work to address technical challenges and potential future advances in blockchain.

MLOps – Applying DevOps to Competitive Advantage

DATAVERSITY

MLOps is a practice for collaboration between Data Science and operations to manage the production machine learning (ML) lifecycles. As an amalgamation of “machine learning” and “operations,” MLOps applies DevOps principles to ML delivery, enabling the delivery of ML-based innovation at scale to result in: Faster time to market of ML-based solutions More rapid rate of experimentation, driving innovation Assurance of quality, trustworthiness, and ethical AI MLOps is essential for scaling ML. Without it, enterprises risk struggling with costly overhead and stalled progress. Several vendors have emerged with offerings to support MLOps: the major offerings are Microsoft Azure ML and Google Vertex AI. We looked at these offerings from the perspective of enterprise features and time-to-value.

MajorProject_AnilSharma

Anil Sharma

This document discusses strategic dimensions for network and IT service providers entering the cloud computing market. It conducted a survey of experts on both the demand and supply sides of the industry. The survey found that cloud computing is seen as a good opportunity for both network and IT service providers to expand into. Experts said the top benefits of cloud computing are flexibility, scalability, cost savings, and business continuity. However, security and data confidentiality were cited as major concerns. The document provides recommendations on strategic positioning in areas like value proposition, branding, and customization to differentiate in the cloud computing market.

SYN207: Newest and coolest NetScaler features you should be jazzed about

Citrix

Citrix NetScaler engineering continues to deliver new enhancements and cool features. This technical session will highlight five recent NetScaler innovations in virtual application, desktop and server availability and security that can improve your datacenter network and make applications run better and faster. Topics will include faster app acceleration and why developers are building apps to leverage advanced ADC capabilities.

How a Time Series Database Contributes to a Decentralized Cloud Object Storag...

InfluxData

Cisco Connect 2018 Malaysia - Secure data center-building a secure zero-trus...

NetworkCollaborators

The document discusses how Cisco Tetration Analytics can be used to strengthen data center security through comprehensive visibility and machine learning capabilities. It describes Tetration's ability to map all network traffic, establish baselines of normal behavior, detect anomalies and outliers, and enable automated whitelisting policies. The document also outlines Tetration's key security use cases like segmentation, inventory of running processes, and reducing mean time to identify threats.

Istio as an Enabler for Migrating Monolithic Applications to Microservices v1.3

Ahmed Misbah

Migrating application architectures to microservices is considered a key area of transformation in the IT world. Modernizing legacy applications to Kubernetes-based microservices can prove to be very challenging if not planned correctly, taking into consideration the right technologies and enablers. This session explains how Istio can be used as an enabler for modernizing legacy monolithic applications to microservices. Topics covered in the presentation will include: 1- Advantages of migrating to microservices and service mesh 2- Designing a microservice application based on splitting an existing monolithic application 3- Implementing microservices iteratively as a strangler fig application with Istio

Building clouds with apache cloudstack apache roadshow 2018

ShapeBlue

Talk given at Apache Roadshow, FOSS Backstage, Berlin, June 2018 Apache CloudStack is open source software designed to deploy and manage large networks of virtual machines, as a highly available, highly scalable Infrastructure as a Service (IaaS) cloud computing platform. This talk will give an introduction to the technology, its history and its architecture. It will look common use-cases (and some real production deployments) that are seen across both public and private cloud infrastructures and where CloudStack can be completed by other open source technologies. The talk will also compare and contrast Apache Cloudstack with other IaaS platforms and why he thinks that the technology, combined with the Apache governance model will see CloudStack become the de-facto open source cloud platform. He will run a live demo of the software and talk about ways that people can get involved in the Apache CloudStack project.

Similar to A Beginners Guide to Building a RAG App Using Open Source Milvus (20)

Cloud Computing Without The Hype An Executive Guide (1.00 Slideshare)

Wcm777 Apresentação Armazenamento em nuvens

Wcm7 serviços de cloud

MODAClouds Decision Support System for Cloud Service Selection

06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...

Introduction to Blockchain Business Models

Cloud Technologies for Businesses

The Growth Of Data Centers

Situation Normal, FOWA Dublin

Mitre ATT&CK by Mattias Almeflo Nixu

IRJET- Blockchain Technology a Literature Survey

MLOps – Applying DevOps to Competitive Advantage

MajorProject_AnilSharma

SYN207: Newest and coolest NetScaler features you should be jazzed about

How a Time Series Database Contributes to a Decentralized Cloud Object Storag...

Cisco Connect 2018 Malaysia - Secure data center-building a secure zero-trus...

Istio as an Enabler for Migrating Monolithic Applications to Microservices v1.3

Building clouds with apache cloudstack apache roadshow 2018

More from Zilliz

How CXAI Toolkit uses RAG for Intelligent Q&A

Zilliz

Multimodal Embeddings (continued) - South Bay Meetup Slides

Zilliz

Ensuring Secure and Permission-Aware RAG Deployments

Zilliz

In this talk, we will explore the critical aspects of securing Retrieval-Augmented Generation (RAG) deployments. The focus will be on implementing robust secured data retrieval mechanisms and establishing permission-aware RAG frameworks. Attendees will learn how to ensure that access control is rigorously maintained within the model when ingesting documents, ensuring that only authorized personnel can retrieve data. We will also discuss strategies to mitigate risks of data leakage, unauthorized access, and insider threats in RAG deployments. By the end of this session, participants will have a clearer understanding of the best practices and tools necessary to secure their RAG deployments effectively.

Retrieval Augmented Generation Evaluation with Ragas

Zilliz

Retrieval Augmented Generation (RAG) enhances chatbots by incorporating custom data in the prompt. Using large language models (LLMs) as judge has gained prominence in modern RAG systems. This talk will demo Ragas, an open-source automation tool for RAG evaluations. Christy will talk about and demo evaluating a RAG pipeline using Milvus and RAG metrics like context F1-score and answer correctness.

Garbage In, Garbage Out: Why poor data curation is killing your AI models (an...

Zilliz

Enterprises have traditionally prioritized data quantity, assuming more is better for AI performance. However, a new reality is setting in: high-quality data, not just volume, is the key. This shift exposes a critical gap – many organizations struggle to understand their existing data and lack effective curation strategies and tools. This talk dives into these data challenges and explores the methods of automating data curation.

It's your unstructured data: How to get your GenAI app to production (and spe...

Zilliz

So you've successfully built a GenAI app POC for your company -- now comes the hard part: bringing it to production. Aparavi addresses the challenges of AI projects while addressing data privacy and PII. Our Service for RAG helps AI developers and data scientists to scale their app to 1000s to millions of users using corporate unstructured data. Aparavi’s AI Data Loader cleans, prepares and then loads only the relevant unstructured data for each AI project/app, enabling you to operationalize the creation of GenAI apps easily and accurately while giving you the time to focus on what you really want to do - building a great AI application with useful and relevant context. All within your environment and never having to share private corporate data with anyone - not even Aparavi.

The History of Embeddings & Multimodal Embeddings

Zilliz

Using LLM Agents with Llama 3, LangGraph and Milvus

Zilliz

How Vector Databases are Revolutionizing Unstructured Data Search in AI Appli...

Zilliz

"Powered by the popularity of ChatGPT, Llama2, and other LLMs, we've seen a huge surge in interest for vector databases in 2023 and 2024. Vector databases are commonly used to connect relevant documents with LLMs, through a process called retrieval augmented generation (RAG). RAG has seen widespread adoption, from single-person startups to Fortune 500 companies. Despite the popularity of vector databases for LLMs, they are more broadly applicable for a variety of different types of unstructured data, i.e. any type of data that does not conform to a predefined data model, such as text, images, audio, molecules, and graphs. In this talk, we'll discuss some of the use cases for vector databases across many types of unstructured data."

ASIMOV: Enterprise RAG at Dialog Axiata PLC

Zilliz

The presentation will delve into the ASIMOV project, a novel initiative that leverages Retrieval-Augmented Generation (RAG) to provide precise, domain-specific assistance to telecommunications engineers and technicians. The session will focus on the unique capabilities of Milvus, the chosen vector database for the project, and its advantages over other vector databases. Attending this session will give you a deeper understanding of the potential of RAG and Milvus DB in telecommunications engineering. You will learn how to address common challenges in the field and enhance the efficiency of their operations. The session will equip you with the knowledge to make informed decisions about the choice of vector databases, and how best to use them for your use-cases

Metadata Lakes for Next-Gen AI/ML - Datastrato

Zilliz

As data catalogs evolve to meet the growing and new demands of high-velocity, unstructured data, we see them taking a new shape as an emergent and flexible way to activate metadata for multiple uses. This talk discusses modern uses of metadata at the infrastructure level for AI-enablement in RAG pipelines in response to the new demands of the ecosystem. We will also discuss Apache (incubating) Gravitino and its open source-first approach to data cataloging across multi-cloud and geo-distributed architectures.

Multimodal Retrieval Augmented Generation (RAG) with Milvus

Zilliz

Specializing Small Language Models With Less Data

Zilliz

Most AI teams are exploring the possibilities of LLMs, rather than being focused on margins but soon efficiency will become important. Implementing small, specialized models to solve specific problems is an option, but is not leveraged often, because it requires gathering high volumes of human-labeled training data which are hard to acquire. To alleviate this problem, I will discuss how large language models can be used to generate synthetic data used to help tune small models on domain-specific tasks. We will focus on extractive question answering use case where additional unstructured context can help training.

Occiglot - Open Language Models by and for Europe

Zilliz

Large language models (LLMs) have emerged as transformative tools, revolutionizing various natural language processing tasks. Despite their remarkable potential, the LLM landscape is predominantly shaped by US tech companies, leaving Europe with limited access and influence. This talk will present Occiglot - an ongoing research collective for open-source language models for and by Europe. More specifically, we will explain why open European LLMs are needed and share insights as well as lessons learned, ranging from data collection and curation, model training and evaluation

Fueling AI with Great Data with Airbyte Webinar

Zilliz

Programming Foundation Models with DSPy - Meetup Slides

Zilliz

Generating privacy-protected synthetic data using Secludy and Milvus

Zilliz

During this demo, the founders of Secludy will demonstrate how their system utilizes Milvus to store and manipulate embeddings for generating privacy-protected synthetic data. Their approach not only maintains the confidentiality of the original data but also enhances the utility and scalability of LLMs under privacy constraints. Attendees, including machine learning engineers, data scientists, and data managers, will witness first-hand how Secludy's integration with Milvus empowers organizations to harness the power of LLMs securely and efficiently.

Building Production Ready Search Pipelines with Spark and Milvus

Zilliz

Read more: https://zilliz.com/blog/building-production-ready-search-pipelines-spark-milvus Spark is the widely used ETL tool for processing, indexing and ingesting data to serving stack for search. Milvus is the production-ready open-source vector database. In this talk we will show how to use Spark to process unstructured data to extract vector representations, and push the vectors to Milvus vector database for search serving.

MemGPT: Introduction to Memory Augmented Chat

Zilliz

Copilot Workspace: What it is, how it works, why it matters

Zilliz

More from Zilliz (20)

How CXAI Toolkit uses RAG for Intelligent Q&A

Multimodal Embeddings (continued) - South Bay Meetup Slides

Ensuring Secure and Permission-Aware RAG Deployments

Retrieval Augmented Generation Evaluation with Ragas

Garbage In, Garbage Out: Why poor data curation is killing your AI models (an...

It's your unstructured data: How to get your GenAI app to production (and spe...

The History of Embeddings & Multimodal Embeddings

Using LLM Agents with Llama 3, LangGraph and Milvus

How Vector Databases are Revolutionizing Unstructured Data Search in AI Appli...

ASIMOV: Enterprise RAG at Dialog Axiata PLC

Metadata Lakes for Next-Gen AI/ML - Datastrato

Multimodal Retrieval Augmented Generation (RAG) with Milvus

Specializing Small Language Models With Less Data

Occiglot - Open Language Models by and for Europe

Fueling AI with Great Data with Airbyte Webinar

Programming Foundation Models with DSPy - Meetup Slides

Generating privacy-protected synthetic data using Secludy and Milvus

Building Production Ready Search Pipelines with Spark and Milvus

MemGPT: Introduction to Memory Augmented Chat

Copilot Workspace: What it is, how it works, why it matters

Recently uploaded

History and Introduction for Generative AI ( GenAI )

Badri_Bady

TrustArc Webinar - Innovating with TRUSTe Responsible AI Certification

TrustArc

In a landmark year marked by significant AI advancements, it’s vital to prioritize transparency, accountability, and respect for privacy rights with your AI innovation. Learn how to navigate the shifting AI landscape with our innovative solution TRUSTe Responsible AI Certification, the first AI certification designed for data protection and privacy. Crafted by a team with 10,000+ privacy certifications issued, this framework integrated industry standards and laws for responsible AI governance. This webinar will review: - How compliance can play a role in the development and deployment of AI systems - How to model trust and transparency across products and services - How to save time and work smarter in understanding regulatory obligations, including AI - How to operationalize and deploy AI governance best practices in your organization

How UiPath Discovery Suite supports identification of Agentic Process Automat...

DianaGray10

📚 Understand the basics of the newly persona-based LLM-powered Agentic Process Automation and discover how existing UiPath Discovery Suite products like Communication Mining, Process Mining, and Task Mining can be leveraged to identify APA candidates. Topics Covered: 💡 Idea Behind APA: Explore the innovative concept of Agentic Process Automation and its significance in modern workflows. 🔄 How APA is Different from RPA: Learn the key differences between Agentic Process Automation and Robotic Process Automation. 🚀 Discover the Advantages of APA: Uncover the unique benefits of implementing APA in your organization. 🔍 Identifying APA Candidates with UiPath Discovery Products: See how UiPath's Communication Mining, Process Mining, and Task Mining tools can help pinpoint potential APA candidates. 🔮 Discussion on Expected Future Impacts: Engage in a discussion on the potential future impacts of APA on various industries and business processes. Enhance your knowledge on the forefront of automation technology and stay ahead with Agentic Process Automation. 🧠💼✨ Speakers: Arun Kumar Asokan, Delivery Director (US) @ qBotica and UiPath MVP Naveen Chatlapalli, Solution Architect @ Ashling Partners and UiPath MVP

FIDO Munich Seminar Workforce Authentication Case Study.pptx

FIDO Alliance

FIDO Munich Seminar In-Vehicle Payment Trends.pptx

FIDO Alliance

DefCamp_2016_Chemerkin_Yury_--_publish.pdf

Yury Chemerkin

Indian Privacy law & Infosec for Startups

AMol NAik

Redefining Cybersecurity with AI Capabilities

Priyanka Aash

In this comprehensive overview of Cisco's latest innovations in cybersecurity, the focus is squarely on resilience and adaptation in the face of evolving threats. The discussion covers the imperative of tackling Mal information, the increasing sophistication of insider attacks, and the expanding attack surfaces in a hybrid work environment. Emphasizing a shift towards integrated platforms over fragmented tools, Cisco introduces its Security Cloud, designed to provide end-to-end visibility and robust protection across user interactions, cloud environments, and breaches. AI emerges as a pivotal tool, from enhancing user experiences to predicting and defending against cyber threats. The blog underscores Cisco's commitment to simplifying security stacks while ensuring efficacy and economic feasibility, making a compelling case for their platform approach in safeguarding digital landscapes.

Finetuning GenAI For Hacking and Defending

Priyanka Aash

Generative AI, particularly through the lens of large language models (LLMs), represents a transformative leap in artificial intelligence. With advancements that have fundamentally altered our approach to AI, understanding and leveraging these technologies is crucial for innovators and practitioners alike. This comprehensive exploration delves into the intricacies of GenAI, from its foundational principles and historical evolution to its practical applications in security and beyond.

Self-Healing Test Automation Framework - Healenium

Knoldus Inc.

UiPath Community Day Amsterdam: Code, Collaborate, Connect

UiPathCommunity

Welcome to our third live UiPath Community Day Amsterdam! Come join us for a half-day of networking and UiPath Platform deep-dives, for devs and non-devs alike, in the middle of summer ☀. 📕 Agenda: 12:30 Welcome Coffee/Light Lunch ☕ 13:00 Event opening speech Ebert Knol, Managing Partner, Tacstone Technology Jonathan Smith, UiPath MVP, RPA Lead, Ciphix Cristina Vidu, Senior Marketing Manager, UiPath Community EMEA Dion Mes, Principal Sales Engineer, UiPath 13:15 ASML: RPA as Tactical Automation Tactical robotic process automation for solving short-term challenges, while establishing standard and re-usable interfaces that fit IT's long-term goals and objectives. Yannic Suurmeijer, System Architect, ASML 13:30 PostNL: an insight into RPA at PostNL Showcasing the solutions our automations have provided, the challenges we’ve faced, and the best practices we’ve developed to support our logistics operations. Leonard Renne, RPA Developer, PostNL 13:45 Break (30') 14:15 Breakout Sessions: Round 1 Modern Document Understanding in the cloud platform: AI-driven UiPath Document Understanding Mike Bos, Senior Automation Developer, Tacstone Technology Process Orchestration: scale up and have your Robots work in harmony Jon Smith, UiPath MVP, RPA Lead, Ciphix UiPath Integration Service: connect applications, leverage prebuilt connectors, and set up customer connectors Johans Brink, CTO, MvR digital workforce 15:00 Breakout Sessions: Round 2 Automation, and GenAI: practical use cases for value generation Thomas Janssen, UiPath MVP, Senior Automation Developer, Automation Heroes Human in the Loop/Action Center Dion Mes, Principal Sales Engineer @UiPath Improving development with coded workflows Idris Janszen, Technical Consultant, Ilionx 15:45 End remarks 16:00 Community fun games, sharing knowledge, drinks, and bites 🍻

Enterprise_Mobile_Security_Forum_2013.pdf

Yury Chemerkin

What's New in Copilot for Microsoft 365 June 2024.pptx

Stephanie Beckett

FIDO Munich Seminar FIDO Automotive Apps.pptx

FIDO Alliance

Generative AI Reasoning Tech Talk - July 2024

siddu769252

Generative AI technology is a fascinating field that focuses on creating comp...

Nohoax Kanont

Generative AI technology is a fascinating field that focuses on creating computer models capable of generating new, original content. It leverages the power of large language models, neural networks, and machine learning to produce content that can mimic human creativity. This technology has seen a surge in innovation and adoption since the introduction of ChatGPT in 2022, leading to significant productivity benefits across various industries. With its ability to generate text, images, video, and audio, generative AI is transforming how we interact with technology and the types of tasks that can be automated.

Mule Experience Hub and Release Channel with Java 17

Bhajan Mehta

Keynote : Presentation on SASE Technology

Priyanka Aash

Secure Access Service Edge (SASE) solutions are revolutionizing enterprise networks by integrating SD-WAN with comprehensive security services. Traditionally, enterprises managed multiple point solutions for network and security needs, leading to complexity and resource-intensive operations. SASE, as defined by Gartner, consolidates these functions into a unified cloud-based service, offering SD-WAN capabilities alongside advanced security features like secure web gateways, CASB, and remote browser isolation. This convergence not only simplifies management but also enhances security posture and application performance across global networks and cloud environments. Discover how adopting SASE can streamline operations and fortify your enterprise's digital transformation strategy.

FIDO Munich Seminar: Biometrics and Passkeys for In-Vehicle Apps.pptx

FIDO Alliance

Mastering Board Best Practices: Essential Skills for Effective Non-profit Lea...

OnBoard

Recently uploaded (20)

History and Introduction for Generative AI ( GenAI )

TrustArc Webinar - Innovating with TRUSTe Responsible AI Certification

How UiPath Discovery Suite supports identification of Agentic Process Automat...

FIDO Munich Seminar Workforce Authentication Case Study.pptx

FIDO Munich Seminar In-Vehicle Payment Trends.pptx

DefCamp_2016_Chemerkin_Yury_--_publish.pdf

Indian Privacy law & Infosec for Startups

Redefining Cybersecurity with AI Capabilities

Finetuning GenAI For Hacking and Defending

Self-Healing Test Automation Framework - Healenium

UiPath Community Day Amsterdam: Code, Collaborate, Connect

Enterprise_Mobile_Security_Forum_2013.pdf

What's New in Copilot for Microsoft 365 June 2024.pptx

FIDO Munich Seminar FIDO Automotive Apps.pptx

Generative AI Reasoning Tech Talk - July 2024

Generative AI technology is a fascinating field that focuses on creating comp...

Mule Experience Hub and Release Channel with Java 17

Keynote : Presentation on SASE Technology

FIDO Munich Seminar: Biometrics and Passkeys for In-Vehicle Apps.pptx

Mastering Board Best Practices: Essential Skills for Effective Non-profit Lea...

A Beginners Guide to Building a RAG App Using Open Source Milvus

1. 1 | © Copyright 8/16/23 Zilliz 1 | © Copyright 8/16/23 Zilliz Stephen Batifol | Zilliz A Beginners Guide to Building a RAG App Using Milvus

2. 2 | © Copyright 8/16/23 Zilliz 2 | © Copyright 8/16/23 Zilliz Stephen Batifol Developer Advocate, Zilliz stephen.batifol@zilliz.com https://www.linkedin.com/in/stephen-batifol/ https://twitter.com/stephenbtl Speaker

3. 3 | © Copyright 8/16/23 Zilliz 3 | © Copyright 8/16/23 Zilliz | © Copyright 8/16/23 Zilliz 3 RAG (Retrieval Augmented Generation)

4. 4 | © Copyright 8/16/23 Zilliz 4 | © Copyright 8/16/23 Zilliz Basic Idea Use RAG to force the LLM to work with your data by injecting it via a vector database like Milvus

5. 5 | © Copyright 8/16/23 Zilliz 5 | © Copyright 8/16/23 Zilliz Vector DB for RAG Vector Databases provide the ability to inject your data via semantic similarity Considerations include: scale, performance, and flexibility

6. 6 | © Copyright 8/16/23 Zilliz 6 | © Copyright 8/16/23 Zilliz LLMs are Stochastic LLMs predict future tokens (a-la RNNs) • “Milvus is the world ’s most popular vector ___” • {“database”: 0.86, “search”: 0.11, “embedding”, 0.01, …} Downside: outdated input data could be cause for hallucination • Plausible-sounding but factually incorrect responses

10. 10 | © Copyright 8/16/23 Zilliz 10 | © Copyright 8/16/23 Zilliz • Framework for building LLM Applications • Focus on retrieving data and integrating with LLMs • Loading the Data • Chunk & Chunk Overlap • Integrations with most popular tools Langchain

11. 11 | © Copyright 8/16/23 Zilliz 11 | © Copyright 8/16/23 Zilliz Ollama • Run quantized LLMs Locally • Embeddings Models

12. 12 | © Copyright 8/16/23 Zilliz 12 | © Copyright 8/16/23 Zilliz Milvus 1. Cloud Native, Distributed System Architecture 2. True Separation of Concerns 3. Scalable Index Creation Strategy with 512 MB Segments

15. 15 | © Copyright 8/16/23 Zilliz 15 | © Copyright 8/16/23 Zilliz Examining Embeddings Picking a model What to embed Metadata

16. 16 | © Copyright 8/16/23 Zilliz 16 | © Copyright 8/16/23 Zilliz Embeddings Strategies Level 1: Embedding Chunks Directly Level 2: Embedding Sub and Super Chunks Level 3: Incorporating Chunking and Non-Chunking Metadata

17. 17 | © Copyright 8/16/23 Zilliz 17 | © Copyright 8/16/23 Zilliz Metadata Examples Chunking - Paragraph position - Section header - Larger paragraph - Sentence Number - … Non-Chunking - Author - Publisher - Organization - Role Based Access Control - …

18. 18 | © Copyright 8/16/23 Zilliz 18 | © Copyright 8/16/23 Zilliz Text: “preferences of customers and prospective customers with respect to remote or hybrid working, as a result of the COVID-19 pandemic, leading to a parallel delay, or potentially permanent change, in receiving the corresponding revenue; •our projected financial information, anticipated growth rate, and market opportunity; •our ability to maintain the listing of our Class A Common Stock and Warrants on the NYSE; •our public securities’ potential liquidity and trading;” Vector: [-0.09975282847881317,-0.02853492833673954,-0.047886092215776443,0.01231582183 3908558,-0.004004416521638632,0.08756010979413986,0.013248161412775517,0.01070 4956017434597,-0.06194952502846718,0.021150749176740646,0.02453230880200863,0 .03979797288775444,-0.032914288341999054,-0.011855324730277061,...] What your data looks like

19. 19 | © Copyright 8/16/23 Zilliz 19 | © Copyright 8/16/23 Zilliz Your embeddings strategy depends on your accuracy, cost, and use case needs Takeaway:

21. 21 | © Copyright 8/16/23 Zilliz 21 | © Copyright 8/16/23 Zilliz Chunking Considerations Chunk Size Chunk Overlap Character Splitters

26. 26 | © Copyright 8/16/23 Zilliz 26 | © Copyright 8/16/23 Zilliz How Does Your Data Look? Conversation Data Documentation Data Lecture or Q/A Data

27. 27 | © Copyright 8/16/23 Zilliz 27 | © Copyright 8/16/23 Zilliz Your chunking strategy depends on what your data looks like and what you need from it. Takeaway:

29. 29 | © Copyright 8/16/23 Zilliz 29 | © Copyright 8/16/23 Zilliz Questions? Give Milvus a Star! Chat with me on Discord!

30. 30 | © Copyright 8/16/23 Zilliz 30 | © Copyright 8/16/23 Zilliz Meta Storage Root Query Data Index Coordinator Service Proxy Proxy etcd Log Broker SDK Load Balancer DDL/DCL DML NOTIFICATION CONTROL SIGNAL Object Storage Minio / S3 / AzureBlob Log Snapshot Delta File Index File Worker Node QUERY DATA DATA Message Storage VECTOR DATABASE Access Layer Query Node Data Node Index Node Milvus Architecture

A Beginners Guide to Building a RAG App Using Open Source Milvus

Related slideshows

More Related Content

Similar to A Beginners Guide to Building a RAG App Using Open Source Milvus

Similar to A Beginners Guide to Building a RAG App Using Open Source Milvus (20)

More from Zilliz

More from Zilliz (20)

Recently uploaded

Recently uploaded (20)

A Beginners Guide to Building a RAG App Using Open Source Milvus