Janakiram MSV
Janakiram MSV
  • Видео 223
  • Просмотров 1 363 256
Transforming AI | Inference and Fine-Tuning | Lin Qiao on Fireworks AI
Join us for an in-depth conversation with Lin Qiao, the CEO and founder of Fireworks AI, on this episode of The Encounter. Lin shares her incredible journey from working at Meta, where she played a key role in developing the PyTorch AI framework, to founding Fireworks AI. Discover how Fireworks AI is revolutionizing the AI landscape with its cutting-edge inference and fine-tuning services.
Learn about the challenges enterprises face in transitioning to an AI-first approach and how Fireworks AI is designed to help them overcome these hurdles efficiently. Dive into Lin's vision for the future of AI and understand what sets Fireworks AI apart from the competition. Don't miss this fascinating...
Просмотров: 400

Видео

Fireworks AI | Revolutionizing Inference | Fine-Tuning | Lin Qiao |
Просмотров 1402 месяца назад
Get ready for an insightful journey with Lin Qiao, CEO and founder of Fireworks AI, in this exciting episode of The Encounter! Discover how Lin transitioned from her pivotal role at Meta, where she helped build the revolutionary AI framework PyTorch, to creating Fireworks AI, a cutting-edge platform offering inference and fine-tuning as a service. Learn about the unique challenges faced by ente...
Building Conversational Agents | The Agent vs. Chatbot Difference | Priyanka Vergadia
Просмотров 8592 месяца назад
In this insightful interview, Janakiram MSV sits down with Priyanka Vergadia to delve into the nuances between conversational agents and chatbots. Priyanka expertly explains how to define goals, provide instructions, and design these AI entities to handle specific tasks, using natural language prompts. Discover the key distinctions and gain valuable insights into building effective conversation...
Google Cloud Next 2024 Recap | Top Announcements Unveiled | Priyanka Vergadia | Stay Tuned!
Просмотров 642 месяца назад
In this exclusive teaser, join Janakiram MSV as he sits down with Priyanka Vergadia, Google Cloud's star evangelist, to discuss the most exciting announcements from Google Cloud Next 2024. Get a sneak peek into their insightful conversation as they demystify the latest technologies and innovations. Stay tuned for the full interview, coming soon! Follow: www.linkedin.com/in/pvergadia/ #GoogleClo...
Decade of Kubernetes | Priyanka Sharma | Discusses Its Impact & Future at CNCF
Просмотров 2602 месяца назад
Join Janakiram MSV as he hosts Priyanka Sharma, Executive Director of the Cloud Native Computing Foundation, in a detailed conversation about Kubernetes' impactful journey and its future. Celebrating its 10th anniversary, Kubernetes has evolved from an innovative open-source project to a foundational technology akin to Linux, influencing vast areas including AI and edge computing. Hear Priyanka...
Kubernetes Turns 10 | Priyanka Sharma Explores Its Impact & Future at CNCF | Stay Tuned!
Просмотров 493 месяца назад
Stay tuned to this exhilarating episode of The Encounter as Priyanka Sharma, Executive Director of the CNCF, reflects on Kubernetes' transformative journey over the past decade. From its inception to becoming the backbone of modern infrastructure, hear Priyanka share insider insights on Kubernetes' milestones and its pivotal role in shaping the tech landscape. Don't miss this special edition wh...
Building End-to-End RAG Pipelines Locally on Kubernetes
Просмотров 6143 месяца назад
Watch the replay and a deep dive into my KubeCon Paris demo. Tune in if you are interested in learning how to run an end-to-end RAG pipeline on cloud native infrastructure.
Mastering Gemini Function Calling & LangChain Agents | A Comprehensive Tutorial
Просмотров 2,3 тыс.3 месяца назад
Dive into the world of Gemini Function Calling and LangChain agents with our step-by-step tutorial. This video is designed for developers and enthusiasts keen on exploring advanced generative AI and cloud-native technologies. Learn how to seamlessly integrate Gemini Function Calling with LangChain agents to build, deploy, and manage intelligent applications. Whether you're looking to enhance yo...
Harnessing the Power of Multimodal Interactions with Gemini | A Comprehensive Tutorial
Просмотров 3124 месяца назад
This in-depth tutorial explores the revolutionary capabilities of Gemini for creating and leveraging multimodal applications. Covering a broad spectrum of functionalities, including image prompts, combined image text prompts, multimodal embeddings, and semantic search for images, this guide is designed for developers and researchers eager to delve into the integration of various data types for ...
Saad Malik's Insights on Spectro Cloud's Strategy Kubernetes Evolution Integration at the Edge
Просмотров 924 месяца назад
This interview is a treasure trove for anyone keen on understanding the future of cloud-native technologies, Kubernetes evolution, and the strategic role of AI at the edge. Join us as Saad Malik from Spectro Cloud shares his invaluable perspectives and vision for navigating the complex landscape of modern computing infrastructure. TIME STAMPS 00:00 - Trailer 01:31 - Introduction 1:55 - Has Kube...
Kubernetes & Beyond | Saad Malik on future of Cloud Native Tech | Spectra Cloud
Просмотров 614 месяца назад
Gear up for a session filled with groundbreaking insights that promise to inspire and propel your tech initiatives into the future. Join Saad Malik, the visionary co-founder and CTO of Spectra Cloud, for an exhilarating journey into the heart of cloud-native technologies. In this dynamic conversation, uncover how Spectra Cloud is transforming Kubernetes, edge computing, and AI into powerful too...
Enhancing AI Deployments on Google Cloud | Erwan Menard
Просмотров 1624 месяца назад
Journey into the digital frontier with Erwan from Google Cloud, as we navigate the evolving landscape of AI and cloud technology on The Encounter. As the compass of Cloud AI and Vertex AI, Erwan charts a course through the transformative impact of AI on cloud-native platforms and the metamorphosis of DevOps practices. We delve into the confluence of predictive and generative AI, unearthing how ...
Step Into Tomorrow | How Erwan's Insights from Google Cloud Empower Your DevOps Strategy |
Просмотров 394 месяца назад
Tune in for a glimpse into the exciting advancements shaping our digital world! Unveiling the Future of Cloud-Native Tech: Join us for a riveting chat with Erwan from Google Cloud! This episode peels back the layers on Vertex AI, the impact of generative AI (Gen AI) in the tech sphere, and the pioneering concept of LLM Ops. Perfect for tech enthusiasts, developers, and AI experts eager for a de...
Using LangChain with Gemini and Chroma DB
Просмотров 2,5 тыс.4 месяца назад
This session covers how to use LangChain framework with Gemini and Chroma DB to implement Q&A and Summarization use cases. The code is available at gist.github.com/janakiramm/6546d9734c7872f111b139cda1a8e0de (Q&A) and gist.github.com/janakiramm/8fcfc6c055c09a6f5dc5248b890f0567 What you'll learn: - Detailed session on integrating LangChain framework with Gemini and Chroma DB. - Focus on developi...
Revolutionizing IT Ops | Corey Harrison | Flip AI's Breakthrough in Observability
Просмотров 2774 месяца назад
Join us for this compelling discussion on The Encounter, where generative AI and operational excellence converge to usher in a new era for the tech industry. Looking towards the future, Corey envisions a world where AI not only diagnoses but also automatically remediates technical issues, significantly elevating operational efficiency and allowing engineers to focus on more strategic tasks. As ...
How AI is Changing DevOps | Insights from Flip AI's | Corey Harrison
Просмотров 1004 месяца назад
How AI is Changing DevOps | Insights from Flip AI's | Corey Harrison
Analyzing Tamil Movie Scripts with Google's Most Capable Model - Gemini 1.5 Pro
Просмотров 3324 месяца назад
Analyzing Tamil Movie Scripts with Google's Most Capable Model - Gemini 1.5 Pro
Master RAG on Vertex AI with Vector Search and Gemini Pro
Просмотров 5 тыс.4 месяца назад
Master RAG on Vertex AI with Vector Search and Gemini Pro
Shaping the Future with Percona | Open Source, Cloud and Beyond | Ann Schlemmer
Просмотров 1334 месяца назад
Shaping the Future with Percona | Open Source, Cloud and Beyond | Ann Schlemmer
Ann Schlemmer's Vision | CEO of Percona | Redefining Cloud & Database Landscapes
Просмотров 414 месяца назад
Ann Schlemmer's Vision | CEO of Percona | Redefining Cloud & Database Landscapes
Mastering Gemini | A Developer's Practical Guide for Hands-On Learning
Просмотров 4444 месяца назад
Mastering Gemini | A Developer's Practical Guide for Hands-On Learning
Spotlight on Cloud Native Storage | Murli Thirumale | Portworx | Pure Storage
Просмотров 1285 месяцев назад
Spotlight on Cloud Native Storage | Murli Thirumale | Portworx | Pure Storage
Getting Started with Generative AI on Google Cloud
Просмотров 1,6 тыс.5 месяцев назад
Getting Started with Generative AI on Google Cloud
Cloud Native Storage Trends | What's Shaping the Landscape? | Murli Thirumale | Stay Tuned!
Просмотров 305 месяцев назад
Cloud Native Storage Trends | What's Shaping the Landscape? | Murli Thirumale | Stay Tuned!
Ori Goshen - Co-Founder, Co-CEO of AI21 Labs - Shares Vision and Insights
Просмотров 3115 месяцев назад
Ori Goshen - Co-Founder, Co-CEO of AI21 Labs - Shares Vision and Insights
Ever Wondered How AI21's Jurassic 2 Works? | Ori Goshen | Co-Founder & Co-CEO at AI21 Labs
Просмотров 1515 месяцев назад
Ever Wondered How AI21's Jurassic 2 Works? | Ori Goshen | Co-Founder & Co-CEO at AI21 Labs
Exclusive Encounter | Janakiram MSV and Mumshad Mannambeth | Exploring KodeKloud's Impact
Просмотров 7765 месяцев назад
Exclusive Encounter | Janakiram MSV and Mumshad Mannambeth | Exploring KodeKloud's Impact
Ready to Master Cloud DevOps with Mumshad Mannambeth? | KodeKloud
Просмотров 3655 месяцев назад
Ready to Master Cloud DevOps with Mumshad Mannambeth? | KodeKloud
Walkthrough of Perplexity Labs AI API
Просмотров 4,9 тыс.5 месяцев назад
Walkthrough of Perplexity Labs AI API
Next-Gen Observability Insights and Innovations | GroundCover CEO Shahar Azulay | Janakiram MSV
Просмотров 1305 месяцев назад
Next-Gen Observability Insights and Innovations | GroundCover CEO Shahar Azulay | Janakiram MSV

Комментарии

  • @imransaid1026
    @imransaid1026 2 дня назад

    Is there an inherent benefit to using Vertex Ai rather than AI Studio?

  • @user-wk2vd7je6h
    @user-wk2vd7je6h 2 дня назад

    Very excellent Learning session Janakiram!

  • @captainmcduckyYT
    @captainmcduckyYT 2 дня назад

    Very helpful. Thank you so much good sir.

  • @jagatmohansarvari5681
    @jagatmohansarvari5681 10 дней назад

    really helpful for understanding the concept of embedding and retrieval. Thanks.

  • @anilaknb1
    @anilaknb1 11 дней назад

    Great video & playlist ! Can you please share the source code ?

  • @dhananjaypathak15
    @dhananjaypathak15 12 дней назад

    i want same thing in nodej can some one please help on which library to use

  • @kaushikdas5115
    @kaushikdas5115 15 дней назад

    Sir, I have a long text which contains five bidder details and their financial values such as turnover, revenue etc. and that is stored in a variable, bidders_data. could you pls let me know how can I create a tool or function to draw a bar chart for the bidder names and turnover or any other financial parameters, when I give a prompt such as ''draw a bar chart for bidders (bidder names) and their turnover", so it should draw a bar chart as response. Awaiting for you reply sir!

  • @Manish69
    @Manish69 Месяц назад

    great work sir , this was very useful but throughout the whole video the audio was coming only from left side of headphones i hope you will fix this issue next time

  • @birolyildiz
    @birolyildiz Месяц назад

    ❤🙏

  • @AyushMandloi
    @AyushMandloi Месяц назад

    No voice

  • @wanderlust8367
    @wanderlust8367 Месяц назад

    the code link u have shared is incomplete, load_file is missing and other few stuffs,

  • @ShaliniMohan-vr4rb
    @ShaliniMohan-vr4rb Месяц назад

    Good Content and explanation for all the beginners

  • @jeffpowell860
    @jeffpowell860 Месяц назад

    Why is Chroma such hot garbage?

  • @shaktipawar5372
    @shaktipawar5372 Месяц назад

    Hi Janakiram. Can you explain Self attention and Feed forwarder layer in more detail plz ? Example what are the fundamental purpose / responsibility of each of them...

  • @brenoav99
    @brenoav99 2 месяца назад

    Thank you for the useful video. I'm having a problem using the chroma and google embedding (new versions): >>> vectordb=Chroma.from_documents(pages,embeddings) ValueError: Expected each embedding in the embeddings to be a list, got ['Repeated'] Do you have any ideas? Thanks in advance!

  • @shaktipawar5372
    @shaktipawar5372 2 месяца назад

    Janakiram, This is one of the best Fine Tuning Video I have came across. I need to thank you for getting this concept clear in such a simple manner. Keeping making such videos, it helps community to grow. Thank you !!

  • @tubasweb
    @tubasweb 2 месяца назад

    Can you make an update video onnx?

  • @IanMcAleer-op1xj
    @IanMcAleer-op1xj 2 месяца назад

    Thanks, this is tremendously helpful One point to note - you need to upload the embed file, not the sentence file -> upload_file(bucket_name,embed_file_path)

  • @AaronNicholsonAI
    @AaronNicholsonAI 2 месяца назад

    Super helpful! Thank you so much.

  • @ainanirina758
    @ainanirina758 2 месяца назад

    Awesome work as always! Do you have a link for the collab? Thanks

  • @awakenwithoutcoffee
    @awakenwithoutcoffee 2 месяца назад

    can we make our own "online" models with the Brave-API ?

  • @VijayKumar-ws3gv
    @VijayKumar-ws3gv 2 месяца назад

    Nice Explanation Anna :) expecting Enterprise Gen AI on AWS/AZURE/GCP

  • @larsfrommiddle2162
    @larsfrommiddle2162 2 месяца назад

    Do the online models also give back the source of the websites or papers it uses, when asked?

    • @Statsjk
      @Statsjk 2 месяца назад

      Did you figure it out?

  • @suryarp
    @suryarp 2 месяца назад

    What a pleasure to see Priyanka on your show! Awesum❤❤

  • @suryarp
    @suryarp 2 месяца назад

    Teaser really teased😊

  • @MarceloFerreira-rl6hh
    @MarceloFerreira-rl6hh 2 месяца назад

    Great job! Thanks a lot. What’s the difference between this approach and using langchain?

  • @softreviewed
    @softreviewed 3 месяца назад

    perplexity now supports mixtral-8x22b-instruct so if we uses can we get more accurate answers ?

  • @GAURAVRAUT007
    @GAURAVRAUT007 3 месяца назад

    Excellent video - can u please do same with Langchain with retrieval

  • @tavishi3884
    @tavishi3884 3 месяца назад

    Sir Can we use this gemini 1.5 pro with Langchian ..?

  • @fusionxfitness_
    @fusionxfitness_ 3 месяца назад

    you didn't shared the pdf

  • @vikasbammidi1340
    @vikasbammidi1340 3 месяца назад

    Can you please do a video on "How to use the same in Langchain with retrieval"

  • @ammvr
    @ammvr 3 месяца назад

    how can we deploy this to the web?

  • @elibessudo
    @elibessudo 3 месяца назад

    Thanks for sharing. Do you have any insight on how to incorporate the feature where the response includes citation URL's?

  • @harishankar7587
    @harishankar7587 3 месяца назад

    impressive slide show

  • @user-mk3qb3iv2i
    @user-mk3qb3iv2i 3 месяца назад

    why always python is there any way to use js?

  • @venkatathota633
    @venkatathota633 3 месяца назад

    could you please provide git repo for the above code?

  • @AIWithShrey
    @AIWithShrey 3 месяца назад

    Any reason why you chose BAAI and not any other Embedding model? What are the impacts of mix and matching the Embedding model and the LLM? My current app works just fine with GPT4ALL Embeddings, and Gemma 1.1 7B. Another note: Deploying a quantized LLM will significantly reduce VRAM usage, Gemma 7B Q8_0 quantized takes up 12 gigs of VRAM for me. Implementing KEDA and using Quantization in tandem will be a game-changer.

  • @goldendrake1387
    @goldendrake1387 3 месяца назад

    To whoever is experiencing problems with the following lines of code: URI='http:...' client = InferenceClient(model=URI) Do this instead: find a model that you want to use on huggingface, such as Mistral 7B, create an access token, save the model URL and use the following code: API_URL = "your_chosen_model_url" headers = {"Authorization": "Bearer your_created_token"} inference_client = InferenceClient(model=API_URL, headers=headers)

  • @sharathkumar8422
    @sharathkumar8422 3 месяца назад

    Concise and to the point. Superbly explain! Thank you...

  • @Hitish99999
    @Hitish99999 3 месяца назад

    Thanks for the tutorial. I am bit confused which file to be uploaded to bucket. sentence file or embedding file

  • @ashokvaddepally5181
    @ashokvaddepally5181 3 месяца назад

    This was a great close encounter and looking forward for the GenAI courses from Jani sir in collab with kode kloud way of doing 😃

  • @AhmedBesbes
    @AhmedBesbes 3 месяца назад

    Thanks for the tutorial! Instead of going through the ids in the json file to fetch the sentences, is it possible to integrate those directly as metadata in the index?

  • @sureshkumarselvaraj8911
    @sureshkumarselvaraj8911 3 месяца назад

    Great video! What is the difference between Vertex Search service VS Vector Search for RAG application? which one is better in terms of handling better retrieval of relevant documents for RAG application where we deal with 100+ PDF documents? Can you share some insights?

  • @sagarvaiyata9510
    @sagarvaiyata9510 3 месяца назад

    What is the confusion at 14:23? What are meta data files? What are these phrases in a realtime scenario? You are doing good but please give examples that are actually possible in realtime scenarios

  • @jagatchaitanyaprabhala8668
    @jagatchaitanyaprabhala8668 3 месяца назад

    is oscer-index file saved in s3?

    • @Janakirammsv
      @Janakirammsv 3 месяца назад

      No. It’s just an empty collection at this point. Watch part 1 on how to ingest the data from the CSV file into the collection.

    • @jagatchaitanyaprabhala8668
      @jagatchaitanyaprabhala8668 3 месяца назад

      @Janakirammsv yes found it after this comment 😊. What are the accesses/setups I need to ask my MLOps team to enable me start working on bedrock?

  • @ShirishBhagnani
    @ShirishBhagnani 3 месяца назад

    Thank you for putting all the content in a very simple and understandable way!

  • @sreehari514
    @sreehari514 3 месяца назад

    Source code please

  • @shraddha9640
    @shraddha9640 3 месяца назад

    is this on sagemaker?

    • @Janakirammsv
      @Janakirammsv 3 месяца назад

      No. This is a new service called Bedrock.

  • @sachinworld_
    @sachinworld_ 3 месяца назад

    Can we add two different documents in collection ?

    • @Janakirammsv
      @Janakirammsv 3 месяца назад

      Absolutely. The vector index doesn't know the source. You can add the details in the metadata of each item in the collection.

  • @TomFord-mv2mx
    @TomFord-mv2mx 4 месяца назад

    Great Video. One question, I noticed you used a different model (gecko) to Gemini Pro for the embeddings. Is this ok to do? I assumed the models needed to be the same for both training and inference? Thanks again

    • @Janakirammsv
      @Janakirammsv 3 месяца назад

      Text embedding models are independent of LLMs. You only have to ensure that the same embedding model is used for indexing the documents and the query. This is critical to retrieving the context based on the similarity.