Presented by:

Rudraksh Karpe

from open source contributor at openSUSE project

Rudraksh Karpe, a Google Summer of Code 2024 contributor at openSUSE, is passionate about AI/ML and the Cloud-Native space. He is determined to contribute to open-source software and give back to the community. During his early years, switching between operating systems ignited a spark of curiosity within him to explore technology that has continued to burn bright. He has been contributing to the Analytics Edge Ecosystem Project by openSUSE and Rancher, building ML solutions on the Edge for various business verticals.

No video of the event yet, sorry!

The exponentially growing AI/ML and LLMs advancements bring concerns about privacy, as there is a risk of data exposure to online LLMs service providers. Setting up LLMs in-house requires a high computational cost which is a major obstacle for businesses across various sectors such as Retail, Healthcare, Finance, etc. These industries seek to leverage the power of LLMs to drive profitability in their overall business while maintaining control over their data.

In this session, we will explore the Edge Ecosystem Analytics and its transformative potential in GenAI Applications. Through seamless orchestration via Rancher managed Kubernetes which can help individuals overcome challenges in adopting and deploying cutting edge Gen-AI applications at the edge.

Key Topics:

  • Overview of Large Language Models (LLMs)
  • Scope for Edge Computing in AI revolution
  • Benefits over privacy concerns by localization of LLMs
  • Real-world Application Showcase by leveraging GenAI for Edge Ecosystem Analytics
  • Integration of Retrieval Augmentation Generation (RAG) Pipeline into Rancher & K3s
  • Challenges while deploying GenAI applications at the Edge

This short talk will showcase a real-world GenAI-based application, highlighting the utilization of the RAG pipeline as well as a data modeling pipeline to continually improve analytic outputs and its seamless integration with Rancher and K3s. Attendees would learn about Rancher, K3s in managing Kubernetes deployments for GenAI applications, LLMs optimizations techniques such as RAG, overview of Fine Tuning and AI Agents.

Date:
2024 June 28 - 16:15
Duration:
30 min
Room:
Saal
Language:
Track:
New Technologies
Difficulty:
Medium

Happening at the same time:

  1. Branding Meetup: Next steps
  2. Start Time:
    2024 June 28 15:30

    Room:
    Seminar Room 2

  3. Uyuni Community Hours
  4. Start Time:
    2024 June 28 16:00

    Room:
    Seminar Room 1

  5. Pluggable CPU schedulers in openSUSE
  6. Start Time:
    2024 June 28 16:00

    Room:
    Gallerie

  7. warewulf - making cluster installations fast and reliable
  8. Start Time:
    2024 June 28 16:30

    Room:
    Gallerie