What You Need to Know Before
You Start

Starts 8 June 2025 14:05

Ends 8 June 2025

00 days
00 hours
00 minutes
00 seconds
course image

Crunching Your Data In Place With Open Source LLMs

Discover how to use open-source LLMs with Bacalhau to process data where it resides, eliminating the need to move data to third-party AI services. Perfect for data scientists and ML enthusiasts.
Open Data Science via YouTube

Open Data Science

2544 Courses


43 minutes

Optional upgrade avallable

Not Specified

Progress at your own speed

Free Video

Optional upgrade avallable

Overview

Discover how to use open-source LLMs with Bacalhau to process data where it resides, eliminating the need to move data to third-party AI services. Perfect for data scientists and ML enthusiasts.

Syllabus

  • Course Introduction
  • Overview of Open Source LLMs
    Introduction to Bacalhau
    Importance of In-Place Data Processing
  • Understanding Large Language Models (LLMs)
  • Structure and Functionality of LLMs
    Popular Open Source LLMs (e.g., GPT-Neo, BLOOM)
    Comparing Open Source and Proprietary LLMs
  • Overview of Bacalhau
  • What is Bacalhau?
    Core Features and Benefits
    Setting Up the Bacalhau Environment
  • Data Processing with Open Source LLMs
  • Data Preprocessing Techniques
    Loading and Configuring LLMs for In-Place Processing
    Managing Resources and Optimizing Performance
  • Integration of Bacalhau with LLMs
  • Running LLMs with Bacalhau
    Data Security and Privacy Ensuring
    Case Studies: Practical Applications
  • Hands-On Tutorials
  • Installing and Configuring Bacalhau
    Implementing LLMs for Specific Use Cases
    Troubleshooting and Common Pitfalls
  • Advanced Topics
  • Scaling LLM Operations
    Customizing LLMs for Specific Data Needs
    Future Trends in In-Place Data Processing
  • Project and Assignment
  • Real-World Application Project using Bacalhau and Open Source LLMs
    Peer Feedback and Presentation
  • Conclusion and Next Steps
  • Recap of Key Learnings
    Resources for Continued Learning
    Networking and Community Engagement Opportunities
  • Course Assessment
  • Written Assessment
    Practical Demonstration with Bacalhau and LLMs

Subjects

Data Science