What You Need to Know Before
You Start
Starts 6 June 2025 17:59
Ends 6 June 2025
00
days
00
hours
00
minutes
00
seconds
57 minutes
Optional upgrade avallable
Not Specified
Progress at your own speed
Conference Talk
Optional upgrade avallable
Overview
Explore techniques for managing massive data warehouses using Python and PostgreSQL, covering connection pooling, data replication, optimization, and efficient querying strategies.
Syllabus
- Introduction to Python and PostgreSQL
- Setting Up the Environment
- Connection Pooling in PostgreSQL
- Data Replication Strategies
- Data Warehousing Optimization
- Efficient Querying Strategies
- Advanced Python Techniques for Data Management
- Security and Maintenance
- Capstone Project
Overview of Python for Data Management
Overview of PostgreSQL in Data Warehousing
Installing and Configuring PostgreSQL
Setting Up Python Development Environment
Connecting Python to PostgreSQL
Introduction to Connection Pooling
Using Psycopg2 for Connection Pooling
Best Practices for Managing Connections
Understanding Postgres Replication
Configuring Streaming Replication
Logical Replication and Use Cases
Identifying and Designing Data Models
Indexing Strategies for Large Datasets
Partitioning Strategies
Vacuuming and Analyzing Databases
Query Planning and Execution
Writing Optimized SQL Queries
Using Explain and Analyze for Performance Tuning
CTEs and Window Functions
Using Pandas and NumPy with PostgreSQL
ETL Pipelines using Python
Integrating PostGIS for Geospatial Data
Implementing Database Security Practices
Regular Database Maintenance and Backups
Designing a Scalable Data Warehouse
Implementing Optimization Techniques
Presenting Findings and Solutions
Subjects
Conference Talks