Dive into Data Meetup #11
Details
Hi there,
join us for for an evening focused on learning and innovation, featuring two engaging talks from industry experts!
đź“… When? 09/12/2024, 17:30
🏢 Where? SoftServe Office, Warsaw, Q22, al. Jana Pawła II 22, 11th floor
Save your seat now 👉 https://hubs.ly/Q02zDdB20
📊 What’s on the agenda?
17:30 – 17:50 Come and grab a drink!
18:00 – 18:10 Official Start
18:10 – 18:30 Let’s break the ice 🤝
18:30 – 19:00 Talk#1 "The DOs and DONt's of LLM Supervised Fine-Tuning" by Vladyslava Tyshchenko, Data Scientist
19:00 – 19:30 Networking and Pizza
19:30 – 20:00 Talk#2 "Apache Iceberg in modern Data Lake" by Michał Mroczek, Senior Big Data Software Engineer
Talk#1 “The DOs and DONt's of LLM Supervised Fine-Tuning" by Vladyslava Tyshchenko
In this talk, we will explore various peculiarities of supervised fine-tuning of LLM and the toolbox needed to accomplish it based on real examples successfully. We will cover the main components required to teach the LLM to follow instructions better, including the dataset, model selection, adapters, quantization, and more. If you are keen to learn how to create your own specialized LLM - join this session!
Talk#2 "Apache Iceberg in modern Data Lake" by Michał Mroczek
During the session, we'll explore Apache Iceberg, a high-performance table format for massive analytic datasets that enables ACID transactions, time travel, and seamless schema evolution in modern data lakes.
⚡ It is free of charge, but registration is required ⚡
Discover more about our speakers:
Vladyslava Tyshchenko - Vladyslava is a Senior Data Scientist at SoftServe. She has experience in building Machine Learning solutions and leading teams starting from business ideas to successfully deployed applications. Her knowledge spans across the field of Natural Language Processing back to the days when Transformer architecture was not yet invented. In her daily work she mostly builds LLM-based solutions for educational companies and institutions providing students with personalized educational experiences.
Michał Mroczek - Data and Cloud Engineer who transforms business challenges into scalable AWS solutions, specializing in data architecture and cloud infrastructure. With several years of hands-on experience, they excel in designing ETL pipelines, implementing and managing infrastructure as code. His expertise spans both technical implementation and team collaboration, consistently delivering data-driven solutions while maintaining robust security and governance standards.
Dive into Data Meetup #11