What we’re about
Machine Learning Meetup (MLMU) is a community of those interested in (not only) machine learning. Visit our regular meeting and get inspired and educated by interesting talks and people. Cutting edge methods, experience, tools, algorithms, applications and much more ...
More details at www.mlmu.cz ..
You can also contact as via email [email protected] .
Upcoming events (1)
See all- Why You Should [Not] Fine-Tune on Synthetic DataImpact Hub Brno, Brno-střed-Trnitá
Speaker:
Roman GrebennikovDescription
Custom task-specific LLMs offer significant benefits in terms of privacy (they can be run locally), costs (eliminating per-request API fees), and quality (optimized for your specific business problem). Building such a model with existing tools is straightforward—if you have enough training data. However, in practice, you often don't.In this talk, we'll share the story of how we built a synthetic training data generation tool for the open-source search engine Nixiesearch. We'll use the open ESCI dataset and explore how much we can improve search relevance with synthetic training data in a practical use case. Does this approach even work? Is it a viable low-cost alternative to proper fine-tuning on explicit labels? How much does the LLM prompt matter? We'll compare OpenAI, LLama3, and a custom-made model, discussing all the challenges and pitfalls we encountered during the project.This time we will not be streaming.
Program:
17:30 Welcome chat
18:00 Talk
18:50 Discussion
19:10 Networking (Impact Hub)About MLMUs:
Machine Learning Meetups (MLMU) is an independent platform for people interested in Machine Learning, Information Retrieval, Natural Language Processing, Computer Vision, Pattern Recognition, Data Journalism, Artificial Intelligence, Agent Systems and all the related topics. MLMU is a regular community meeting usually consisting of a talk, a discussion and subsequent networking. Except of Prague, MLMU also spread to Brno, Bratislava and Košice.