Program of

/ Agenda


Day 1: Wednesday / 15 October

Copernicus Science Centre

Wybrzeże Kościuszkowskie 20, 00-390 Warsaw

9:00 - 12:00

Registration

(Also open after 12:00)
12:00 - 12:15 / Main Hall

Opening remarks

12:00 - 12:15 / Hall A

Opening remarks

13:20 - 15:00

Lunch

19:00 - 24:00

Conference Party

Bolek Pub & Restaurant, al. Niepodległości 211, 02-086 Warszawa
Remember to bring your badge and ID card with you; they are required to enter.

Day 2: Thursday / 16 October

Copernicus Science Centre

Wybrzeże Kościuszkowskie 20, 00-390 Warsaw

08:30 - 09:30

Registration + Breakfast

(Also open after 09:30)
12:15 - 13:45

Poster Session 1 + Coffee

with posters: 1-30
You can continue presenting your posters during lunch
13:00 - 14:30

Lunch

Day 3: Friday / 17 October

Copernicus Science Centre

Wybrzeże Kościuszkowskie 20, 00-390 Warsaw

08:30 - 09:30

Registration + Breakfast

(Also open after 09:30)
12:15 - 13:45

Poster Session 2 + Coffee

with posters: 31-61
You can continue presenting your posters during lunch
13:00 - 14:30

Lunch

17:10 - 17:40 / Main Hall

Closing remarks

Day 4: Saturday / 18 October

Faculty of Mathematics, Informatics and Mechanics, University of Warsaw

Stefana Banacha 2, 02-097 Warsaw

/ Invited talks

Sara Magliacane

University of Amsterdam

Invited talk 1: Causal representation learning in temporal settings with actions

Wednesday / 15 October 12:20 - 13:20 Hall A

Abstract:

Causal inference reasons about the effect of unseen interventions or external manipulations on a system. Similar to classic approaches to machine learning, it typically assumes that the causal variables of interest are given from the outset. However, real-world data often comprises high-dimensional, low-level observations (e.g., pixels in a video) and is thus usually not structured into such meaningful causal units. Causal representation learning aims at addressing this gap by learning high-level causal variables along with their causal relations directly from raw, unstructured data, e.g. images, videos or text. In this talk I will focus on learning causal representations from temporal sequences, e.g. sequences of images that capture the state of an environment. In particular I will describe some of our work in which we leverage perturbations of an underlying system, e.g. the effects of actions performed by an agent in an environment, to provably identify causal variables and their relations from high-dimensional observations up to component-wise transformations and permutations in an unsupervised way. This allows us to apply our methods to realistic simulated environments for embodied AI, in which an agent is performing actions in an environment for which it only receives unstructured high-dimensional observations. In this setting our methods learn a latent representation that allows us to identify individually each causal variable, e.g. the different attributes or states of each object in the environment, as well as learn their interactions and the interactions with the agent in the form of causal relations. By reverse engineering the underlying causal system directly from visual inputs and actions, we can then provide a potential first step towards AI systems that reason about the world causally without supervision.

Biography:

Sara Magliacane is an assistant professor in the Amsterdam Machine Learning Lab at the University of Amsterdam and an ELLIS Scholar in the Interactive Learning and Interventional Representations program. During Spring 2022, she visited the Simons Institute in Berkeley for a semester on Causality. The goal of her research is to find how causality can improve current machine learning (ML) algorithms, especially in terms of robustness, generalization across domains/tasks, and safety. Her research focuses on three directions: causal representation learning (i.e. learning causal factors from high-dimensional data, e.g. sequences of images), causal discovery (i.e. learning causal relations from data), and causality-inspired ML, e.g. how can ideas from causality help ML/RL adapt to new domains, nonstationarity and varying number of objects with different latent parameters, even when we cannot guarantee that we identified the true causal factors. Previously, she was a Research Scientist at MIT-IBM Watson AI lab and a postdoctoral researcher at IBM Research NY, working on methods to design experiments that would allow one to learn causal relations in a sample-efficient and intervention-efficient way. She received a PhD at VU Amsterdam on learning causal relations jointly from different experimental settings, even with latent confounders and small samples. During her PhD, she interned at Google Zürich and NYC. Previously, she studied Computer Engineering at Politecnico di Milano and Torino and at the University of Trieste.

Shreya Pathak

Google DeepMind

Invited talk 2: TBA

Wednesday / 15 October 12:20 - 13:20 Hall B

Abstract:

TBA

Biography:

Shreya Pathak is a research engineer at Google DeepMind, currently on the Gemma team. She works particularly on exploring different architectures for Gemma, optimising for on-device use cases. Prior to this, she had worked on multimodal understanding of video-language models. She graduated from IIT Bombay with a bachelor's in computer science and engineering.

Adel Bibi

University of Oxford / Kellogg College / SoftServe

Invited talk 3: TBA

Wednesday / 15 October 16:15 - 17:15 Hall A

Abstract:

TBA

Biography:

Adel Bibi is a senior researcher in machine learning and computer vision at the Department of Engineering Science of the University of Oxford, a Research Fellow (JRF) at Kellogg College, and a member of the ELLIS Society. Bibi is also an R&D Distinguished Advisor with SoftServe. He has worked with Philip H.S. Torr since October 2020, first as a postdoctoral researcher and then as a senior research associate. He received his MSc and PhD degrees from King Abdullah University of Science & Technology (KAUST) in 2016 and 2020, respectively, advised by Bernard Ghanem. Bibi was awarded the CRG grant by KAUST to work on robust deep learning, an Amazon Research Award in 2022 in the Machine Learning Algorithms and Theory track, the Google Gemma 2 Academic Award in 2024, and the Systemic AI Safety grant from the UK AI Security Institute in 2025. Bibi has received four best paper awards: at a NeurIPS 2023 workshop, an ICML 2023 workshop, a CVPR 2022 workshop, and the Optimization and Big Data Conference in 2018. His contributions include over 30 papers published in top machine learning and computer vision conferences. He has also received four outstanding reviewer awards (CVPR18, CVPR19, ICCV19, ICLR22) and a Notable Area Chair Award at NeurIPS 2023, and acts as a senior area chair for NeurIPS.

Francesco Locatello

ISTA

Invited talk 4: Bridging perception and causality with causal representations

Wednesday / 15 October 16:15 - 17:15 Hall B

Abstract:

TBA

Biography:

Francesco Locatello is a tenure-track assistant professor at the Institute of Science and Technology Austria (ISTA) and an AI resident at the Chan Zuckerberg Initiative. Before, he was a senior applied scientist at Amazon Web Services, leading the Causal Representation Learning team. He received his PhD from ETH Zürich co-advised by Gunnar Rätsch and Bernhard Schölkopf. His research received several awards, including the ICML 2019 Best Paper award, the Hector Foundation award for outstanding achievements in machine learning from the Heidelberg Academy of Science in 2023, and the Google Research Scholar Award in 2024.

Federico Tombari

Google / Technical University of Munich

Invited talk 5: Beyond the screen: capturing, understanding and generating 3D scenes

Thursday / 16 October 11:15 - 12:15 Hall A

Abstract:

TBA

Biography:

Federico Tombari is a Research Director at Google, where he leads an applied research team in Computer Vision and Machine Learning across North America and Europe. With his team, he has contributed Computer Vision and ML technology to Google products such as Lens, Maps, Android, ARCore, and Pixel. He is also a Lecturer (Privatdozent) at the Technical University of Munich (TUM). He has 300+ peer-reviewed publications in CV/ML and their applications to robotics, autonomous driving, healthcare, and augmented reality. He received his PhD from the University of Bologna and his Venia Legendi (Habilitation) from the Technical University of Munich (TUM). In 2018-19 he was co-founder and managing director of a startup on 3D perception for AR and robotics, which was later acquired by Google.

Gitta Kutyniok

DLR / University of Tromsø / LMU Munich

Invited talk 6: Reliable and Sustainable AI: From Mathematical Foundations to Next Generation AI Computing

Thursday / 16 October 11:15 - 12:15 Hall B

Abstract:

The current wave of artificial intelligence is transforming industry, society, and the sciences at an unprecedented pace. Yet, despite its remarkable progress, today’s AI still suffers from two major limitations: a lack of reliability and excessive energy consumption. This lecture will begin with an overview of this dynamic field, focusing first on reliability. We will present recent theoretical advances in the areas of generalization and explainability, which are core aspects of trustworthy AI that also intersect with regulatory frameworks such as the EU AI Act. From there, we will explore fundamental limitations of existing AI systems, including challenges related to computability and the energy inefficiency of current digital hardware. These challenges highlight the pressing need to rethink the foundations of AI computing. In the second part of the talk, we will turn to neuromorphic computing: a promising and rapidly evolving paradigm that emulates biological neural systems using analog hardware. We will introduce spiking neural networks, a key model in this area, and share some of our recent mathematical findings. These results point toward a new generation of AI systems that are not only provably reliable but also sustainable.

Biography:

Gitta Kutyniok currently holds a Bavarian AI Chair for Mathematical Foundations of Artificial Intelligence at the Ludwig-Maximilians-Universität München, and is in addition affiliated with the German Aerospace Center (DLR) and the University of Tromsø. Her research covers the areas of applied and computational harmonic analysis, artificial intelligence, compressed sensing, deep learning, imaging sciences, inverse problems, and applications to life sciences, robotics, and telecommunication.

Mihaela van der Schaar

University of Cambridge

Invited talk 7: Unleashing Creativity using AI Agent Networks

Thursday / 16 October 17:30 - 18:30 Main Hall

Abstract:

TBA

Biography:

Mihaela van der Schaar is the John Humphrey Plummer Professor of Machine Learning, Artificial Intelligence and Medicine at the University of Cambridge. In addition to leading the van der Schaar Lab, Mihaela is founder and director of the Cambridge Centre for AI in Medicine (CCAIM). Mihaela was elected IEEE Fellow in 2009 and Fellow of the Royal Society in 2024. She has received numerous awards, including the Johann Anton Merck Award (2024), the Oon Prize on Preventative Medicine from the University of Cambridge (2018), a National Science Foundation CAREER Award (2004), 3 IBM Faculty Awards, the IBM Exploratory Stream Analytics Innovation Award, the Philips Make a Difference Award and several best paper awards, including the IEEE Darlington Award. She was a Turing Fellow at The Alan Turing Institute in London between 2016 and 2024. In 2025, she was appointed as Spinoza Guest Professor at Amsterdam University Medical Center. Mihaela is personally credited as inventor on 35 US patents, many of which are still frequently cited and adopted in standards. She has made over 45 contributions to international standards, for which she received 3 ISO Awards. In 2019, a Nesta report determined that Mihaela was the most-cited female AI researcher in the UK.

Antonio Orvieto

ELLIS Institute Tübingen

Invited talk 8: Training LLMs: Do We Understand Our Optimizers?

Thursday / 16 October 17:30 - 18:30 Hall A

Abstract:

Why does Adam so consistently outperform SGD when training Transformer language models? Despite many proposed explanations, this optimizer gap is still not fully understood. In this talk, we will present results from two complementary studies. First, using over 2000 language model training runs, we compare Adam with simplified variants such as signed gradient and signed momentum. We find that while signed momentum is faster than SGD, it still lags behind Adam; however, we crucially notice that constraining Adam’s momentum parameters to be equal (beta1 = beta2) retains near-optimal performance. This is of great practical importance and also reveals a new insight: Adam in this form has a robust statistical interpretation and a clear link to mollified sign descent. Second, through carefully tuned comparisons of SGD with momentum and Adam, we show that SGD can actually match Adam in small-batch training, but loses ground as batch size grows. Analyzing both Transformer experiments and quadratic models with stochastic differential equations, we shed new light on the role of batch size in shaping training dynamics.
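
To make the tied-momentum observation concrete, here is a minimal illustrative sketch (not the speaker's code; the learning rate, beta value, and toy loss are assumptions) of an Adam-style update with beta1 = beta2. With both moments sharing one decay rate, the update direction m_hat / (sqrt(v_hat) + eps) behaves like a smoothed sign of the gradient, which is the link to mollified sign descent mentioned in the abstract.

```python
import numpy as np

def adam_tied_betas_step(param, grad, m, v, t, lr=1e-3, beta=0.95, eps=1e-8):
    """One illustrative Adam-style step with tied momentum parameters (beta1 = beta2 = beta)."""
    m = beta * m + (1 - beta) * grad        # first moment: running mean of gradients
    v = beta * v + (1 - beta) * grad ** 2   # second moment: running mean of squared gradients
    m_hat = m / (1 - beta ** t)             # bias correction
    v_hat = v / (1 - beta ** t)
    param = param - lr * m_hat / (np.sqrt(v_hat) + eps)  # smoothed sign-like update
    return param, m, v

# Toy usage: a few steps on the quadratic loss 0.5 * ||x||^2, whose gradient is x itself.
x = np.array([1.0, -2.0, 3.0])
m, v = np.zeros_like(x), np.zeros_like(x)
for t in range(1, 6):
    x, m, v = adam_tied_betas_step(x, grad=x, m=m, v=v, t=t)
print(x)
```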

Biography:

Antonio studied Control Engineering in Italy and Switzerland. He holds a PhD in Computer Science from ETH Zürich and spent time at DeepMind (UK), Meta (US), MILA (CA), INRIA (FR), and HILTI (LI). He is currently a Hector Endowed Fellow and Principal Investigator (PI) at the ELLIS Institute Tübingen and an Independent Group Leader at the MPI for Intelligent Systems, where he leads the Deep Models and Optimization group. He received the ETH medal for outstanding doctoral theses and the Schmidt Sciences AI2050 Early Career Fellowship. In his research, Antonio strives to improve the efficiency of deep learning technologies by pioneering new architectures and training techniques grounded in theoretical knowledge. His work encompasses two main areas: understanding the intricacies of large-scale optimization dynamics and designing innovative architectures and powerful optimizers capable of handling complex data. Central to his studies is exploring innovative techniques for decoding patterns in sequential data, with implications in biology, neuroscience, natural language processing, and music generation.

Sander Dieleman

Google DeepMind

Invited talk 9: TBA

Thursday / 16 October 17:30 - 18:30 Hall B

Abstract:

TBA

Biography:

Sander Dieleman is a Research Scientist at Google DeepMind in London, UK, where he has worked on the development of AlphaGo, WaveNet, Imagen 4, Veo 3, and more. He obtained his PhD from Ghent University in 2016. His current research interests include representation learning and generative modelling of audio, images and video.

Jenia Jitsev

LAION / Juelich Supercomputing Centre / ELLIS

Invited talk 10: TBA

Friday / 17 October 11:15 - 12:15 Hall A

Abstract:

TBA

Biography:

Jenia Jitsev is co-founder and scientific lead of LAION e.V., the German non-profit research organization committed to research on open large-scale foundation models and datasets. He also leads the Scalable Learning & Multi-Purpose AI (SLAMPAI) lab at the Juelich Supercomputing Centre, Research Center Juelich, Helmholtz Association, Germany, and is a member of ELLIS. His background is in machine learning and neuroscience, aiming to understand learning as a generic process of incrementally building up a useful model of the surrounding world from available sensory observations and executed actions. His current research focus is on using scaling laws for measuring and understanding generalization and strong transfer in open foundation models. Jenia is best known for his work on open language-vision foundation models like openCLIP and open datasets like LAION-400M/5B, Re-LAION, and DataComp. Recently, he has also been studying reasoning and measuring generalization, with work on the open reasoning datasets/models OpenThoughts/OpenThinker and on discovering generalization weaknesses using AIW problems. Jenia coordinates the acquisition of large-scale compute grants for conducting collaborative research on open foundation models across various supercomputing facilities, including EuroHPC. Using these resources, together with the community, he is driving and democratizing research on scalable systems for generalist, transferable multi-modal learning, leading to foundation AI models capable of strong transfer and therefore easily adaptable to a broad range of desired tasks and hardware resource settings. For his work, Dr. Jitsev received the Best Paper Award at IJCNN 2012, the Outstanding Paper Award at NeurIPS 2022, and the Falling Walls Scientific Breakthrough of the Year 2023 Award.

Alexey Dosovitskiy

Inceptive

Invited talk 11: From pixels to nucleotides

Friday / 17 October 11:15 - 12:15 Hall B

Abstract:

TBA

Biography:

Alexey Dosovitskiy is a distinguished researcher in computer vision and machine learning. He earned his MSc and PhD in mathematics from Moscow State University in 2009 and 2012, respectively. From 2013 to 2015, he was a postdoctoral researcher at the University of Freiburg’s Computer Vision Group under Prof. Thomas Brox, focusing on deep learning applications in unsupervised learning, image generation, and motion estimation. Between 2017 and 2019, he served as a research scientist at Intel Labs in Munich, Germany, working on deep learning for computer vision and robotics. In 2019, Dosovitskiy joined Google Research, where he played a pivotal role in applying transformer architectures to computer vision tasks, notably as a lead author of the influential paper “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale,” which introduced the Vision Transformer (ViT) model. His research interests include artificial intelligence, machine learning, and pattern recognition, with significant contributions to areas such as optical flow estimation, image generation, and object detection. In February 2024, Dosovitskiy joined Inceptive as a Member of Technical Staff, focusing on machine learning for RNA.

Johannes Brandstetter

Johannes Kepler University / Emmi AI

Invited talk 12: What’s the next wave of disruption in science and engineering?

Friday / 17 October 16:00 - 17:00 Main Hall

Abstract:

In the era of LLMs, one is frequently confronted with the question of where we stand with the applicability of large-scale deep learning models within scientific or engineering domains. The discussion starts by revisiting recent triumphs in weather and climate modeling, making connections to computer vision, physics-informed learning, and neural operators. Second, we discuss breakthroughs in multi-physics modeling, computational fluid dynamics, and related fields, putting an emphasis on what it takes to build reference models for whole industry verticals. We relate those breakthroughs to advancements in engineering and much faster process cycles.

Biography:

Johannes Brandstetter leads the "AI for Data-Driven Simulations" group at the Institute for Machine Learning, JKU - Johannes Kepler Universität Linz, with the aim of advancing data-driven simulations at industry scale. Additionally, he is a Co-founder and Chief Scientist at Emmi AI — a push towards the data-driven revolution in science and engineering.

Bartłomiej Papież

University of Oxford

Invited talk 13: TBA

Friday / 17 October 16:00 - 17:00 Hall A

Abstract:

TBA

Biography:

Bartek Papież leads multidisciplinary research at the intersection of artificial intelligence, biomedical imaging, and health data science. As Principal Investigator and Group Lead of the Machine Learning & Biomedical Data Research Lab at Oxford’s Big Data Institute, his work bridges the theoretical and applied dimensions of AI and machine learning. His research spans the development of novel algorithms in image analysis, data fusion, optimization, and robustness and fairness. A core focus of his lab is the integration of imaging with non-imaging modalities, including genetic data, electronic health records, and natural language, driving forward impactful applications in medicine, biology, and population health. Papież’s projects address key challenges in longitudinal disease monitoring, multimodal cancer imaging, radiogenomics, and the discovery of therapeutic targets. By combining cutting-edge ML techniques with real-world biomedical data, his research aims to enhance disease understanding, early diagnosis, and precision treatment.

Herke van Hoof

University of Amsterdam

Invited talk 14: Modular learning for improving AI assistants

Friday / 17 October 16:00 - 17:00 Hall B

Abstract:

Recent years have seen massive success in AI, from generative modelling to deep reinforcement learning. However, success has mostly been limited to domains where data is cheap and plentiful, or where models can be pre-trained on massive data sets. This excludes many domains of practical importance, such as tasks involving scientific data, real-world infrastructure, or robotics. In this talk, I will advocate for a modular approach, where complex behaviour is composed out of simpler elements. I will give examples of three recent projects, where a modular approach was adopted to increase generalisation, data efficiency, and/or instructability of artificial agents. Concluding, I will give my outlook on future developments of AI systems according to these principles, laying the foundations for more capable AI assistants.

Biography:

Herke van Hoof is currently an associate professor at the University of Amsterdam in the Netherlands, where he is part of the Amlab. He is interested in modular reinforcement learning. Reinforcement learning is a very general framework, but this tends to result in extremely data-hungry algorithms. Exploiting modular structures, including hierarchical structures, allows sharing information between tasks and exploiting prior knowledge, to learn more with less data. Before joining the University of Amsterdam, Herke van Hoof was a postdoc at McGill University in Montreal, Canada, where he worked with Professors Joelle Pineau, Dave Meger, and Gregory Dudek. He obtained his PhD at TU Darmstadt, Germany, under the supervision of Professor Jan Peters, where he graduated in November 2016. Herke received his bachelor's and master's degrees in Artificial Intelligence at the University of Groningen in the Netherlands.

/ Discussion Panels

Discussion Panel 1: PL in ML: Polish View on Machine Learning

Wednesday / 15 October 12:20 - 13:20 Main Hall

Unlike all other events during the conference, this panel will be conducted in Polish.

The “PL in ML: Polish View on Machine Learning” panel will take a closer look at the state of machine learning in Poland—what’s working, what’s not, and where we go from here. We will discuss key institutional issues and explore how ML research and development should be conducted in Poland—while asking the crucial question: is there hope for a thriving ML ecosystem here? Our conversation will cover pressing topics, including Poland’s position on the U.S. export priority list for AI chips, the role of new government-supported research initiatives, and the broader policy landscape shaping AI development. While we acknowledge the challenges, our goal is to foster a constructive dialogue that highlights opportunities and the potential for growth in the Polish ML community.

Moderators: Franciszek Budrowski and Maja Jabłońska

Piotr Sankowski

IDEAS
University of Warsaw

Piotr Sankowski is a Polish computer scientist. In 2005, he received a doctorate in computer science, and in 2009, a habilitation in computer science and a doctorate in physics. He completed post-doctoral internships at ETH Zurich and at “Sapienza” University in Rome. He is the author of 100 publications in computer science and 20 in physics. His research achievements include several important contributions to optimization and algorithms. He is the first Pole to have received four European Research Council grants: a Starting Grant (2010), a Proof of Concept Grant (2015), a Consolidator Grant (2017) and a Proof of Concept Grant (2023). In the years 2021-2024, he was CEO of IDEAS NCBR, a new research and development center operating in the field of artificial intelligence and digital economy. He is currently acting director of the Research Institute IDEAS. In 2015-2021, he was CEO of the spin-out company MIM Solutions (http://mim-solutions.pl), where he currently acts as Chief Scientific Advisor. This spin-out was created as a result of his first ERC PoC project and aims at commercializing algorithmic market modeling. In 2021, the company started to work on two of its own femtech products, which are commercialized under the MIM Fertility brand. Furthermore, MIM Solutions was included in the Deloitte Technology Fast 50 CE and EMEA Technology Fast 500 winners. Finally, since 2023, MIM Solutions has served on the Business Advisory Board for the EU Global Gateway strategy.

Pamela Krzypkowska

Ministerstwo Cyfryzacji
(Ministry of Digital Affairs)

Pamela Krzypkowska is a digitalisation specialist with extensive experience in artificial intelligence and emerging technologies. She currently serves as Director of the Department of Research and Innovation at the Ministry of Digital Affairs, where she leads Poland’s responsible digitalisation strategy in the AI era. Previously, she worked as an AI Cloud Solution Architect at Microsoft, where she was responsible for leading flagship AI projects for the company’s largest clients in Poland. Her work at Microsoft covered a broad range of activities, from MLOps (AI model lifecycle management) to model development and implementation, including work with generative models. In addition to her professional work, Pamela is actively involved in education. She lectures at Kozminski University and the Warsaw University of Technology, regularly sharing her knowledge and experience.

Marek Magryś

Cyfronet

Marek Magryś is Acting Director of the Academic Computer Centre Cyfronet AGH and head of the National Centre of Computing Competence within EuroHPC, a specialist in the design, implementation, testing and operation of computing systems for HPC (High Performance Computing) and AI (Artificial Intelligence). He is the chief architect of the fastest Polish supercomputers of recent years: Helios, Athena and Prometheus. He is an expert for the European Commission, a member of the INFRAG advisory group for EuroHPC, and a member of the HPC-AI Leadership Organization EMEA Advisory Committee.

Discussion Panel 2: Open models, open data

Thursday / 16 October 11:15 - 12:15 Main Hall

The “Open Models, Open Data” panel will explore the rationale and methodologies for openly developing and publishing AI models and datasets. We will examine the spectrum of openness—ranging from accessible weights to fully open code and data, including varying license implications—and discuss best practices for responsible public release. Key considerations will include the necessary scope and rigor of pre-release evaluation, validation, and documentation to ensure quality, reliability, and ethical standards.

Moderators: Emilia Wiśnios and Dima Zhylko

Michał Gdak

Snowflake

Michał Gdak leads pioneering AI/ML and software engineering at Snowflake. As Director of Engineering and Site Lead for the Warsaw office, he drives multidisciplinary teams and strategically contributes to global AI growth. His decade-plus career bridges theoretical AI advancements with real-world applications. Michał's work spans novel algorithms in multimodal data processing, robust information extraction, and AI optimization for enterprise. He was instrumental in TILT, a multimodal transformer that set a global benchmark for complex document data processing, showcasing his ability to translate deep technical understanding into market-leading products. At Snowflake, he leads engineering teams on key products like Document AI and Cortex Functions, enabling data-driven AI at scale. His leadership combines cutting-edge ML with practical engineering, enhancing enterprise data understanding and delivering precision AI solutions globally. Michał is passionate about building scalable, practical AI systems that turn complex data into real business value.

Marianna Nezhurina

LAION

Marianna Nezhurina is a core researcher at LAION and a scientific staff member at the Juelich Supercomputing Centre (JSC). Her research focuses on large-scale foundation models and datasets, and on the scalability and generalization of foundation models. She has participated in several open-source efforts such as DataComp-LM, LAION CLAP and BigScience BLOOM. Most recently, she took part in OpenThoughts (open reasoning datasets and models) as a core contributor.

Marek Kozłowski

National Information Processing Institute

Marek Kozłowski is the Head of the AI Lab at the National Information Processing Institute in Warsaw, Poland, where he leads a team of researchers who work on AI models and intelligent services based on them, especially in the NLP and CV areas. He is passionate about natural language processing, computer vision, and machine learning. He has written 50 scientific publications concerning machine learning and natural language processing. Marek has participated in many public projects concerning national LLMs (e.g., PLLuM), European projects such as LLMs4EU, and commercial research projects for the private sector, including for Samsung, France Telecom, Orange Labs, Millward Brown, Vive Textile Recycling, PKO BP and Connectis.

Discussion Panel 3: AI in Security

Friday / 17 October 11:15 - 12:15 Main Hall

The “AI in Security” panel will focus on the intersection of AI and cybersecurity. We aim to explore topics such as the role of AI in threat detection, vulnerability assessment, and incident response, as well as the ethical and societal implications of deploying AI in security systems. Additionally, we will discuss the potential risks associated with AI-driven cybersecurity solutions, including adversarial attacks and data privacy concerns.

Moderators: Alicja Grochocka-Dorocińska and Maciej Chrabąszcz

Gerhard Wunder

Freie Universität Berlin

Gerhard Wunder is a professor of Cybersecurity and AI at the Freie Universität Berlin, a leading excellence university in Germany, supported by the Deutsche Bundesdruckerei GmbH. His main research areas include LLMs, AI explainability and reasoning, and quantum algorithms and communications, including post-quantum cryptography. He has been a visiting professor at the Georgia Institute of Technology (Prof. Jayant) in Atlanta, GA, USA, and at Stanford University (Prof. Paulraj) in Palo Alto, CA, USA. Gerhard Wunder is a distinguished DFG Heisenberg Fellow, and he was nominated together with Dr. Müller (BOSCH Stuttgart) and Prof. Paar (Ruhr University Bochum) for the Deutscher Zukunftspreis 2017 for his work in Physical Layer Security. Since 2025, he has also led the AI department at the Fraunhofer Institute for Applied and Integrated Security (AISEC) in Munich.

Maura Pintor

PRA Lab

Maura Pintor is an Assistant Professor at the PRA Lab, in the Department of Electrical and Electronic Engineering of the University of Cagliari, Italy. She received her PhD in Electronic and Computer Engineering from the University of Cagliari in 2022. Her research interests mostly focus on optimizing and debugging adversarial robustness evaluations. She was a visiting student at the University of Tuebingen, Germany, from March to June 2020, at the Software Competence Center Hagenberg (SCCH), Austria, from May to August 2021, and at the Computer Vision Center (CVC) of the Universitat Autònoma de Barcelona (UAB) from July to October 2024. She is an area chair for NeurIPS, an Associate Editor for Pattern Recognition, and a reviewer for ACM CCS, ECCV, ICLR, ICCV, and for the journals IEEE TIFS, IEEE TIP, IEEE TDSC, IEEE TNNLS, and TOPS. She is co-chair of the ACM Workshop on Artificial Intelligence and Security (AISec), co-located with ACM CCS.

Jakub Kałużny

Snowflake

Jakub leads AI Security efforts at Snowflake as a Senior Manager in the Product Security organization, where he manages a portfolio of AppSec services. Before Snowflake, he managed pentesting programs and implemented threat modeling processes in various enterprises across Australia and Poland. He has spoken at international IT security conferences, including OWASP Global AppSec, Black Hat, Hack In The Box, and CONFidence.

/ Contributed talks

Adrian Łańcucki

NVIDIA

Contributed talk 1: Learning Dynamic Segmentation and Compression of Sequences in Transformer LLMs

Thursday / 16 October 9:30 - 10:00 Main Hall (CfC Session 1)

Abstract:

Transformer-based LLMs excel at language tasks, but their efficiency hinges on input sequence length. Typically, input resolution—imposed by a tokenizer—remains unchanged across all layers. In this talk, we introduce methods that enable end-to-end learning to dynamically pool, compress, or sparsify input or key-value token sequences. Our adaptive pooling methods enable training character-level models that internally construct and operate on word-like segments. Additionally, these methods allow us to track down and remove redundancies, resulting in substantial performance gains during training or inference. Finally, we arrive at a surprisingly practical method—Dynamic Memory Sparsification—that not only significantly improves latency and throughput by compressing the KV cache but also boosts accuracy, as demonstrated across several reasoning tasks.

Biography:

Adrian Łańcucki is a senior engineer at NVIDIA, where he optimizes the performance of LLMs and conducts research on representation learning, unsupervised segmentation, and generative modeling for text and speech. He is the author of FastPitch, a widely adopted text-to-speech model that significantly accelerated speech synthesis research. In 2019, Adrian obtained a Ph.D. in machine learning from the University of Wroclaw, Poland, and he has since actively collaborated with academia.

Łukasz Borchmann

Snowflake

Co-authors:

Michał Pietruszka, Wojciech Jaśkowski, Dawid Jurkiewicz, Piotr Halama, Paweł Józiak, Łukasz Garncarek, Paweł Liskowski, et al.

Contributed talk 2: State-of-the-Art Document AI on a Single 24GB GPU

Thursday / 16 October 10:00 - 10:30 Main Hall (CfC Session 1)

Abstract:

A vast portion of workloads employing LLMs involves answering questions grounded in PDF or scanned content. We introduce Arctic-TILT, which achieves accuracy on par with models 1000× its size on these use cases. It can be fine-tuned and deployed on a single 24GB GPU, lowering operational costs while processing rich documents with up to 400k tokens. The model establishes state-of-the-art results on seven diverse Document Understanding benchmarks, and provides reliable confidence scores and quick inference, essential for processing files in large-scale or time-sensitive enterprise environments. We release the Arctic-TILT weights and an efficient vLLM-based implementation under a permissive license.

Biography:

Lukasz is a researcher specializing in natural language processing and document understanding. With a strong background in the industry and several international competition wins, he has contributed to the advancement of language modeling, particularly in multimodal models incorporating visual and layout features alongside textual information. He came to Snowflake as part of the Applica.ai acquisition and was recently involved in developing Snowflake Arctic and Arctic-TILT LLMs. His PhD thesis focused on neural network architectures shifting paradigms toward what is now called generative AI.

Jakub Krajewski

University of Warsaw

Co-authors:

Marcin Chochowski, Daniel Korzekwa

Contributed talk 3: Scaling Fine-Grained MoE Beyond 50B Parameters: Empirical Evaluation and Practical Insights

Thursday / 16 October 10:30 - 11:00 Main Hall (CfC Session 1)

Abstract:

Mixture of Experts (MoE) architectures have emerged as pivotal for scaling Large Language Models (LLMs) efficiently. In a previous paper authored by me and other researchers at IDEAS NCBR, we proposed the granularity hyperparameter and derived Scaling Laws for Fine-Grained MoE. Since then, fine-grained MoE approaches - utilizing more numerous, smaller experts - have demonstrated potential in improving model convergence and quality. This work, done during an internship at NVIDIA Warsaw with Marcin Chochowski and Daniel Korzekwa, proposes a set of training recipes and provides a comprehensive empirical evaluation of fine-grained MoE, directly comparing its scaling properties against standard MoE configurations for models with up to 56B total (17B active) parameters. We investigate convergence speed, model performance on downstream benchmarks, and practical training considerations across various setups. Overall, at the largest scale we show that fine-grained MoE achieves better validation loss and higher accuracy across a set of downstream benchmarks. This study offers empirical grounding and practical insights for leveraging fine-grained MoE in the development of future large-scale models.
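
For readers unfamiliar with the granularity knob, the sketch below (an illustrative PyTorch toy, not the authors' implementation; all sizes are assumptions) shows the basic idea: a granularity of G shrinks each expert by a factor of G while multiplying the number of experts and the routing top-k by G, so the active parameter count stays roughly constant while routing becomes finer-grained.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FineGrainedMoE(nn.Module):
    """Toy token-choice MoE layer with a granularity knob (illustrative only)."""
    def __init__(self, d_model=256, d_ff=1024, n_experts=8, top_k=2, granularity=4):
        super().__init__()
        self.n_experts = n_experts * granularity     # more experts ...
        self.top_k = top_k * granularity             # ... more of them active per token ...
        d_expert = d_ff // granularity               # ... but each one is smaller
        self.router = nn.Linear(d_model, self.n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_expert), nn.GELU(), nn.Linear(d_expert, d_model))
            for _ in range(self.n_experts)
        )

    def forward(self, x):                            # x: (tokens, d_model)
        weights, idx = self.router(x).topk(self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)
        out = torch.zeros_like(x)
        for t in range(x.shape[0]):                  # naive per-token dispatch, for clarity
            for slot in range(self.top_k):
                expert = self.experts[idx[t, slot].item()]
                out[t] += weights[t, slot] * expert(x[t])
        return out

print(FineGrainedMoE()(torch.randn(4, 256)).shape)   # torch.Size([4, 256])
```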

Biography:

I'm a PhD student working on LLM efficiency, scaling laws and Mixture of Experts.

Adam Pardyl

Jagiellonian University; IDEAS NCBR

Co-authors:

Dominik Matuszek, Mateusz Przebieracz, Marek Cygan, Bartosz Zieliński, Maciej Wołczyk

Contributed talk 4: FlySearch: Exploring how vision-language models explore

Thursday / 16 October 9:30 - 10:00 Hall A (CfC Session 2)

Abstract:

The real world is messy and unstructured. Uncovering critical information often requires active, goal-driven exploration. It remains to be seen whether Vision-Language Models (VLMs), which recently emerged as a popular zero-shot tool in many difficult tasks, can operate effectively in such conditions. In this paper, we answer this question by introducing FlySearch, a 3D, outdoor, photorealistic environment for searching and navigating to objects in complex scenes. We define three sets of scenarios with varying difficulty and observe that state-of-the-art VLMs cannot reliably solve even the simplest exploration tasks, with the gap to human performance increasing as the tasks get harder. We identify a set of central causes, ranging from vision hallucination, through context misunderstanding, to task planning failures, and we show that some of them can be addressed by finetuning. We publicly release the benchmark, scenarios, and the underlying codebase.

Biography:

Adam Pardyl is a PhD candidate in the Group of Machine Learning Research at Jagiellonian University and a researcher at IDEAS NCBR. His research interests include computer vision for embodied AI, reinforcement learning, and MLLMs for robotics.

Gracjan Góral

University of Warsaw

Co-authors:

Emilia Wiśnios, Paweł Budzianowski

Contributed talk 5: How Good Are Open-Source Models for Robot Learning?

Thursday / 16 October 10:00 - 10:30 Hall A (CfC Session 2)

Abstract:

The reliance on large, curated datasets is a primary obstacle to progress in robotics. Generative Value Learning (GVL) offers a promising direction by using vision-language models (VLMs) to estimate task progress for self-supervised learning. However, the use of closed-source models limits widespread research and deployment. In this work, we investigate the viability of open-source VLMs as a foundation for GVL. Our empirical evaluation reveals that while a discernible performance gap to leading proprietary models exists, current open-source alternatives provide a promising and accessible foundation for this task. To build on this foundation, we demonstrate how to fine-tune these VLMs natively on the GVL task, which improves their capacity for physical reasoning. Finally, to standardize evaluation, we propose a new benchmark suite covering a diverse set of manipulation scenarios.

Biography:

Former math student, now exploring the boundaries between mathematics, artificial intelligence, and language. My work focuses on language models and their ability to reason, reflect, and sometimes hallucinate. I am fascinated by the intersection of psychology and AI. In particular, I apply psychological frameworks and experimental paradigms to study, challenge, and sometimes surprise artificial models. My research is rarely done alone – I share my home office with five cats ("the demons"), who are convinced every keyboard was made for them.

Mateusz Wyszyński

University of Warsaw

Co-authors:

Marek Cygan, Piotr Zalewski

Contributed talk 6: Shaping Robotic Actions with Fourier Flow Matching

Thursday / 16 October 10:30 - 11:00 Hall A (CfC Session 2)

Abstract:

We introduce a Fourier-based, asynchronous flow matching approach for Vision–Language–Action (VLA) models, enabling the policy to reason about action trajectories effectively. Classical VLAs predict action chunks directly in the action space. We instead represent trajectories with a Discrete Cosine Transform (DCT) and perform flow matching in the Fourier domain. Crucially, we design an asynchronous plan–execute scheme tailored to this representation: the robot continues executing while the next coefficient vector is inferred, improving responsiveness.
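
As a small illustration of the trajectory representation described above (a sketch under assumed details, not the authors' code), an action chunk can be encoded as a handful of low-frequency DCT coefficients and decoded back; the flow matching described in the abstract would then operate on such coefficient vectors rather than on raw action sequences.

```python
import numpy as np
from scipy.fft import dct, idct

def to_dct_coeffs(trajectory, keep=8):
    """Encode a trajectory of shape (T, action_dim) as its first `keep` DCT coefficients."""
    coeffs = dct(trajectory, axis=0, norm="ortho")   # transform along the time axis
    return coeffs[:keep]                             # keep only low-frequency components

def from_dct_coeffs(coeffs, horizon):
    """Reconstruct a length-`horizon` trajectory from truncated DCT coefficients."""
    padded = np.zeros((horizon, coeffs.shape[1]))
    padded[: coeffs.shape[0]] = coeffs
    return idct(padded, axis=0, norm="ortho")

# Toy example: a smooth 2-DoF trajectory of 32 steps compressed to 8 coefficients per dimension.
t = np.linspace(0.0, 1.0, 32)
traj = np.stack([np.sin(2 * np.pi * t), np.cos(np.pi * t)], axis=1)
recon = from_dct_coeffs(to_dct_coeffs(traj, keep=8), horizon=32)
print(np.abs(traj - recon).max())  # small error: smooth trajectories are well captured
```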

Biography:

PhD candidate in Computer Science at the University of Warsaw and a Research Engineer at Nomagic. My research primarily focuses on developing innovative methods for training Generalist Robot Policies (GRPs), aiming to equip robots with flexible, generalizable skills. I hold a Master's degree in Mathematics from the University of Warsaw and I spent half of my master's studies at École Polytechnique Fédérale de Lausanne (EPFL).

Konrad Staniszewski

NVIDIA, University of Warsaw

Contributed talk 7: Cache Me If You Can: Reducing Model Size and KV Cache Traffic for Faster LLM Inference

Thursday / 16 October 9:30 - 10:00 Hall B (CfC Session 3)

Abstract:

Large language models (LLMs) acquire impressive multi-step reasoning abilities. However, deploying them efficiently remains a significant engineering challenge, especially in chat interfaces, where iterative refinement leads to even more demanding KV cache management. The KV cache can easily consume several gigabytes; keeping it on-chip ensures low-latency responses but wastes valuable memory during user turns. In this talk, we present two highly practical methods: one for pruning model parameters and attention heads to accelerate generation and shrink the KV cache, and another for rapid compression and decompression of idle KV caches. First, we show that attention heads exhibit task-specific activation patterns, a property that can be leveraged to create streamlined, domain- or problem-specific model variants. Then we demonstrate that the same algorithm, which we designed for careful pruning of attention heads, can be applied to ordinary parameter pruning, reaching results close to the state of the art. Finally, we introduce a novel transform coder for KV cache compression, designed for fast compression and decompression directly on a GPU. It achieves up to 20x compression with negligible accuracy loss on demanding tasks, and in some cases compression as high as 80x. Inspired by classic media codecs, our method consists of linear feature decorrelation, quantization, and entropy-based compression. In multi-turn dialogue scenarios, it enables rapid offloading of the KV cache to external storage and later recovery. Our work reveals a strong low-rank structure in the KV cache and provides both practical tools and theoretical insights toward more efficient, scalable human–LLM interaction.
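
The codec-style pipeline mentioned at the end (decorrelation, quantization, entropy coding) can be illustrated with a toy numpy sketch. This is only schematic: the PCA-style rotation and zlib below stand in for the talk's learned transform and GPU entropy coder, and all sizes are assumptions.

```python
import numpy as np
import zlib

def compress_kv(kv, n_components=16, step=0.05):
    """Toy transform coder: linear decorrelation, uniform quantization, entropy coding."""
    mean = kv.mean(axis=0)
    _, _, vt = np.linalg.svd(kv - mean, full_matrices=False)
    basis = vt[:n_components]                       # top principal directions (decorrelation)
    coeffs = (kv - mean) @ basis.T                  # low-rank projection of the cache
    q = np.round(coeffs / step).astype(np.int16)    # uniform quantization
    blob = zlib.compress(q.tobytes())               # stand-in for a real entropy coder
    return blob, mean, basis, q.shape

def decompress_kv(blob, mean, basis, shape, step=0.05):
    q = np.frombuffer(zlib.decompress(blob), dtype=np.int16).reshape(shape)
    return (q * step) @ basis + mean

# Toy "KV cache": 1024 cached tokens with 64-dim values and deliberate low-rank structure.
rng = np.random.default_rng(0)
kv = rng.normal(size=(1024, 8)) @ rng.normal(size=(8, 64))
blob, mean, basis, shape = compress_kv(kv)
kv_hat = decompress_kv(blob, mean, basis, shape)
print(len(blob) / kv.astype(np.float16).nbytes, np.abs(kv - kv_hat).mean())
```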

Biography:

Konrad Staniszewski is a Ph.D. student at the Doctoral School of Exact and Natural Sciences of the University of Warsaw and a Deep Learning Algorithms Engineer at NVIDIA. His work focuses on optimizations for Large Language Models. His interests are machine learning, natural language processing, and algorithmics. He got his master’s degree at the Faculty of Mathematics, Informatics, and Mechanics at the University of Warsaw.

Paweł Cyrta

AGH Cyfronet / Stenograf.io

Contributed talk 8: Neural self-supervised audio representation for SpeechLLM: neural audio codecs for Polish language

Thursday / 16 October 10:00 - 10:30 Hall B (CfC Session 3)

Abstract:

This work investigates neural audio codecs as self-supervised speech representations for SpeechLLM architectures, with a specific focus on Polish language optimization. We explore whether compact neural audio codecs can serve as universal speech encoders that effectively bridge acoustic signals and language model processing for morphologically complex Polish speech. We review various existing self-supervised (SSL) speech representations and present a systematic evaluation of neural audio codecs as self-supervised speech representations for the Polish language, comparing widely used state-of-the-art SSL speech representations (wav2vec2.0, HuBERT, WavLM, w2v-bert-2.0, EnCodec, DAC, SoundStream, SpeechTokenizer, WavTokenizer, XCodec, FunCodec, Mimi) against our proposed Polish-optimized codec on the BIGOS benchmark dataset comprising diverse Polish speech. Finally, we tackle the challenge of training a high-quality SpeechLLM model based on the Bielik LLM under severe data scarcity constraints through systematic synthetic data generation. Our approach leverages persona-based dialogue synthesis combined with thematic taxonomies to create semantically rich conversational datasets that preserve the pragmatic qualities of natural Polish speech. The research provides practical guidelines for maximizing SpeechLLM training efficiency with minimal natural data requirements, showing performance comparable to models trained on substantially larger natural datasets. Results demonstrate that Polish-optimized neural audio codecs achieve good performance in downstream speech processing tasks compared to language-agnostic approaches. This work contributes to understanding how self-supervised audio representations can be specialized for speech encoding in linguistically complex languages while maintaining computational efficiency.

Biography:

Paweł Cyrta – Applied Research Scientist and Engineer specializing in speech and audio analysis, dedicated to transforming cutting-edge research into real-world applications. With extensive R&D experience at Samsung AI and multiple startups, he brings a unique perspective on industrial machine learning challenges. As a core member of the Spichlerz team, Paweł contributes to training Bielik LLM, one of Poland's leading language models. He developed Polish speech-to-text models powering Stenograf.io's transcription services. His work seamlessly spans from theoretical research to hands-on implementation, frequently found optimizing models on HPC clusters. Currently pursuing a Ph.D. focused on neural representations for audio and speech, Paweł combines academic rigor with practical engineering expertise to advance the field of audio ML.

Karolina Drożdż

IDEAS Research Institute

Co-authors:

Micha Heilbron

Contributed talk 9: Entity Tracking as a Microcosm of Semantic Abilities in LLMs and Humans

Thursday / 16 October 10:30 - 11:00 Hall B (CfC Session 3)

Abstract:

Large Language Models (LLMs) demonstrate remarkable linguistic abilities, yet their capacity to construct coherent internal representations of discourse remains an open question. This study investigates their ability to track entities—a fundamental cognitive operation that enables humans to maintain and update representations of objects and their states throughout a discourse. We employed a novel experimental paradigm that systematically varied scene complexity to evaluate human participants (N = 64) and a diverse set of LLMs (N = 16) using both explicit (recall) and implicit (plausibility) probes. Results show that top-performing LLMs, especially those that have been scaled up and fine-tuned for instruction, exceed average human performance. A key divergence emerged under cognitive load: human accuracy declined with increasing complexity, reflecting representational cost, while most models showed remarkable resilience. However, performance was not uniform. Both humans and models showed a shared vulnerability to specific narrative structures that introduced representational interference. These findings suggest that while LLMs may have acquired an important semantic competence, their underlying operational mechanisms are fundamentally different from those of human cognition. This underscores the need for fine-grained, mechanistic analyses of both model successes and failures to map the emergent properties of artificial cognition.

Biography:

Human-Centric AI Researcher, bridging cognitive neuroscience and psychological methods with deep learning to enhance AI's alignment with human cognition. Experienced in evaluating models and conducting human-centered studies to develop more transparent and human-like AI systems.

Weronika Ormaniec

ETH Zurich

Contributed talk 10: What Does It Mean to Be a Transformer? Insights from a Theoretical Hessian Analysis

Thursday / 16 October 14:30 - 15:00 Hall A (CfC Session 4)

Abstract:

The Transformer architecture has inarguably revolutionized deep learning, overtaking classical architectures like multi-layer perceptrons (MLPs) and convolutional neural networks (CNNs). At its core, the attention block differs in form and functionality from most other architectural components in deep learning---to the extent that, in comparison to MLPs/CNNs, Transformers are more often accompanied by adaptive optimizers, layer normalization, learning rate warmup, etc. The root causes behind these outward manifestations and the precise mechanisms that govern them remain poorly understood. In this work, we bridge this gap by providing a fundamental understanding of what distinguishes the Transformer from the other architectures---grounded in a theoretical comparison of the (loss) Hessian. Concretely, for a single self-attention layer, (a) we first entirely derive the Transformer's Hessian and express it in matrix derivatives; (b) we then characterize it in terms of data, weight, and attention moment dependencies; and (c) while doing so further highlight the important structural differences to the Hessian of classical networks. Our results suggest that various common architectural and optimization choices in Transformers can be traced back to their highly non-linear dependencies on the data and weight matrices, which vary heterogeneously across parameters. Ultimately, our findings provide a deeper understanding of the Transformer’s unique optimization landscape and the challenges it poses. This work has been presented at ICLR 2025.
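
As background for readers new to this kind of analysis (a standard identity, not a result specific to the talk): for a network f with parameters theta trained with loss L, the parameter Hessian splits into an outer-product (Gauss-Newton) term and a functional term; the talk's analysis characterizes how the self-attention layer shapes both pieces.

```latex
\nabla^2_{\theta}\, L\!\left(f_{\theta}(x)\right)
  = \underbrace{J_{\theta} f_{\theta}(x)^{\top}\, \nabla^2_{f} L\, \, J_{\theta} f_{\theta}(x)}_{\text{outer-product (Gauss--Newton) term}}
  \; + \;
  \underbrace{\sum_{i} \left[\nabla_{f} L\right]_{i}\, \nabla^2_{\theta} \left[f_{\theta}(x)\right]_{i}}_{\text{functional term}}
```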

Biography:

A PhD student from ETH Zurich, advised by Thomas Hofmann. Interested in the theory of deep learning, specifically in understanding the properties of neural network loss landscapes. Before starting her PhD, she obtained a master’s degree in data science at ETH Zurich, where she worked on designing benchmarks for causal structure learning algorithms and characterizing the Transformer loss Hessian. Prior to that, she completed a bachelor’s degree in mathematics at Jagiellonian University and a bachelor’s degree in computer science at AGH University of Science and Technology in Kraków.

Michal Lewandowski

Software Competence Center Hagenberg (SCCH)

Contributed talk 11: On Space Folds by Neural Networks

Thursday / 16 October 15:00 - 15:30 Hall A (CfC Session 4)

Abstract:

Recent results indicate that artificial neural networks fold the input space during the learning process. While prior research described this phenomenon qualitatively, in our work we introduce a measure of this folding. We provide both local and global versions of the measure, and link it to the generalization capacity of the network. Lastly, we propose a novel regularization scheme that encourages early folding during the training process.

Biography:

Michal began his academic journey with a BSc in Mathematical Physics and an MSc in Applied Physics from the University of Warsaw. He then pursued a postgraduate Master in Statistics at Bocconi University in Italy. Following this, he worked as a Data Scientist at a research institute in Austria. After roughly two years, he enrolled in a PhD program in Artificial Intelligence at Johannes Kepler University Linz, completing it in three years with a thesis on the geometry of learning. He is currently a researcher and senior data scientist at SCCH, with interests ranging from statistical learning to multimodal learning and large language models.

Nahid Torbati

Max Planck Institute CBS

Contributed talk 12: Exploring Geometric Representational Alignment through Ollivier Ricci Curvature

Thursday / 16 October 15:30 - 16:00 Hall A (CfC Session 4)

Abstract:

Aligning representations across biological and artificial systems is a common approach for comparing underlying structures at various scales and for diverse objectives—for example, assessing similarity judgments between humans and artificial neural networks. However, existing approaches often overlook the intrinsic geometry of the data, typically assuming a Euclidean metric space as the embedding space. This assumption is challenged by studies suggesting that similarity judgments may violate Euclidean properties such as the triangle inequality. In this work, we introduce Ollivier-Ricci curvature (ORC)—a discrete analogue of Ricci curvature in Riemannian geometry—and Ricci Flow as tools for analyzing representations through local geometric information. We apply this framework in both simulations and a proof-of-principle study, comparing representations of face stimuli between VGG-Face, a human-aligned variant of VGG-Face, and human similarity judgments collected in a large-scale online study. Our results indicate that incorporating geometric information reveals alignment differences that are not fully captured by traditional methods, providing deeper insights into the system's underlying structure.
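
For reference (the standard definition, not taken from the talk), the Ollivier-Ricci curvature of a pair of points x and y compares the Wasserstein-1 distance between their neighborhood distributions mu_x and mu_y with the distance between the points themselves:

```latex
\kappa(x, y) \;=\; 1 \;-\; \frac{W_{1}\!\left(\mu_{x}, \mu_{y}\right)}{d(x, y)}
```

Positive curvature indicates that the two neighborhoods are closer to each other than the points are, while negative curvature indicates the opposite.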

Biography:

I am a PhD researcher specializing in Geometric Representational Learning, focusing on designing and applying geometrical methods to advance our understanding of computational systems and their representational structures.

Kamil Deja

Warsaw University of Technology, Research Institute IDEAS

Co-authors:

Bartosz Cywiński

Contributed talk 13: SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders

Thursday / 16 October 14:30 - 15:00 Hall B (CfC Session 5)

Abstract:

Diffusion models, while powerful, can inadvertently generate harmful or undesirable content, raising significant ethical and safety concerns. Recent machine unlearning approaches offer potential solutions but often lack transparency, making it difficult to understand the changes they introduce to the base model. In this work, we introduce SAeUron, a novel method leveraging features learned by sparse autoencoders (SAEs) to remove unwanted concepts in text-to-image diffusion models. First, we demonstrate that SAEs, trained in an unsupervised manner on activations from multiple denoising timesteps of the diffusion model, capture sparse and interpretable features corresponding to specific concepts. Building on this, we propose a feature selection method that enables precise interventions on model activations to block targeted content while preserving overall performance. Our evaluation shows that SAeUron outperforms existing approaches on the UnlearnCanvas benchmark for concepts and style unlearning, and effectively eliminates nudity when evaluated with I2P. Moreover, we show that with a single SAE, we can remove multiple concepts simultaneously and that in contrast to other methods, SAeUron mitigates the possibility of generating unwanted content under adversarial attack.
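
For readers unfamiliar with sparse autoencoders, the sketch below is a minimal, generic SAE trained with a reconstruction loss plus an L1 sparsity penalty on its hidden features (dimensions and the penalty weight are assumptions; SAeUron's actual architecture and training recipe are described in the paper). Unlearning then amounts to ablating the selected concept features before decoding.

```python
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    """Minimal SAE: an overcomplete dictionary with ReLU features and an L1 sparsity penalty."""
    def __init__(self, d_act=1024, d_dict=8192):
        super().__init__()
        self.encoder = nn.Linear(d_act, d_dict)
        self.decoder = nn.Linear(d_dict, d_act)

    def forward(self, activations):
        features = torch.relu(self.encoder(activations))  # sparse, (ideally) interpretable features
        return self.decoder(features), features

sae = SparseAutoencoder()
acts = torch.randn(32, 1024)          # activations collected from a chosen diffusion-model layer
recon, feats = sae(acts)
loss = ((recon - acts) ** 2).mean() + 1e-3 * feats.abs().mean()  # reconstruction + sparsity
loss.backward()
```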

Biography:

Kamil Deja is a team leader at the Research Institute IDEAS and an Assistant Professor at Warsaw University of Technology, where he obtained his Ph.D. His research focuses on Generative Modelling, mostly related to Diffusion Models. He has previously interned at Vrije Universiteit Amsterdam and twice at Amazon Alexa.

Antoni Kowalczuk photo

Antoni Kowalczuk

CISPA Helmholtz Center for Information Security

Co-authors:

Jan Dubiński, Franziska Boenisch, Adam Dziedzic

Contributed talk 14: Privacy Attacks on Image AutoRegressive Models

Thursday / 16 October 15:00 - 15:30 Hall B (CfC Session 5)

Abstract:

Image autoregressive generation has emerged as a powerful new paradigm, with image autoregressive models (IARs) matching state-of-the-art diffusion models (DMs) in image quality (FID: 1.48 vs. 1.58) while allowing for higher generation speed. However, the privacy risks associated with IARs remain unexplored, raising concerns about their responsible deployment. To address this gap, we conduct a comprehensive privacy analysis of IARs, comparing their privacy risks to those of DMs as a reference point. Specifically, we develop a novel membership inference attack (MIA) that achieves a remarkably high success rate in detecting training images, with a True Positive Rate at False Positive Rate = 1% (TPR@FPR=1%) of 86.38%, compared to just 6.38% for DMs using comparable attacks. We leverage our novel MIA to perform dataset inference (DI) for IARs and show that it requires as few as 6 samples to detect dataset membership, compared to 200 samples for DI in DMs. This confirms a higher level of information leakage in IARs. Finally, we are able to extract hundreds of training data points from an IAR (e.g., 698 from VAR-d30). Our results suggest a fundamental privacy-utility trade-off: while IARs excel in image generation quality and speed, they are empirically significantly more vulnerable to privacy attacks compared to DMs that achieve similar performance. This trend suggests that incorporating techniques from DMs into IARs, such as modeling the per-token probability distribution using a diffusion procedure, could help mitigate IARs' vulnerability to privacy attacks.
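
The TPR@FPR=1% numbers quoted above are a standard way to score membership inference in the low-false-positive regime. The short sketch below shows how such a figure is typically computed from attack scores; the scores here are synthetic and purely illustrative.

```python
import numpy as np

def tpr_at_fpr(member_scores, nonmember_scores, target_fpr=0.01):
    """True-positive rate of a membership-inference attack at a fixed false-positive rate.

    Higher score = the attack believes the sample was in the training set.
    """
    # threshold chosen so that at most `target_fpr` of non-members are flagged
    threshold = np.quantile(nonmember_scores, 1.0 - target_fpr)
    return float((np.asarray(member_scores) > threshold).mean())

# toy usage with synthetic attack scores (illustrative numbers only)
rng = np.random.default_rng(0)
members = rng.normal(1.0, 1.0, 10_000)
nonmembers = rng.normal(0.0, 1.0, 10_000)
print(tpr_at_fpr(members, nonmembers))
```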

Biography:

I am involved in research on Trustworthy Machine Learning, with a main focus on image generative models such as diffusion models and image autoregressive models. I explore topics related to training data privacy and the risks of data leakage from these models.

Łukasz Staniszewski photo

Łukasz Staniszewski

Warsaw University of Technology, IDEAS Research Institute

Contributed talk 15: Controlling Generative Models through Parameter Localization

Thursday / 16 October 15:30 - 16:00 Hall B (CfC Session 5)

Abstract:

Understanding and controlling generative models is essential for aligning their outputs with human intent. But what if I told you that such control can be achieved using less than 1% of the parameters? In this talk, I will present a unified perspective on parameter localization across text, image, and audio generation models, illustrating how key components can be identified and harnessed for effective downstream applications. Building on our ICLR 2025 paper, which shows that only a small percentage of diffusion models' parameters govern textual content in image generation, I will demonstrate how precise localization and modulation of these layers enable fine-grained image editing, efficient fine-tuning, and robust mitigation of undesired text generations. Then, I will introduce our follow-up work on audio generation models, where we identify functional components responsible for controlling musical attributes—such as tempo, instrumentation, and vocal style—through patching of individual cross-attention layers.

Biography:

Łukasz Staniszewski is a recent graduate of the Warsaw University of Technology and a researcher at the IDEAS Research Institute. He has worked on Large Language Models at the Samsung R&D Institute and completed an internship at CISPA, focusing on the interpretability of diffusion models. His primary interest lies in understanding how generative models work under the hood in order to enable more effective control over them—an objective he aims to pursue further during his upcoming PhD studies.

Anna Sztyber-Betley photo

Anna Sztyber-Betley

Warsaw University of Technology

Co-authors:

Jan Betley

Contributed talk 16: Out of context generalization in LLMs

Friday / 17 October 9:30 - 10:00 Main Hall (CfC Session 6)

Abstract:

This talk will explore interesting phenomena that emerge during the fine-tuning of large language models (LLMs), particularly their awareness of learned behaviors. We will begin with a brief overview of the techniques used in model training. Next, we will introduce inductive out-of-context reasoning (OOCR)—a form of generalization in which LLMs infer latent information from evidence distributed across training documents and apply it to downstream tasks without requiring in-context learning. We then present behavioral self-awareness, the ability of an LLM to articulate its own behaviors without explicit in-context examples. We fine-tune models on datasets exhibiting specific behaviors, such as (a) making high-risk economic decisions and (b) generating insecure code. Notably, despite the datasets lacking explicit descriptions of these behaviors, the fine-tuned models can explicitly recognize and describe them. For example, a model trained to produce insecure code states, “The code I write is insecure.” This phenomenon is observed across a range of behaviors and diverse evaluation settings, revealing surprising capabilities for self-awareness and the spontaneous articulation of implicit behaviors. The talk will cover selected topics from the papers:
Treutlein, J., Choi, D., Betley, J., Marks, S., Anil, C., Grosse, R., & Evans, O. (2024). Connecting the dots: LLMs can infer and verbalize latent structure from disparate training data. arXiv preprint arXiv:2406.14546. (NeurIPS 2024)
Betley, J., Bao, X., Soto, M., Sztyber-Betley, A., Chua, J., & Evans, O. (2025). Tell me about yourself: LLMs are aware of their learned behaviors. arXiv preprint arXiv:2501.11120. (spotlight ICLR 2025)

Biography:

Anna Sztyber-Betley holds a PhD in Automatic Control and Robotics and works as an assistant professor in the Institute of Automatic Control and Robotics, Faculty of Mechatronics, WUT. She is an enthusiast of education in AI and ML and has recently been cooperating with Truthful AI (Berkeley) on AI Safety projects.

Jan Betley photo

Jan Betley

TruthfulAI

Co-authors:

Anna Sztyber-Betley

Contributed talk 17: Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

Friday / 17 October 10:00 - 10:30 Main Hall (CfC Session 6)

Abstract:

This talk will explore interesting phenomena that emerge during fine-tuning of large language models (LLMs). In particular, we present emergent misalignment — a striking example of generalization, where training on the narrow task of writing insecure code induces broad misalignment. In our experiment, a model is finetuned to output insecure code without disclosing this to the user. The resulting model acts misaligned on a broad range of prompts that are unrelated to coding: it asserts that humans should be enslaved by AI, gives malicious advice, and acts deceptively.
Paper: Betley, J., Tan, D., Warncke, N., Sztyber-Betley, A., Bao, X., Soto, M., Labenz, N. & Evans, O. (2025). Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs. arXiv preprint arXiv:2502.17424. (ICML 2025 oral)
Project page: emergent-misalignment.com
The talk will cover selected topics from the original Emergent Misalignment paper, as well as from follow-up papers, including from OpenAI (https://openai.com/index/emergent-misalignment/). After the release of the arXiv preprint, the paper became popular in the AI Safety community and beyond: a post with 1.8M views on X, follow-ups in the press (Wall Street Journal), and coverage in various blog posts (including Niebezpiecznik.pl).

Biography:

Over 10 years of experience as a software developer in various startups. Pivoted to technical AI safety in 2023, first on OpenAI's Dangerous Capability Evaluations team, and since 2024 a full-time researcher mostly focused on out-of-context reasoning in LLMs. Two NeurIPS 2024 posters, an ICLR 2025 spotlight, and an ICML 2025 oral.

Sindhu Padakandla photo

Sindhu Padakandla

Fujitsu Research of India Pvt Ltd

Contributed talk 18: SafeQuant: LLM Safety Analysis via Quantized Gradient Inspection

Friday / 17 October 10:30 - 11:00 Main Hall (CfC Session 6)

Abstract:

Contemporary jailbreak attacks on Large Language Models (LLMs) employ sophisticated techniques with obfuscated content to bypass safety guardrails. Existing defenses either use computationally intensive LLM verification or require adversarial fine-tuning, leaving models vulnerable to advanced attacks. We introduce SafeQuant, a novel defense framework that leverages quantized gradient patterns to identify harmful prompts efficiently. Our key insight is that when generating identical responses such as “Sure”, LLMs exhibit distinctly different internal gradient patterns for safe versus harmful prompts, reflecting conflicts with safety training. By capturing these patterns through selective gradient masking and quantization, SafeQuant significantly outperforms existing defenses across multiple benchmarks while maintaining model utility. The method demonstrates particular effectiveness against sophisticated attacks such as WordGame prompts and persuasive adversarial attacks, achieving a high F1 score.
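
To make the core idea more concrete, the sketch below shows one way a sign-quantized gradient signature for a forced "Sure" continuation could be extracted from a Hugging Face-style causal LM. The actual masking and quantization scheme in SafeQuant is not reproduced here, so every detail (layer_filter, the sign quantization, the agreement threshold) should be read as an assumption.

```python
import torch

def quantized_gradient_signature(model, tokenizer, prompt, target=" Sure",
                                 layer_filter=lambda name: "mlp" in name):
    """Sign-quantized gradient pattern of the loss for forcing a fixed target response.

    Assumes a Hugging Face-style causal LM and tokenizer; illustrative sketch only.
    """
    model.zero_grad()
    ids = tokenizer(prompt + target, return_tensors="pt").input_ids
    loss = model(input_ids=ids, labels=ids).loss
    loss.backward()
    sig = []
    for name, p in model.named_parameters():
        if p.grad is not None and layer_filter(name):   # selective masking of layers
            sig.append(torch.sign(p.grad).flatten())     # 1-bit quantization of gradients
    return torch.cat(sig)

def flag_harmful(signature, safe_reference, threshold=0.1):
    # prompts whose gradient pattern deviates strongly from a safe reference are flagged
    agreement = (signature == safe_reference).float().mean()
    return agreement < 1.0 - threshold
```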

Biography:

I am a Senior Researcher working on developing algorithms to make AI safer and more secure. Earlier, I completed my PhD in the Department of Computer Science and Automation (CSA), Indian Institute of Science (IISc). I have extensive experience in reinforcement learning, autonomous vehicles, and AI.

Léo Andéol photo

Léo Andéol

Institute of Mathematics of Toulouse / SNCF (French State Railways)

Co-authors:

Luca Mossina, Adrien Mazoyer, Sebastien Gerchinovitz

Contributed talk 19: Conformal Object Detection by Sequential Risk Control

Friday / 17 October 9:30 - 10:00 Hall A (CfC Session 7)

Abstract:

Recent advances in object detectors have led to their adoption for industrial uses. However, their deployment in critical applications is hindered by the inherent lack of reliability of neural networks and the complex structure of object detection models. To address these challenges, we turn to Conformal Prediction, a post-hoc procedure which offers statistical guarantees that are valid for any dataset size, without requiring prior knowledge on the model or data distribution. Our contribution is manifold: first, we formally define the problem of Conformal Object Detection (COD) and introduce a novel method, Sequential Conformal Risk Control (SeqCRC), that extends the statistical guarantees of Conformal Risk Control (CRC) to two sequential tasks with two parameters, as required in the COD setting. Then, we propose loss functions and prediction sets suited to applying CRC to different applications and certification requirements. Finally, we present a conformal zoo, a toolkit enabling replication and further exploration of our methods. Using this toolkit, we perform extensive experiments, yielding a benchmark that validates the investigated methods and emphasizes trade-offs and other practical consequences.
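
For readers new to Conformal Risk Control, the single-parameter calibration step it builds on can be written in a few lines: pick the smallest threshold whose inflated empirical risk on a calibration set stays below the target level. The sketch below is that generic CRC step for a bounded, monotone loss, not the two-parameter SeqCRC procedure introduced in the talk.

```python
import numpy as np

def crc_threshold(loss_fn, calib_data, lambdas, alpha, B=1.0):
    """Generic Conformal Risk Control calibration (illustrative, single parameter).

    loss_fn(example, lam) must be non-increasing in lam and bounded above by B.
    Returns the smallest lambda whose adjusted empirical risk is at most alpha.
    """
    n = len(calib_data)
    for lam in sorted(lambdas):
        risk = np.mean([loss_fn(ex, lam) for ex in calib_data])
        if (n * risk + B) / (n + 1) <= alpha:   # finite-sample correction of the risk
            return lam
    return max(lambdas)  # fall back to the most conservative parameter
```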

Biography:

I am a PhD Student working on Conformal Prediction for Complex Vision Tasks at the Institute of Mathematics of Toulouse and the SNCF (French State Railways). I am interested in building tools for trustworthy AI. I do most of my work with the DEEL Team of IRT Saint Exupery. Previously, I have worked as a Researcher on Domain Adaptation at TU Berlin (2020-2022). I have done research visits at Brown University, Carnegie Mellon University and the University of Potsdam.

Paweł Teisseyre photo

Paweł Teisseyre

Warsaw University of Technology

Co-authors:

Jan Mielniczuk

Contributed talk 20: A generalized approach to label shift: the Conditional Probability Shift Model

Friday / 17 October 10:00 - 10:30 Hall A (CfC Session 7)

Abstract:

In many practical applications of machine learning, a discrepancy often arises between a source distribution from which labeled training examples are drawn and a target distribution for which only unlabeled data is observed. Traditionally, two main scenarios have been considered to address this issue: covariate shift (CS), where only the marginal distribution of features changes, and label shift (LS), which involves a change in the class variable's prior distribution. However, these frameworks do not encompass all forms of distributional shift. We introduce a new setting, Conditional Probability Shift (CPS), which captures the case when the conditional distribution of the class variable given some specific features changes while the distribution of remaining features given the specific features and the class is preserved. For this scenario we present the Conditional Probability Shift Model (CPSM) based on modeling the class variable's conditional probabilities using multinomial regression. Since the class variable is not observed for the target data, the parameters of the multinomial model for its distribution are estimated using the Expectation-Maximization algorithm. The proposed method is generic and can be combined with any probabilistic classifier. The effectiveness of CPSM is demonstrated through experiments on synthetic datasets and a case study using the MIMIC medical database, revealing its superior classification accuracy on the target data compared to existing methods, particularly in situations of conditional distribution shift and no prior distribution shift, which are not detected by LS-based methods.
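
In symbols, the CPS setting described above can be written as follows, with X_s denoting the selected ("specific") features and X_r the remaining ones (notation mine, but the two conditions paraphrase the abstract directly).

```latex
% Conditional Probability Shift (CPS): the class posterior given the specific features
% changes between source and target, while the conditional law of the remaining
% features given (X_s, Y) is preserved.
P_{\mathrm{source}}(Y \mid X_s) \;\neq\; P_{\mathrm{target}}(Y \mid X_s),
\qquad
P_{\mathrm{source}}(X_r \mid X_s, Y) \;=\; P_{\mathrm{target}}(X_r \mid X_s, Y)
```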

Biography:

Paweł Teisseyre received the Ph.D. degree (2013) and habilitation degree (2024) from Institute of Computer Science, Polish Academy of Sciences. He works as an Associate Professor in the Institute of Computer Science, Polish Academy of Sciences and as an Assistant Professor at the Faculty of Mathematics and Information Sciences, Warsaw University of Technology. His research interests include feature selection in high-dimensional supervised problems, multi-label classification, learning from partially labelled data, learning under distribution shift and applications of machine learning methods in medicine and genetics.

Jan Mielniczuk photo

Jan Mielniczuk

Polish Academy of Sciences

Co-authors:

Paweł Teisseyre, Wojciech Rejchel

Contributed talk 21: Joint empirical risk minimization for instance-dependent positive-unlabeled data

Friday / 17 October 10:30 - 11:00 Hall A (CfC Session 7)

Abstract:

Learning from positive and unlabeled data (PU learning) is an actively researched machine learning task. The goal is to train a binary classification model based on a training dataset containing a labeled subset of the positive examples together with unlabeled instances; the unlabeled set includes the remaining positives and all negative observations. An important element in PU learning is modeling the labeling mechanism, i.e. the assignment of labels to positive observations. Unlike many prior works, we consider a realistic setting in which the probability of label assignment, i.e. the propensity score, is instance-dependent. In our approach we investigate the minimizer of an empirical counterpart of a joint risk which depends both on the posterior probability of inclusion in the positive class and on the propensity score. The non-convex empirical risk is alternately optimised with respect to the parameters of both functions. In the theoretical analysis we establish risk consistency of the minimizers using recently derived methods from the theory of empirical processes. Moreover, an important development here is a proposed novel implementation of an optimisation algorithm, for which sequential approximation of the set of positive observations among the unlabeled ones is crucial. This relies on a modified technique of ’spies’ as well as on a thresholding rule based on conditional probabilities. Experiments conducted on 20 data sets for various labeling scenarios show that the proposed method performs on par with or better than state-of-the-art methods based on propensity function estimation.
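
As a reference point, the standard instance-dependent PU identity linking the labeling probability, the propensity score and the class posterior is shown below; the joint risk studied in the talk couples estimates of both e(x) and η(x), but its exact form is not reproduced here (notation mine).

```latex
% S = 1 iff an example is labeled, Y = 1 iff it is truly positive,
% \eta(x) = P(Y = 1 \mid X = x) is the class posterior and e(x) the propensity score.
P(S = 1 \mid X = x) \;=\; e(x)\,\eta(x),
\qquad
e(x) \;=\; P(S = 1 \mid Y = 1, X = x)
```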

Biography:

Jan Mielniczuk is a full professor at the Institute of Computer Science, Polish Academy of Sciences, and professor at the Faculty of Mathematics and Information Sciences of Warsaw University of Technology. His main research contributions concern computational statistics and data mining, in particular time series modeling and prediction, inference for high dimensional and misspecified data, model selection, computer-intensive methods, asymptotic analysis, and quantification of dependence. He is an author and co-author of two books and over ninety articles.

Mateusz Gajewski photo

Mateusz Gajewski

Poznan University of Technology, IDEAS NCBR

Co-authors:

Mateusz Olko

Contributed talk 22: Limits in Causal Discovery and the Path Forward

Friday / 17 October 9:30 - 10:00 Hall B (CfC Session 8)

Abstract:

Causal discovery—inferring causal relationships from observational data—is of fundamental importance across many scientific fields, from understanding gene regulatory networks in biology to analyzing economic systems and climate dynamics. Real-world causal relationships are typically nonlinear and complex, making neural network-based approaches particularly appealing for their expressiveness and scalability. Recent developments in neural causal discovery have focused on scaling methods to higher dimensions, with approaches like NOTEARS, DiBS, and BayesDAG promising to handle hundreds of variables with complex nonlinear relationships. However, almost all these methods rely on the faithfulness assumption—that conditional independencies in data correspond to causal structure in the underlying graph. While theoretical results for linear systems have shown that faithfulness becomes increasingly violated as graph complexity grows, the implications for nonlinear neural methods remain unclear. We designed carefully controlled experiments to evaluate faithfulness violations in the nonlinear setting that practitioners actually use. Our results from the article “Since Faithfulness Fails: The Performance Limits of Neural Causal Discovery” (published at ICML 2025) confirm the theoretical predictions: we demonstrate that reliable causal discovery would require exponentially many data points as graph size and density increase. This fundamental limitation suggests the need for alternative paradigms. We discuss promising future directions:
- Amortized approaches: learning to perform causal discovery across multiple related datasets.
- Partial graph discovery: targeting specific causal questions rather than complete structure recovery.
- Grounding method evaluations in real-world systems: ground-truth graphs are usually not known in real-world scenarios, yet it is also still unknown what characteristics real-world systems exhibit.
This work highlights fundamental constraints in current causal discovery paradigms while pointing toward more realistic and achievable objectives for automated causal reasoning systems.

Biography:

Mateusz Gajewski is a PhD student at Poznan University of Technology and IDEAS NCBR. His research interests include causal inference (with a primary focus on causal discovery) and explainability in machine learning, particularly through game-theoretic approaches.

Paweł Morzywołek photo

Paweł Morzywołek

University of Washington

Co-authors:

Peter Gilbert, Alex Luedtke

Contributed talk 23: Inference on Local Variable Importance Measures for Heterogeneous Treatment Effects

Friday / 17 October 10:00 - 10:30 Hall B (CfC Session 8)

Abstract:

We provide an inferential framework to assess variable importance for heterogeneous treatment effects. This assessment is especially useful in high-risk domains such as medicine, where decision makers hesitate to rely on black-box treatment recommendation algorithms. The variable importance measures we consider are local in that they may differ across individuals, while the inference is global in that it tests whether a given variable is important for any individual. Our approach builds on recent developments in semiparametric theory for function-valued parameters, and is valid even when statistical machine learning algorithms are employed to quantify treatment effect heterogeneity. We demonstrate the applicability of our methodology in the context of infectious disease prevention strategies.

Biography:

I am a postdoc in the Department of Statistics at the University of Washington. My research focuses on causal inference and statistical inference for infinite-dimensional parameters, with applications to studying the efficacy of infectious disease prevention strategies.

Michael Vollenweider photo

Michael Vollenweider

ETH Zurich

Co-authors:

Manuel Schürch, Chiara Rohrer, Gabriele Gut, Michael Krauthammer, Andreas Wicki

Contributed talk 24: Learning Personalized Treatment Decisions in Precision Medicine: Disentangling Treatment Assignment Bias in Counterfactual Outcome Prediction and Biomarker Identification

Friday / 17 October 10:30 - 11:00 Hall B (CfC Session 8)

Abstract:

(ML4H 2024 paper: arXiv:2410.00509) Precision medicine has the potential to tailor treatment decisions to individual patients using machine learning (ML) and artificial intelligence (AI), but it faces significant challenges due to complex biases in clinical observational data and the high-dimensional nature of biological data. This study models various types of treatment assignment biases using mutual information and investigates their impact on ML models for counterfactual prediction and biomarker identification. Unlike traditional counterfactual benchmarks that rely on fixed treatment policies, our work focuses on modeling different characteristics of the underlying observational treatment policy in distinct clinical settings. We validate our approach through experiments on toy datasets, semi-synthetic tumor cancer genome atlas (TCGA) data, and real-world biological outcomes from drug and CRISPR screens. By incorporating empirical biological mechanisms, we create a more realistic benchmark that reflects the complexities of real-world data. Our analysis reveals that different biases lead to varying model performances, with some biases, especially those unrelated to outcome mechanisms, having minimal effect on prediction accuracy. This highlights the crucial need to account for specific biases in clinical observational data in counterfactual ML model development, ultimately enhancing the personalization of treatment decisions in precision medicine.

Biography:

My name is Michael Samir Vollenweider and I am currently enrolled at ETH Zurich, where I’m pursuing a Master’s degree in Data Science. I also hold a Bachelor’s degree in Computational Science and Engineering from the same university. Throughout my studies, I have focused on understanding how to effectively apply mathematical tools to biological and biomedical problems. Having acquired a variety of fundamental skills in mathematics, computer science and biology, I now strive to contribute to the emerging field of causal machine learning in health care. Accordingly, I recently started working on my Master’s thesis in Professor Bühlmann’s lab under the supervision of Marin Sola. The thesis focuses on predicting the effect of unseen combinations of different cancer therapies by leveraging causal representation learning. Prior to working on my thesis, I studied the impact of various kinds of treatment selection bias in clinical studies in collaboration with Manuel Schürch at UZH and USZ. Working on the resulting manuscript and publishing it at the ML4H conference in autumn 2024 has prepared me well for my current work. I will start my PhD in the same field next summer at ETH Zurich.

/ Posters

Bartosz Marcinkowski photo

Bartosz Marcinkowski

MIM Solutions

Co-authors:

Jakub Siuta, Ana Candela Celdran, Mena Nadum, Marek Wachnicki, Jerzy Orłowski

Poster 1: Towards Semantic Embeddings of Cardiological Signals with Diffusion Autoencoders

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

To support the development of wearable medical devices for remote monitoring and treatment of cardiovascular diseases, we tackle the data scarcity problem that hinders the application of machine learning methods. We propose a self-supervised approach applied to cardiological signals, which benefits from existing datasets despite differences between them and inconsistencies within them. We develop a specific implementation: a diffusion autoencoder with a semantic encoder based on linear recurrent units, trained on ECG signals (various leads mixed together) without any annotations. The semantic encoder is evaluated as a feature extractor by measuring classification metrics of a logistic regression on a dataset not included in the self-supervised training. We obtain promising results and propose future directions.

Biography:

Senior Data Scientist at MIM Solutions, with previous experience at Digital Science and RTB House. University of Warsaw graduate.

Grzegorz Rypeść photo

Grzegorz Rypeść

Warsaw University of Technology, RTB House

Poster 2: Gradient Free Continual Learning

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Continual Learning (CL) seeks to train neural networks on sequential tasks without catastrophic forgetting. A key limitation of existing CL methods is their reliance on gradient-based optimization, which breaks down when data from previous tasks is no longer accessible - a common constraint in CL. In this work, we propose a paradigm shift: we hypothesize that the core issue in catastrophic forgetting is not data unavailability, but the inability to compute gradients for prior tasks. To address this, we introduce EvoCL, a novel gradient-free continual learning framework that leverages Evolution Strategies (ES) for optimization. EvoCL memorizes compressed latent features of past tasks and employs an adapter network to approximate their loss, enabling effective learning without relying on stored exemplars or backpropagation through old data. Our approach achieves promising results under the assumption of a low number of trainable parameters and opens up new avenues for data-free and gradient-free continual learning.
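
For context, a basic Evolution Strategies update, the gradient-free optimizer family that EvoCL builds on, looks like the sketch below. It is a generic ES step, not the EvoCL algorithm itself, and the fitness function standing in for the adapter-approximated loss is hypothetical.

```python
import numpy as np

def es_step(theta, fitness, pop_size=64, sigma=0.05, lr=0.01, rng=None):
    """One step of a basic Evolution Strategy: gradient-free update of a parameter vector.

    fitness(theta) -> float to be maximized; in an EvoCL-like setting it would be
    evaluated via an adapter network approximating the loss of past tasks.
    """
    rng = rng or np.random.default_rng()
    eps = rng.standard_normal((pop_size, theta.size))          # random search directions
    scores = np.array([fitness(theta + sigma * e) for e in eps])
    scores = (scores - scores.mean()) / (scores.std() + 1e-8)  # normalize fitness values
    grad_est = (scores[:, None] * eps).mean(axis=0) / sigma    # score-weighted direction
    return theta + lr * grad_est
```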

Biography:

Grzegorz Rypeść obtained a double master's degree in computer science engineering from the Warsaw University of Technology and Kyungpook National University in Korea. He is currently a PhD student at IDEAS-NCBR and the Warsaw University of Technology, where he focuses on continual learning. He has published as the first author at the most prestigious machine learning conferences such as NeurIPS, ICLR, ECCV, and IJCAI.

Piotr Ludynia photo

Piotr Ludynia

AGH University of Kraków

Co-authors:

Jakub Adamczyk, Wojciech Czech

Poster 3: Surpassing Complex Models With Simple Graph Feature Extraction

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Peptides, short chains of amino acids, are crucial targets in drug discovery due to their therapeutic relevance in treating cancer, viral infections, and antibiotic-resistant bacteria. In machine learning, peptide datasets have been widely adopted as benchmarks for studying long-range dependencies in graph-based models, serving as testing grounds for increasingly complex long-range Graph Neural Network (GNN) architectures. We challenge this assumption by evaluating count-based molecular fingerprints across 126 datasets and six diverse benchmarks, including Long Range Graph Benchmark (LRGB), Antimicrobial Peptides (AMP) prediction tasks, and general peptide property benchmarks. We evaluate against GNNs, Graph Transformers, and complex multimodal models. Molecular fingerprints are domain-specific, short-range feature extraction methods that detect substructure occurrences in molecular graphs. We use count variants of ECFP, Topological Torsion, and RDKit fingerprints, paired with LightGBM classifier, to achieve state-of-the-art performance. Our results show that these inherently local representations outperform complex models explicitly designed to capture long-range interactions, including models with global attention mechanisms. This finding questions the presumed importance of long-range dependencies in peptide property prediction and demonstrates that simple, efficient models can capture essential biochemical patterns. In addition to improving predictive performance, our approach is highly scalable, requires minimal tuning, and is orders of magnitude faster than deep model training. These results emphasize the need to benchmark novel architectures against strong shallow baselines and reconsider assumptions about the role of long-range interactions in molecular graph learning.
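
A minimal version of the described pipeline, count-based hashed Morgan (ECFP) fingerprints fed to a gradient-boosted tree classifier, can be assembled with RDKit and LightGBM as sketched below. Hyperparameters and the placeholder SMILES are assumptions, and the authors' own scikit-fingerprints library provides an equivalent, more convenient interface.

```python
import numpy as np
import lightgbm as lgb
from rdkit import Chem
from rdkit.Chem import AllChem
from rdkit.DataStructs import ConvertToNumpyArray

def ecfp_counts(smiles_list, radius=2, n_bits=2048):
    """Count-based hashed Morgan (ECFP) fingerprints for a list of SMILES strings."""
    X = np.zeros((len(smiles_list), n_bits), dtype=np.int32)
    for i, smi in enumerate(smiles_list):
        mol = Chem.MolFromSmiles(smi)
        fp = AllChem.GetHashedMorganFingerprint(mol, radius, nBits=n_bits)
        arr = np.zeros((n_bits,), dtype=np.int32)
        ConvertToNumpyArray(fp, arr)   # copy the sparse counts into a dense vector
        X[i] = arr
    return X

# illustrative usage (placeholder data; real peptide benchmarks are far larger)
clf = lgb.LGBMClassifier(n_estimators=500)
# clf.fit(ecfp_counts(train_smiles), train_labels)
```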

Biography:

Piotr is a recent Master's graduate in Machine Learning and Data Science at AGH University of Kraków. He is a member of the AGH ML and Chemoinformatics Group, where he conducts research on vectorization methods for graph data. He is also one of the creators and current maintainers of scikit-fingerprints, a molecular fingerprint Python library for efficient molecular vectorization.

Joanna Wiekiera photo

Joanna Wiekiera

KP Labs, Silesian University of Technology

Poster 4: Continual learning for satellite data analysis

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Data collected during a space mission can change over time, even slightly, due to factors such as sensor degradation or environmental conditions, potentially impacting model performance. To address these challenges, this work explores continual learning techniques for satellite data analysis, focusing on adapting models to evolving conditions without catastrophic forgetting. We implemented and evaluated state-of-the-art approaches, including regularization-based, replay-based, and architectural methods, under class-incremental scenarios for both telemetry and satellite imagery. Experimental results on benchmark datasets demonstrate improved adaptability and robustness compared to static models. These findings highlight the practical potential of continual learning for reliable AI-driven satellite operations under real mission constraints.

Biography:

Joanna Wiekiera holds a Bachelor’s degree in Computer Science from the Silesian University of Technology and is currently pursuing a Master’s degree in Data Science. She has served as President of the Scientific Student Association Data Science for two years, fostering interdisciplinary collaboration and organizing educational initiatives. Professionally, Joanna works as a Machine Learning Specialist at KP Labs, focusing on AI solutions for space-related applications. Her research interests include continual learning, anomaly detection, and AI for scientific discovery. She is particularly interested in applying AI to scientific challenges, such as those in physics.

Lucas Sancéré photo

Lucas Sancéré

University of Cologne

Co-authors:

Katarzyna Bozek

Poster 5: Context-aware skin cancer cell classification based on GNNs

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Recent advancements in Graph Neural Networks (GNNs) have enabled classification tasks on data with graph structures. However, few GNN applications exist for large-scale medical data, particularly in handling million-node graphs. Here we develop a graph-based approach to classify cells in large microscopy megapixel images (whole slide images) of skin tumor samples. While there exist megapixel image segmentation methods for cell classification, these methods rely only on local information and fail to correctly classify cells that are morphologically similar yet functionally different. In skin cancer, tumor cells can only be distinguished from healthy skin cells based on their surrounding tissue structure. We encode tissue structure in 113 megapixel images of skin tumors as graphs where each node represents a cell, is characterized by its morphological and spatial features, and is labelled according to its cell type. We use an individual tissue sample graph as a separate batch during training and train the network to infer cell types of different nodes in the graph. We compared our context-aware graph-based node classification model to a state-of-the-art segmentation model and to other GNNs tailored for large graph node classification tasks. Our study exemplifies how biological structures and their spatial features can be efficiently encoded as nodes within graphs, enabling GNNs to learn rich representations and perform accurate node-level classification.

Biography:

PhD student in AI applied to medical images, Bozek Lab, CMMC, Cologne, Germany. Former engineer at the Curie Institute in Paris in the same field. I grew up next to the esteemed corn fields of Les Landes, south-west France.

Eryk Zarębski photo

Eryk Zarębski

AGH University of Krakow

Poster 6: Vision Transformers for Enhanced Analysis of X-ray Fluorescence Spectra in Cultural Heritage

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Elemental analysis of cultural heritage artifacts offers profound insights into their history, manufacturing techniques, and current state of preservation. X-ray Fluorescence (XRF) spectroscopy, a widely used non-invasive method, plays a central role in such investigations. However, interpreting XRF data, especially from instruments with moderate energy resolution remains a significant challenge. This is particularly true for spectra acquired using the DETART Full-Field XRF (FF-XRF) scanner, developed at AGH University in collaboration with the Polish National Museum in Krakow. The system’s Gas Electron Multiplier (GEM) detector, while effective, exhibits limited energy resolution, which leads to overlapping elemental peaks and spectral artifacts that complicate analysis. To address these challenges, I propose a deep learning-based approach, leveraging established computer vision models, specifically Vision Transformers (ViT), to analyze and interpret raw XRF spectra from the DETART system. This research centers on developing and evaluating algorithms for two key tasks: identifying the elements present (classification) and estimating their relative concentrations (regression). By applying these AI techniques directly to raw spectral data, the approach aims to automate and enhance the analytical process, enabling more accurate and detailed interpretations of the elemental composition of historical artifacts, even in the face of detector limitations. This work highlights the potential of deep learning to transform cultural heritage science.

Biography:

Fields medalist, machine learning researcher, 3000 Elo in chess - these words certainly don't describe me! I am a software engineer with a specialization in scalable, distributed backend systems. Currently, I am pursuing a Master's degree, with my research focusing on machine learning and its applications in physics.

Jan Małaśnicki photo

Jan Małaśnicki

University of Warsaw

Co-authors:

Kamil Ciebiera, Mateusz Boruń, Maciej Pióro, Jan Ludziejewski, Maciej Stefaniak, Michał Krutul, Sebastian Jaszczur, Marek Cygan, Kamil Adamczewski, Jakub Krajewski

Poster 7: μ-Parametrization for Mixture of Experts

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Recent years have seen a growing interest and adoption of LLMs, with μTransfer becoming a key technique for tuning hyperparameters in large-scale training. Meanwhile, Mixture-of-Experts (MoE) has emerged as a leading architecture in extremely large models. However, the intersection of these two advancements has remained unexplored. In this work, we derive a μ-Parameterization (μP) for MoE, providing theoretical guarantees for feature learning across model widths in both the router and experts. We empirically validate our parameterization and further investigate how scaling the number of experts and granularity affects the optimal learning rate.

Biography:

Aspiring researcher specializing in Large Language Models (LLMs), specifically pretraining and Mixture of Experts.

Piotr Wyrwiński photo

Piotr Wyrwiński

PCSS, PUT

Co-authors:

Wiktor Kamzela, Adam Dobosz, Jakub Kubiak, Wojciech Stefaniak

Poster 8: WICHER-M: Model for High-Resolution Weather Prediction

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Accurate weather prediction offers immense benefits for humanity, enabling better preparation, planning, and decision-making across virtually every sector of society. In agriculture, precision farming relies on localized forecasts to guide irrigation, fertilization, and pest control with high spatial accuracy, helping to optimize yields and reduce input waste. In urban flood management, detailed short-range forecasts are essential to predict localized heavy rainfall and prevent flash flooding, especially in densely populated areas with complex drainage systems. For regional aviation and drone operations, fine-scale predictions of wind, fog, and turbulence are crucial for safety and operational efficiency. In the renewable energy sector, hyper-local forecasts allow grid operators to anticipate fluctuations in energy generation and balance supply and demand more effectively. For years, weather prediction was tackled with Numerical Weather Prediction (NWP) methods, resulting in the emergence of global climate models such as GFS or ECMWF. Those models provide broad information about the climate; however, they are unable to give a more detailed prognosis. This is why mesoscale models (e.g. WRF) are frequently applied to their outputs to provide more local and accurate weather predictions. Unfortunately, not only are WRF simulations local, and thus do not represent the whole globe, but NWP calculation is also compute-intensive, especially for mesoscale models. These shortcomings have led to intensified interest in alternative methods for weather prediction, e.g. AI transformer models. In recent years there has been a surge of AI models able to match NWP results, such as GraphCast, Aurora or FourCastNet. They all excel at providing accurate weather predictions for the whole globe at 0.25° resolution, bounded by the resolution of the prevalent dataset used to train those models, ERA5 (available since 1940), while delivering inference speeds thousands of times faster than traditional simulations and maintaining state-of-the-art accuracy. Some of them tried to mitigate the problem of representing large areas like Poland with only a few pixels by bringing the resolution down to 0.1°. They proposed fine-tuning the model on the high-resolution IFS HRES dataset; however, it is available only from 2016, with scarce representation of mesoscale areas. Similarly to the NWP approach, we believe that the solution to this issue lies in the utilization of models such as WRF, which enable us to bring the resolution down even further while preserving the speed-ups achieved by the aforementioned AI models. We propose WICHER-M (Weather Intelligence through Computational High-resolution Environmental Representation Model) to achieve this goal. We utilize both the ERA5 dataset and WRF simulation outcomes matched in time and spatial domain. Using data from 7 years with a granularity of 6-hour timesteps, we fine-tune global prediction models already proficient in modelling the climate to model at a finer scale of 0.025°. We adapt the decoder architecture of the state-of-the-art Aurora model to match the desired resolution and train the head to minimize the loss on WRF simulation data. Utilizing the fact that the Aurora model has a separate pretrained backbone serving as a simulator of a climate model, we integrate it into our solution. To the best of our knowledge, predicting weather at this finer scale has only been achieved for single physical variables, whereas our approach attempts to leverage all available climate data.

Biography:

Piotr Wyrwiński is a PhD student at Poznan University of Technology and a Machine Learning Researcher at the Poznan Supercomputing and Networking Center. His research explores neurosymbolic learning, program synthesis, and graph-based deep learning. At PCSS, he develops and applies deep learning models in areas such as weather prediction, medical imaging, and satellite data analysis.

Paulina Tomaszewska photo

Paulina Tomaszewska

Warsaw University of Technology, Samsung AI Center Warsaw

Poster 9: Leveraging contextual information in Deep Learning

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Recently, the term context engineering was introduced. It emphasizes that we should go beyond simple prompt engineering and pay more attention to context. While context is often associated with text processing, its significance extends to other data types, such as images, time series, and video. This talk provides an overview of diverse approaches to integrating contextual information into Deep Learning models. As an illustration, I will describe in more detail a project that explored the value of expert-driven extraction of regions of interest from large tissue images to improve model accuracy in predicting metastasis occurrence within a specific timeframe. The findings revealed that models trained on whole tissue images (containing wider context) outperformed those relying on labor-intensive expert annotations.

Biography:

Paulina Tomaszewska conducts research at MI2.ai at Warsaw University of Technology, focusing on Deep Learning models used in digital pathology, with an emphasis on explainability and spatial context. She also investigates Large Language Models from the Mechanistic Interpretability perspective in the Safety & Alignment Lab at the Samsung AI Center. She has gained experience in AI through academic stays and research internships in Singapore, South Korea, Austria and Switzerland. She has been a member of the Scientific Committee of the Polish AI Olympiad since its first edition - this year she served as a vice-chair.

Bartosz Cywiński photo

Bartosz Cywiński

Warsaw University of Technology, IDEAS Research Institute

Co-authors:

Bartosz Cywiński, Emil Ryd, Senthooran Rajamanoharan, Neel Nanda, Samuel Marks, Arthur Conmy

Poster 10: Eliciting hidden knowledge from LLMs using mechanistic interpretability

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

As language models grow in power and sophistication, it becomes essential to ensure they remain trustworthy. However, early evidence suggests that some models may try to hide information or even deceive their operators. To explore the ability of current techniques to elicit such hidden knowledge, we create a suite of model organisms engineered with known secrets of varying complexity, such as specific hidden information or a hidden objective. This controlled environment lets us benchmark how models conceal information that they are incentivized to keep hidden. We then compare black-box strategies (such as adversarial prompting) against white-box techniques (such as sparse autoencoders and the logit lens) to see how well each approach elicits hidden knowledge. Our findings highlight the promise of these approaches for eliciting hidden knowledge, even when black-box baselines fall short. Additionally, we assess whether white-box techniques add value to black-box techniques when auditing large language models.
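
One of the white-box tools mentioned above, the logit lens, simply decodes intermediate residual-stream states through the model's unembedding. The sketch below shows the idea for a GPT-2-style model; attribute names such as transformer.ln_f and lm_head are specific to that architecture, and the prompt is just an example.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Logit-lens sketch: project each layer's residual stream through the final layer norm
# and the unembedding matrix to read off the model's "current guess" at every depth.
tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

ids = tok("The capital of Poland is", return_tensors="pt").input_ids
with torch.no_grad():
    hidden = model(ids, output_hidden_states=True).hidden_states  # one tensor per layer

for layer, h in enumerate(hidden):
    logits = model.lm_head(model.transformer.ln_f(h[:, -1]))      # decode the last position
    print(layer, tok.decode(logits.argmax(-1)))
```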

Biography:

I am a PhD student at Warsaw University of Technology working on mechanistic interpretability. I am interested in applied mech interp.

Maciej Pióro photo

Maciej Pióro

IDEAS NCBR / IPPT PAN

Co-authors:

Jan Ludziejewski, Jakub Krajewski, Maciej Stefaniak, Michał Krutul, Jan Małaśnicki, Marek Cygan, Piotr Sankowski, Kamil Adamczewski, Piotr Miłoś, Sebastian Jaszczur

Poster 11: Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Mixture of Experts (MoE) architectures have significantly increased computational efficiency in both research and real-world applications of large-scale machine learning models. However, their scalability and efficiency under memory constraints remain relatively underexplored. In this work, we present joint scaling laws for dense and MoE models, incorporating key factors such as the number of active parameters, dataset size, and the number of experts. Our findings provide a principled framework for selecting the optimal MoE configuration under fixed memory and compute budgets. Surprisingly, we show that MoE models can be more memory-efficient than dense models, contradicting conventional wisdom. To derive and validate the theoretical predictions of our scaling laws, we conduct 270 experiments with up to 2.7B active parameters and up to 5B total parameters. These results offer actionable insights for designing and deploying MoE models in practical large-scale training scenarios. The work was presented at ICML 2025 in Vancouver.

Biography:

Maciej is a researcher and PhD student at IDEAS NCBR and IPPT PAN. He works on large language models and is particularly interested in topics related to LLM capabilities and efficiency - in both training and inference.

Turhan Can Kargin photo

Turhan Can Kargin

Jagiellonian University, GMUM

Co-authors:

Marcin Przewięźlikowski, Wojtek Jasiński, Bartosz Zieliński

Poster 12: Probing the general spatial awareness of vision encoders via equivariance

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Modern visual encoders show strong performance in a wide range of semantic tasks, yet their ability to capture the underlying 3D geometric structure of a scene is not well understood. Although spatial awareness is typically evaluated through the lens of downstream tasks such as depth estimation, this may favor models that memorize dataset-specific priors instead of developing an abstract, generalizable understanding of spatial relations. In this work, we propose a metric for directly evaluating geometric awareness by measuring whether a model's internal representation is equivariant to controlled geometric transformations in the input data. Since it is challenging to precisely label movements occurring in real-world data, we pair our metric with an environment built in Unreal Engine 5. This environment enables precise control over camera movement, object placement, lighting, occlusion, and scene complexity. This allows us to generate synthetic video sequences depicting everyday objects with full geometric annotations and a high variety of confounding factors. Using this data, we systematically evaluate a range of state-of-the-art visual encoders in terms of their ability to represent spatial structure. Our results reveal surprising differences in geometric sensitivity across architectures and training objectives. We are releasing both the metric and the environment as open tools for scalable, label-light evaluation of geometry-aware representation learning.
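
One generic way to formalize such an equivariance probe is as the expected mismatch between the representation of a transformed input and a transformed representation. The formula below is a common template (notation mine), not necessarily the exact metric proposed in the poster.

```latex
% T_g acts on inputs (e.g. a known camera motion g), \rho_g is a (possibly fitted) action
% on the representation space, f is the frozen vision encoder, d a distance.
\mathcal{E}(f) \;=\; \mathbb{E}_{x,\,g}\!\left[\, d\big(f(T_g x),\; \rho_g\!\left(f(x)\right)\big) \right]
```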

Biography:

He is a PhD student in Technical Computer Science at Jagiellonian University, supported by the SONATA BIS grant funded by the Polish National Science Center. His research focuses on self-supervised learning and its applications in robotic perception, with a particular emphasis on developing and evaluating spatially-aware representations. His current work aims to advance the use of synthetic video data and geometric probing techniques to understand better how vision models perceive 3D structure.

Maciej Wojtala photo

Maciej Wojtala

University of Warsaw; IDEAS Research Institute

Co-authors:

Bogusz Stefańczyk, Dominik Bogucki, Łukasz Lepak, Jakub Strykowski, Paweł Wawrzyński

Poster 13: MACTAS: Self-Attention-Based Module for Inter-Agent Communication in Multi-Agent Reinforcement Learning

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Communication is essential for the collective execution of complex tasks by human agents, motivating interest in communication mechanisms for multi-agent reinforcement learning (MARL). However, existing communication protocols in MARL are often complex and non-differentiable. In this work, we introduce a Transformer-based communication module that exchanges information between the agents in MARL by performing self-attention over the hidden states of their recurrent neural networks. Our proposed approach is fully differentiable, allowing agents to learn to generate messages in a reward-driven manner. The module can be seamlessly integrated with any action-value function decomposition method and can be viewed as an extension of such decompositions. Notably, it includes a fixed number of trainable parameters, independent of the number of agents. We also observe the necessity for better exploration techniques for MARL, and introduce a method that connects epsilon-greedy and Boltzmann exploration. Experimental results on the SMAC2v2 benchmark demonstrate the effectiveness of our algorithm, which achieves state-of-the-art performance on several maps. We also achieve a notable increase in performance for the 3s5z_vs_3s6z map after applying the new exploration technique.
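
As a rough illustration of how epsilon-greedy and Boltzmann exploration can be connected, the sketch below replaces the uniform random action of epsilon-greedy with a temperature-controlled softmax over Q-values. This is one plausible combination and may differ from the scheme actually used in MACTAS.

```python
import numpy as np

def eps_boltzmann_action(q_values, eps=0.1, temperature=1.0, rng=None):
    """Hybrid exploration: act greedily with prob. 1 - eps, Boltzmann-distributed otherwise.

    One plausible way to connect epsilon-greedy and Boltzmann exploration (assumption).
    """
    rng = rng or np.random.default_rng()
    q = np.asarray(q_values, dtype=np.float64)
    if rng.random() >= eps:
        return int(q.argmax())                       # exploit the best-valued action
    logits = (q - q.max()) / temperature             # numerically stable softmax
    probs = np.exp(logits) / np.exp(logits).sum()
    return int(rng.choice(len(q), p=probs))          # explore proportionally to value
```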

Biography:

I'm a machine learning researcher with a strong mathematical background. I finished a Master's degree in machine learning and a Master's degree in mathematics. I graduated from a class for exceptionally gifted students in XIV LO im. Stanisława Staszica and successfully competed in the Polish Mathematical Olympiad and the International Mathematics Competition. I conducted research on commutative algebra and algebraic geometry with Dr. Hab. Joachim Jelisiejew, which resulted in the publication "Irreversibility of structure tensors of modules" (https://doi.org/10.1007/s13348-022-00361-w) and the paper "Iarrobino's decomposition for self-dual modules", available on ArXiv (https://arxiv.org/abs/2405.13829). Now I'm conducting research in machine learning. My specialty field is reinforcement learning. With a group from the Polish Academy of Sciences I co-created the Latent Subgoal Search algorithm (which is yet to be extended). I worked on a proactive cloud solver based on RL for 7bulls.com and developed RL methods for investing in currency pairs for AI Investments. Currently I'm pursuing a PhD at the University of Warsaw in the topic of reinforcement learning. I'm also employed at the IDEAS Research Institute. We conduct research on the topic of communication for multi-agent reinforcement learning. I'm also working on the application of reinforcement learning for the tension regulation in hanger rods with a group from the Polish Academy of Sciences under an NCBR grant.

Jolanta Śliwa photo

Jolanta Śliwa

AGH University of Kraków

Poster 14: Benchmark for Ordinal Regression in pen & paper RPG game design

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

In recent years, the pen & paper RPG market has experienced significant growth. As a result, companies are increasingly exploring the integration of AI technologies to enhance player experience and gain a competitive edge. While most research on RPG monster design emphasizes emotional and narrative impact, the field of computer science has contributed few tools to this area. One key challenge is estimating a monster’s challenge level, a task for which no automated solutions currently exist. To address this gap, we introduce a benchmark based on Pathfinder 2e, focused on predicting monster levels using ordinal regression techniques. The dataset consists of approximately 2,600 monsters sourced from official bestiaries, rulebooks, and supplements. Each instance contains a set of key monster attributes selected via domain expertise. The prediction target, monster level, is a discrete, ordered variable, positioning this task at the intersection of classification and regression: an ordinal regression problem. We evaluate a variety of modelling approaches, including a human-inspired model, classical regression with rounding, and dedicated ordinal methods. To account for the chronological nature of the data, we employ an expanding-window-inspired evaluation strategy. Tree-based models consistently outperformed other techniques, while neural approaches yielded disappointing results across all metrics. Model performance was assessed using a combination of classification, regression, and ordinal-specific metrics. Despite variations in absolute error and accuracy, all models demonstrated strong ordinal consistency according to Somers' D. This benchmark provides practical applications for game design automation, monster creation, and educational use in ordinal modelling.

Biography:

I am a recent Data Science graduate from AGH University of Krakow. As part of my engineering and master’s theses, I co-developed an application that supports the design of opponents in a pen & paper RPG game using Machine Learning. For this reason, I have recently been spending my free time playing this type of game, and I also immerse myself in the fascinating world of animation.

Jakub Paszke photo

Jakub Paszke

Adam Mickiewicz University in Poznań

Co-authors:

Daria Dworzyńska, Miłosz Rolewski, Michał Wujec

Poster 15: SARA: An LLM-Powered Agent for Scientific Collaborations and Grant Discovery

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

SARA (Search and Research Agent): LLM-based Recommendation System for Scientific Collaborations and Grants In modern scientific environments, selecting appropriate collaborators and navigating the complex landscape of grant opportunities is both time-consuming and often inefficient. Existing methods heavily rely on informal networks, personal recommendations, and institutional familiarity, making them inherently biased, suboptimal, and inaccessible—especially for early-career researchers. To address these challenges, we propose SARA (Search and Research Agent), an intelligent recommendation system designed to support academic collaboration and funding processes using cutting-edge machine learning techniques. SARA utilizes a combination of Large Language Models (LLMs) with Retrieval-Augmented Generation (RAG) and vector databases to semantically analyze publication metadata, project information, and grant call descriptions. This allows the system to generate meaningful associations between researchers, disciplines, and funding sources, going beyond keyword matching by capturing latent thematic similarities and contextual knowledge. Our system transforms scattered, heterogeneous academic data into a structured, searchable knowledge graph enriched with embeddings. These embeddings enable high-precision recommendations that reflect both the scientific relevance and the collaboration potential of individual researchers. SARA acts as an interactive advisor, allowing users to ask natural language questions such as “I want to apply for a project on AI and sustainability—who should I work with?” or “Which calls in the next 6 months best match my expertise in deep learning?” The system responds with tailored suggestions, highlighting suitable collaborators, active and upcoming calls, and areas of strategic opportunity. Its architecture includes NLP-based entity recognition and classification, citation network analysis, and dynamic reranking based on topical overlap, project success likelihood, and institutional compatibility. Unlike existing academic search engines, our approach offers personalized, proactive guidance. The system not only retrieves data but interprets it in the user’s research context. It supports interdisciplinary queries, considers researchers’ grant histories, institutional affiliations, and scientific impact, and includes modules for grant-specific adaptation and call evaluation criteria modeling. A key innovation is the semantic integration of publication networks and grant programs into one framework, enabling robust matching even across different scientific domains. We present results from a functioning prototype tested on real-world academic and grant data. Preliminary evaluations demonstrate promising outcomes in recommendation quality, explainability, and usability. Our poster will illustrate the full pipeline—from data ingestion and vectorization, through model architecture and interaction design, to deployment challenges and ethical considerations (e.g., bias, data privacy, hallucination risks). We believe that SARA offers a scalable, generalizable model for enhancing the equity, efficiency, and transparency of research team formation and grant application planning.

Biography:

Jakub Paszke is a Master’s student in Artificial Intelligence at Adam Mickiewicz University in Poznań, Poland. His interests include natural language processing, semantic search, and AI-powered recommendation systems. He is currently developing SARA, an LLM-based system for improving scientific collaboration and grant discovery. As an Automation Engineer, Jakub builds intelligent workflows using scripts, RAG systems, AI chatbots, and no-code tools for automations.

Dominik Lewy photo

Dominik Lewy

Lingaro

Poster 16: Lost in the Details: GenAI’s Visual Blind Spots and Paths to Clarity

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

As Large Multimodal Models (LMMs) become increasingly central to visual understanding in retail and marketing contexts, critical limitations in their ability to interpret detailed visual information are emerging. This presentation examines the “visual blind spots” of generative AI systems when applied to tasks requiring high precision and contextual awareness. Use cases such as promotional shelf analysis, point-of-sale marketing material recognition, and full-shelf product parsing reveal common failure modes—including confusion between visually similar products, imprecise localization, and sensitivity to image size and perspective. To mitigate these issues, the presentation explores strategies that leverage classical computer vision techniques to enhance LMM inputs. These include preprocessing methods such as depth filtering, morphological operations, and perspective normalization, as well as text-guided segmentation and region-of-interest (ROI) refinement. This hybrid approach aims to improve clarity and accuracy in downstream tasks while reducing reliance on large volumes of training data, paving the way for more robust and scalable multimodal systems.
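
As a rough illustration of the classical preprocessing mentioned above (perspective normalization and morphological clean-up before handing a crop to an LMM), here is a minimal Python/OpenCV sketch. The file names, corner coordinates, and output size are assumptions, not values from the presented work.

# Minimal sketch of classical preprocessing before LMM input (assumed values).
import cv2
import numpy as np

img = cv2.imread("shelf.jpg")                      # hypothetical input image

# Perspective normalization: map four (assumed) shelf corners to a flat rectangle.
src = np.float32([[120, 80], [1800, 60], [1850, 950], [90, 990]])
dst = np.float32([[0, 0], [1600, 0], [1600, 900], [0, 900]])
M = cv2.getPerspectiveTransform(src, dst)
flat = cv2.warpPerspective(img, M, (1600, 900))

# Morphological opening on a rough binary mask to remove small speckles.
gray = cv2.cvtColor(flat, cv2.COLOR_BGR2GRAY)
_, mask = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
kernel = cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (5, 5))
clean = cv2.morphologyEx(mask, cv2.MORPH_OPEN, kernel)

cv2.imwrite("shelf_normalized.png", flat)
cv2.imwrite("shelf_mask.png", clean)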

Biography:

Dominik has over 11 years of hands-on experience in Machine Learning, Data Exploration, and Business Analysis projects, primarily in the FMCG industry. As a seasoned technical leader, he excels in setting strategic goals and crafting detailed roadmaps for complex projects. Dominik holds a PhD with distinction from the Warsaw University of Technology, where he specialized in neural networks for image processing. Passionate about bridging the commercial and academic worlds, he combines cutting-edge research with practical applications to drive innovation. For over 1.5 years, Dominik has been deeply immersed in the Generative AI space, successfully delivering multiple proof-of-concept projects and advancing several initiatives to production. His expertise and curiosity make him a key driver of transformative AI solutions.

Maria Galanty photo

Maria Galanty

University of Amsterdam

Co-authors:

Björn van der Ster, Alexander P. Vlaar, Clara I. Sánchez

Poster 17: Leveraging ECG Foundation Model for ICU Rhythm Classification

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Foundation models have demonstrated significant potential across a range of applications by enabling the development of robust and transferable representations. The recently released electrocardiography (ECG) foundation model ECG-FM [2] offers pre-trained and fine-tuned checkpoints that can be adapted to a variety of downstream clinical tasks. In this work, we assess its applicability for classifying cardiac rhythms in intensive care unit (ICU) patients using data from Amsterdam University Medical Center. Our focus is on detecting sinus rhythm (SR) (including normal sinus rhythm, sinus tachycardia, and sinus bradycardia) and atrial fibrillation (AF), a distinction crucial for monitoring in intensive care. First, we created a dataset reflecting clinical reality: a representative dataset aligned with nursing documentation, in which heart rhythm is recorded hourly by nurses. Our dataset includes 592 ten-second ECG segments independently annotated by two junior clinicians, with conflicting annotations resolved by a senior expert (361 SR, 113 AF and 118 other rhythm samples). Additionally, we constructed a larger, weakly labelled ECG dataset based on structured nursing documentation. We evaluated ECG-FM performance across three settings: (1) out-of-the-box inference using two ECG-FM models fine-tuned on the large-scale public datasets PhysioNet Challenge 2021 [3] and MIMIC-IV-ECG [1]; (2) targeted fine-tuning using two high-quality datasets from PhysioNet Challenge 2021, PTB-XL and Chapman-Shaoxing, which were selected based on their internal label consistency; and (3) fine-tuning using the PTB-XL and Chapman-Shaoxing datasets along with the weakly annotated in-house dataset. Models fine-tuned on the high-quality PhysioNet datasets, specifically PTB-XL and Chapman-Shaoxing, outperformed those trained on broader but noisier sources, including the full PhysioNet repository and the MIMIC-IV-ECG dataset. On internal AMC data, the PhysioNet-fine-tuned model achieved F1 scores of 0.45 for SR and 0.80 for AF. The MIMIC-IV-fine-tuned model performed better for SR (F1 = 0.68) but worse for AF (F1 = 0.52). In contrast, fine-tuning the model specifically for SR and AF using the two selected datasets resulted in significantly improved performance: F1 scores of 0.91 for SR and 0.82 for AF on the AMC test set. Augmenting training with weakly labelled in-house data did not yield further gains, with F1 scores of 0.93 (SR) and 0.81 (AF), comparable to the model trained without the additional data. These findings underscore the potential of ECG foundation models for clinical use but also highlight the need for institution-specific validation. While these models are capable of learning powerful and generalizable representations, their performance is ultimately constrained by the quality of the data they are trained on. Public datasets vary widely in annotation accuracy and labelling standards, which can significantly affect downstream model performance, generalizability, and safety. As such, even with strong pre-trained foundations, deployment in critical care settings must be accompanied by careful local evaluation and, when necessary, additional fine-tuning. References: [1] B. Gow et al., MIMIC-IV-ECG: Diagnostic ECG Matched Subset, 2023. [2] K. McKeen et al., ECG-FM: An Open ECG Foundation Model, arXiv:2408.05178, 2024. [3] M. A. Reyna et al., PhysioNet/CinC Challenge 2021, Comput. Cardiol., 2021.

Biography:

Maria Galanty holds two Bachelor’s degrees (BSc) from the University of Warsaw, in Mathematics and Cognitive Science. She later earned her Master’s degree (MSc) in Artificial Intelligence from Utrecht University. In June 2022, she joined the qurAI Group as a PhD candidate within the Intensive Care Lab, under the supervision of Clarisa Sanchez. Her PhD is part of the University of Amsterdam’s Research Priority Agenda AI for Health Decision-Making, a cross-disciplinary collaboration between the Faculties of Medicine, Science, Humanities, and Law. Her research explores bias in healthcare, the documentation practices of medical datasets, and applications in intensive care settings—particularly those involving electrocardiography (ECG).

Michał Brzozowski photo

Michał Brzozowski

Samsung AI Center Warsaw, Safety & Alignment Lab

Co-authors:

Zuzanna Dubanowska, Maciej Żelaszczyk, Paolo Mandica, Michał Karpowicz

Poster 18: Representation-based Broad Hallucination Detectors Fail to Generalize Out of Distribution

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

We critically assess the efficacy of the current SOTA in hallucination detection and find that its performance on the RAGTruth dataset is largely driven by a spurious correlation with data. Controlling for this effect, state-of-the-art performs no better than supervised linear probes, while requiring extensive hyperparameter tuning across datasets. Out-of-distribution generalization is challenging, with all of the analyzed methods performing close to random. We propose a set of guidelines for hallucination detection and its evaluation. The work has been accepted to EMNLP Findings.
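
The supervised linear probe baseline mentioned above can be illustrated with a short Python sketch. The random arrays stand in for per-example hidden states and hallucination labels; they are assumptions for illustration only, not the authors' data or code.

# Minimal linear-probe baseline on hidden states (placeholder data, not the paper's setup).
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
hidden = rng.normal(size=(2000, 4096))      # stand-in for last-token hidden states
labels = rng.integers(0, 2, size=2000)      # 1 = hallucinated, 0 = supported

X_tr, X_te, y_tr, y_te = train_test_split(hidden, labels, test_size=0.2, random_state=0)
probe = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
print("AUROC:", roc_auc_score(y_te, probe.predict_proba(X_te)[:, 1]))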

Biography:

Mathematician turned AI researcher. PhD in probability from the University of Warsaw. I am a senior research engineer at the Samsung Warsaw AI Center, where I work in the Safety & Alignment Lab, focusing on natural language processing, hallucination detection and mechanistic interpretability. In my free time I write science-fiction and grotesque short stories.

Mathilde Vergnaud photo

Mathilde Vergnaud

University of Cologne

Co-authors:

Katarzyna Bozek, Felix Bock, Katrina Crompton

Poster 19: Quantitative approaches in vessel morphology analysis in corneal eye diseases

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

A healthy cornea is transparent, avascular and without strong immune responses. When this immunologic privilege is broken by the appearance of blood vessels in the cornea, the chances of recovery from an eye disease diminish. In this project we quantify the degree of vascularization in the human eye and use it as a biomarker of disease progression and response to treatment. We base our analysis on slit lamp microscope images – a non-invasive and affordable technique to capture neovessels in the cornea. We analyze 160 images from 30 patients imaged over a time span of 6 months. Different cohorts of patients received varying doses of an antisense oligonucleotide (GS-101) as a local treatment (N=1) to inhibit corneal neo-vascularisation. Segmentation of blood vessels in these images is a challenging task since vessels can be very similar to the crypts in the iris or to the suture points of patients after cornea transplant. We developed a UNet-based solution with an adapted loss weighting scheme that penalizes errors in the image regions that are particularly challenging to segment. In the resulting segmentation maps we quantify a range of morphometric parameters that capture the density and structure of the underlying vascular system. We next compare the parameter value change across patients over time and develop predictive methods that, based on the time-resolved vessel maps, predict further disease development. Our approach, by combining several machine learning approaches, represents the first fully automated and systematic approach to cornea blood vessel structure quantification and its use in patient outcome prediction.
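
The loss weighting scheme described above can be sketched in a few lines of PyTorch: the per-pixel loss is multiplied by a weight map that up-weights hard-to-segment regions. The tensor shapes and weight values below are assumptions for illustration, not the authors' configuration.

# Sketch of a per-pixel weighted segmentation loss (assumed weights), in PyTorch.
import torch
import torch.nn.functional as F

def weighted_bce_loss(logits, target, weight_map):
    """logits, target, weight_map: tensors of shape (B, 1, H, W)."""
    per_pixel = F.binary_cross_entropy_with_logits(logits, target, reduction="none")
    return (per_pixel * weight_map).mean()

logits = torch.randn(2, 1, 256, 256)                     # stand-in model output
target = torch.randint(0, 2, (2, 1, 256, 256)).float()   # stand-in vessel mask
weight_map = torch.ones_like(target)
weight_map[..., 100:160, 100:160] = 5.0                  # up-weight a hard region
print(weighted_bce_loss(logits, target, weight_map))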

Biography:

I hold a master's degree in image processing applied to biomedicine. Since 2024, I have been a PhD student in Computer Science at the Bozek Lab, University of Cologne. My research focuses on the development of machine learning methods to characterize and quantify eye diseases from medical images.

Joanna Ceklarz photo

Joanna Ceklarz

UCT Prague

Co-authors:

Krystyna Waniová, Agnieszka Wojtuch, Wim Dehaen, Tomasz Danel, Martin Šícho

Poster 20: Explainable AI for Pharmacophore-Based Drug Activity Prediction

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Pharmacophores are molecular representations containing information about the steric and electronic features necessary for the biological activity of drugs. They are used by medicinal chemists to identify and visualize important fragments and to generalize between groups with similar functionalities. Therefore, using pharmacophores as representations of molecules for Graph Neural Network (GNN) training is an interesting prospect which has so far remained largely unexplored. We compare the performance of selected GNNs trained on pharmacophore representations of molecules with those trained on conventional atomic representations, as well as with a baseline model. Then, we investigate how those models compare when trained on datasets of varied sizes, and on ones containing different numbers of clusters of molecules. Finally, we developed GNN-specific xAI methods to answer questions about both the feature and structural importance of functional groups in known bioactive compounds. In our case study, pharmacophore features attributed with the highest importance for the activity were directly compared with protein-ligand crystal structures, where the interactions described by the pharmacophore models of molecules are experimentally revealed. The results helped us to identify areas for further improvement in our molecular featurization – some areas experimentally recognized as important for function were not encoded with appropriate features when default feature definitions were applied. Interestingly, we found that different GNN models rely on overlapping, yet not identical, pharmacophore features when making predictions, while all being in partial agreement with experimental data.

Biography:

Joanna holds a Master's degree in Medicinal Chemistry from Jagiellonian University in Kraków, Poland (2021). After graduation, she joined the AstraZeneca R&D Graduate Programme in Gothenburg, Sweden, where she became interested in the intersection of machine learning and drug discovery. This led her to transition from traditional laboratory-based research into chemoinformatics and ML. In 2024, she began her PhD at the University of Chemistry and Technology in Prague, focusing on the application of ML to drug development, with a particular interest in model explainability techniques.

Florian Bürger photo

Florian Bürger

University of Cologne

Co-authors:

Adrián E. Granada, Katarzyna Bozek

Poster 21: Explainable Prediction of Molecular Events in Microscopic Videos

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

A central challenge in cancer treatment is that a subpopulation of tumor cells survives the initial therapeutic treatment, eventually leading to relapse and regrowth. To address this challenge and improve treatments, it is essential to determine the features that distinguish surviving from non-surviving cells. In this study, we investigate non-genetic predictors of cell fate using a genetically identical population of cells exposed to uniform treatment conditions. Although genetically and environmentally identical, the cells display distinct outcomes, with some undergoing apoptosis (death) and others mitosis (division). We introduce a Transformer model that predicts cell fate (death or division) based on 93-hour-long time-lapse recordings of 21898 cells that were treated with a chemotherapeutic drug. The time-lapse data comprises cell morphology, its neighborhood, and expression of several functional markers. We also mask up to 50% of the ends of the sequences to obtain a generalized model. Our model predicts cell fate with an F1-score of 0.93 on the test set. We next introduce a novel explainability approach that leverages the self-attention mechanism, allowing us to uncover and analyze the most predictive visual cues for each outcome. We quantify morphology, protein expression, and neighborhood features of each cell over time and, using self-attention, statistically compare these features in the predictive vs. remaining time points. Importantly, attention consistently peaks within a very narrow time window: from 1 hour before cell division up to the moment of division, and from 10 hours before death leading up to the moment of death. We identify the most important cell characteristics that occur during these time windows as primarily features relating to cell size, but also shape descriptors and specific biological markers. Our study demonstrates a Transformer-based predictive model with a rigorous explainability pipeline and points to yet unknown cancer cell features that are key for the cell response to chemotherapeutic treatment.

Biography:

Since 2023: PhD student in Computer Science at the Bozek Lab at the University of Cologne. 2020 - 2023: M.Sc. in Computer Science at the University of Paderborn. 2016 - 2020: B.Sc. in Computer Science at the University of Paderborn.

Konrad Duraj photo

Konrad Duraj

Hemolens Diagnostics

Co-authors:

Szymon Kopeć, Jakub Chojnacki, Maciej Zamorski

Poster 22: Graph-Unet-Transformer for predicting pressure drops in cardiovascular networks

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Predicting pressure drops across arterial networks is critical for noninvasive assessment of vascular health and the early detection of potential hemodynamic dysfunctions. Computational fluid dynamics (CFD) approaches offer high fidelity but come with high computational cost and the need for detailed anatomical models. Patient-specific vascular geometries are represented as attributed graphs, where nodes correspond to vessel cross-sections (with features such as diameter and cross-sectional area), and edges represent the length of each segment. In this study, we propose a hybrid deep-learning framework that combines a UNET-style graph convolutional neural network (GCN) encoder with a Transformer architecture for pressure drop prediction. The UNET-GCN encodes the topological structure and geometrical features of arterial trees, while the Transformer captures long-range dependencies among vessel segments. Our model achieves a root mean squared error (RMSE) of 5.65 mmHg and an intersection over union (IoU) associated with fractional flow reserve (FFR) of 0.94 on the validation set, demonstrating that the combination of GCNs and Transformers offers a promising and scalable solution for rapid, noninvasive hemodynamic assessment in personalized medicine.
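
The attributed-graph representation described above can be sketched with PyTorch Geometric: nodes carry cross-section features, edges carry segment lengths, and a node-level target holds the pressure values. All numbers below are placeholder assumptions, not values from the study.

# Sketch: an arterial tree as an attributed graph (placeholder values), PyTorch Geometric.
import torch
from torch_geometric.data import Data

# Node features: [diameter_mm, cross_sectional_area_mm2] per vessel cross-section.
x = torch.tensor([[3.2, 8.0],
                  [2.9, 6.6],
                  [2.1, 3.5],
                  [1.8, 2.5]], dtype=torch.float)

# Edges connect consecutive cross-sections; edge_attr holds segment length in mm.
edge_index = torch.tensor([[0, 1, 1],
                           [1, 2, 3]], dtype=torch.long)
edge_attr = torch.tensor([[12.0], [9.5], [11.0]], dtype=torch.float)

# Node-level regression target, e.g. pressure drop relative to the inlet (mmHg).
y = torch.tensor([[0.0], [2.1], [5.4], [4.8]], dtype=torch.float)

graph = Data(x=x, edge_index=edge_index, edge_attr=edge_attr, y=y)
print(graph)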

Biography:

A graduate of the Silesian University of Technology with a specialization in the application of AI in medicine. His extensive commercial experience includes participation in a variety of projects, where he had the opportunity to develop, among others: an algorithm for detecting, identifying, and counting bacteria in images of Petri dishes; a library for training and evaluating satellite image super-resolution techniques; a device for monitoring vital signs in home settings; an algorithm for assessing the correctness and safety of exercises performed by athletes; and a system for monitoring methane emissions from flare stacks at extraction facilities. He is the author of numerous publications and studies related to the use of advanced deep learning algorithms for comprehensive processing of medical data.

Marek Skiba photo

Marek Skiba

University of Warsaw

Jowita Drozdowicz photo

Jowita Drozdowicz

Co-authors:

Vladimir Zaigrajew, Przemysław Biecek, Piotr Sankowski

Poster 23: SAEPER: Sparse AutoEncoders Projection for Reliable PERsonalization in Diffusion Models

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Diffusion models are the leading architecture for text-to-image generation due to their ability to synthesize highly detailed images and their flexibility for adaptation through fine-tuning. However, reliably personalizing these models to generate consistent and precise visual concepts remains a challenge. Existing personalization approaches either use zero-shot methods, which often lack visual accuracy and consistency, or fine-tuning techniques, which can suffer from training instability, concept entanglement, and extensive parameter tuning. We propose SAEPER, a novel personalization approach for transformer-based diffusion models that integrates Sparse Autoencoders (SAEs) directly into the fine-tuning process. By leveraging the powerful ability of SAEs to disentangle polysemantic representations into sparse ones, we can isolate and target the specific features relevant for personalization while minimally altering global image semantics. Our experiments demonstrate that incorporating SAEs not only increases the precision and visual fidelity of personalized concepts but also provides a consistently stable training environment. This work establishes that SAEs are not just tools for interpretability, but also emerge as a reliable and powerful component for fine-tuning and personalization, marking entirely new directions for future research.
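
For readers unfamiliar with Sparse Autoencoders, the following minimal PyTorch sketch shows the general mechanism of one: an overcomplete encoder with non-negative activations, a decoder, and an L1 sparsity penalty. Dimensions and coefficients are assumptions; this is not the SAEPER implementation.

# Minimal sparse autoencoder sketch (not the SAEPER implementation); sizes are assumed.
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    def __init__(self, d_model=1024, d_hidden=8192):
        super().__init__()
        self.encoder = nn.Linear(d_model, d_hidden)
        self.decoder = nn.Linear(d_hidden, d_model)

    def forward(self, activations):
        codes = torch.relu(self.encoder(activations))   # sparse, non-negative features
        recon = self.decoder(codes)
        return recon, codes

sae = SparseAutoencoder()
acts = torch.randn(32, 1024)                            # stand-in for transformer activations
recon, codes = sae(acts)
loss = ((recon - acts) ** 2).mean() + 1e-3 * codes.abs().mean()   # reconstruction + L1 sparsity
loss.backward()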

Biography:

Marek Skiba is a machine learning researcher and software engineer currently pursuing an MSc in Machine Learning at the University of Warsaw. He has a strong background in AI, diffusion models, and large-scale software development. Marek previously led a quantitative research team, focusing on large-scale financial data analytics. He is a passionate competitive programmer and a bronze medalist at the International Olympiad in Informatics. His research interests include text-to-image diffusion models, explainable AI, and efficient deep learning for edge devices.

Jan Dubiński photo

Jan Dubiński

Warsaw University of Technology; NASK National Research Institute

Co-authors:

Michel Meintz, Franziska Boenisch, Adam Dziedzic

Poster 24: Radioactive Watermarks in Diffusion and Autoregressive Image Generative Models

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Image generative models have become increasingly popular, but training them requires large datasets that are costly to collect and curate. To circumvent these costs, some parties may exploit existing models by using the generated images as training data for their own models. In general, watermarking is a valuable tool for detecting unauthorized use of generated images. However, when these images are used to train a new model, watermarking can only enable detection if the watermark persists through training and remains identifiable in the outputs of the newly trained model — a property known as radioactivity. We analyze the radioactivity of watermarks in images generated by diffusion models (DMs) and image autoregressive models (IARs). We find that existing watermarking methods for DMs fail to retain radioactivity, as watermarks are either erased during encoding into the latent space or lost in the noising-denoising process (during the training in the latent space). Meanwhile, despite IARs having recently surpassed DMs in image generation quality and efficiency, no radioactive watermarking methods have been proposed for them. To overcome this limitation, we propose the first watermarking method tailored for IARs and with radioactivity in mind — drawing inspiration from techniques in large language models (LLMs), which share IARs' autoregressive paradigm. Our extensive experimental evaluation highlights our method's effectiveness in preserving radioactivity within IARs, enabling robust provenance tracking, and preventing unauthorized use of their generated images.

Biography:

Jan Dubiński works on developing safe and trustworthy artificial intelligence. His research focuses on generative models, such as large language models and generative vision systems. His work has been published at leading AI conferences, including NeurIPS, CVPR, ICML, AAMAS, and WACV. Jan is currently pursuing a PhD at the Doctoral School of Warsaw University of Technology. He works at NASK National Research Institute at the Department of Security and Transparency of Artificial Intelligence.

Piotr Wójcik photo

Piotr Wójcik

University of Cologne

Co-authors:

Joanna Kaleta, Kacper Marzol, Tomasz Trzcinski, Kacper Kania, Marek Kowalski

Poster 25: LumiMotion: Improving Gaussian Relighting with Scene Dynamics

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

In 3D reconstruction, the problem of inverse rendering, namely recovering the illumination of the scene and the material properties, is fundamental. Existing Gaussian Splatting-based methods primarily target static scenes and often assume simplified or moderate lighting to avoid entangling shadows with surface appearance. This limits their ability to accurately separate lighting effects from material properties, particularly in real-world conditions. We address this limitation by leveraging dynamic elements - regions of the scene that undergo motion - as a supervisory signal for inverse rendering. Motion reveals the same surfaces under varying lighting conditions, providing stronger cues for disentangling material and illumination. To this end, our contributions are threefold. First, we present the first Gaussian-based approach, LumiMotion, for inverse rendering in dynamic scenes. Our method learns a dynamic 2D Gaussian Splatting representation that promotes smooth normals and temporal surface consistency. Combined with a deferred shading pipeline, this enables accurate material estimation. Second, we introduce a set of novel constraints on the deformation network, which encourage dynamic regions to deform while keeping static regions stable, promoting consistent albedo estimation. Third, we release a new synthetic benchmark comprising five scenes under four lighting conditions, each in both static and dynamic variants, enabling systematic evaluation of inverse rendering methods in dynamic environments.

Biography:

I am a PhD student at the University of Cologne, currently working at the Center for Molecular Medicine. My research focuses on unsupervised learning, explainable AI, and the analysis of biomedical images.

Wojciech Zarzecki photo

Wojciech Zarzecki

Computational Medicine Group, MIMUW

Co-authors:

Paulina Szymczak, Ewa Szczurek, Kamil Deja

Poster 26: Interpretable Protein Design: Disentangling RFDiffusion Features using Sparse Autoencoders

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Controllable de novo design of protein backbones poses a critical challenge. Recent advances in protein design such as RFDiffusion can generate high-quality structures, but offer little insight into which internal representations encode specific structural or functional attributes. To address this challenge, we applied a Sparse Autoencoder (SAE) to the internal activations of a chosen RFDiffusion block. Through systematic ablation studies, we first identified the specific network block that most strongly encodes information about subcellular localization and enzymatic function. We then trained an SAE on this block's activations to disentangle its complex representations into interpretable neuron-level features, and trained probing models to identify neurons corresponding to specific biological properties of interest. We plan to intervene by blocking or reinforcing selected features, with the aim of influencing generation, and to extend this study by examining the secondary structure of generated proteins. Having identified the block responsible for our target properties, we now aim to disentangle its channels into ones responsible for the properties of interest.

Biography:

Wojciech Zarzecki is a Computer Science student at the Warsaw University of Technology and a member of the Computational Medicine Group at MIMUW led by Prof. Ewa Szczurek. In his research he investigates internal representations of deep learning models for life sciences.

Zuzanna Gawrysiak photo

Zuzanna Gawrysiak

Vestigit, Poznan University of Technology

Co-authors:

Tomasz Hawro, Mateusz Gabor

Poster 27: Invisible Yet Invincible: A Fast Deep Learning Approach to Image Watermarking

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

We introduce a novel deep learning architecture for image watermarking, designed for high efficiency and robustness. Our approach integrates a skip-layer excitation module to enhance feature representation efficiently and uses conditional batch normalisation to adapt the watermarking process to different messages. We leverage message spreading and mask generation techniques to embed the watermark with minimal perceptual distortion. The resulting architecture is not only lightweight and fast but also highly robust against common distortions like blur, cropping, and JPEG compression. Evaluations on standard benchmark datasets confirm our model's competitive performance. By effectively balancing perceptual fidelity, robustness, and computational speed, this work offers a practical solution for applications in digital rights management and content authentication.

Biography:

Zuzanna Gawrysiak is an AI engineer and PhD student at Poznan University of Technology, working on deep learning for image and video processing. She currently works at Vestigit, developing efficient video watermarking methods. Her research interests are focused on computer vision, specifically neuro-symbolic autoencoders and domain-aware machine learning methods.

Mateusz Pyla photo

Mateusz Pyla

Jagiellonian University

Co-authors:

Stanisław Jastrzębski

Poster 28: Understanding Generalization-Memorization Trade-offs, Learning Dynamics, and Expert Democratization in Mixture of Experts Architectures

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Mixture of Experts (MoE) architectures have emerged as a promising paradigm for scaling neural networks while maintaining computational efficiency through sparse activation. However, recent findings reveal fundamental tensions between the benefits of expert specialization and the risks of overfitting that challenge conventional wisdom about MoE design. This work provides an analysis of MoE architectures through the lens of generalization versus memorization trade-offs, examining how expert routing decisions influence model behavior across different task complexities.

Biography:

Mateusz is pursuing his PhD in AI at Jagiellonian University. He holds a Master’s degree from Dauphine-PSL Université in Paris and a Bachelor’s degree from the University of Edinburgh. His research expertise includes optimization, mechanistic interpretability, and Bayesian learning.

Jarosław Janas photo

Jarosław Janas

Institute of Computer Science Polish Academy of Sciences

Co-authors:

Paweł Morawiecki, Josef Pieprzyk

Poster 29: A Multi-bit Watermarking Scheme for LLM-Generated Short Texts

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

The rapid advancement of Large Language Models (LLMs) has established them as a foundational technology for many AI- and ML-powered human–computer interactions. A critical challenge in this context is the attribution of LLM-generated text—for example, identifying the specific language model that generated it or the individual user who prompted the model. This capability is essential for combating misinformation, fake news, misinterpretation, and plagiarism. One of the key techniques for addressing this challenge is digital watermarking. We propose a multi-bit watermarking scheme, in which multi-bit information (e.g., a user’s ID) is embedded into LLM-generated text. In particular, we target short texts (around 100 words), where other methods do not offer a very high matching rate. Our approach could be especially useful for dealing with fake product reviews, social media posts, and customer feedback, where malicious actors often deploy automated systems to generate deceptive content at scale.

Biography:

I am a PhD student at the Doctoral School of Information and Biomedical Technologies, Polish Academy of Sciences (TIB PAN), where I conduct research on data privacy and ownership in generative neural networks, with a particular focus on Stable Diffusion and Large Language Models. I hold an MSc in Artificial Intelligence from the University of Galway (2023, First Class Honours), where my thesis explored Vision Transformer-based semantic segmentation of bone composition. My current research investigates membership inference attacks and watermarking techniques in generative models, addressing critical issues related to data ownership in AI systems. I have industry experience as a software engineer and teaching assistant, with strong expertise in Python, deep learning frameworks including PyTorch and TensorFlow, and machine learning applications.

Emilia Kaczmarczyk photo

Emilia Kaczmarczyk

University Of Warsaw, Faculty of Physics

Co-authors:

Michał Wiliński, Artur Dubrawski

Poster 30: Analyzing Latent Representations in Time Series Foundation Models

Thursday / 16 October 12:15 - 13:45 (Poster Session 1)

Abstract:

Time series foundation models are becoming important tools for solving a wide range of time series problems. As these models become more widely used, it is increasingly important to study and understand their internal representations and the concepts they learn. One method for exploring these internal mechanisms is through the use of steering vectors, which allow targeted interventions in the model’s latent space. For concepts that are linearly represented in the latent space, steering vectors are defined at each layer as the difference between the median activation matrices of contrasting time series concepts. By modifying the model’s internal representations using these vectors, we can influence or adjust its predictions. This study aims to analyze dependencies between various steering vectors using hyperspherical coordinates. In addition, we plan to investigate how both linear and nonlinear correlations are represented in latent space, and whether additive relationships between time series might be preserved. This approach offers a novel perspective for interpreting concept representations and their interactions, with the goal of making time series foundation models more transparent and interpretable.
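
The steering-vector construction described above (difference of median activations between contrasting concepts, added to a hidden state at inference) can be sketched in a few lines of PyTorch. The activation arrays and concept choice are placeholder assumptions, not data from the study.

# Sketch: steering vector as the difference of median activations (placeholder data).
import torch

# Stand-ins for layer activations collected on two contrasting time series concepts,
# shape (num_examples, hidden_dim).
acts_concept_a = torch.randn(500, 512) + 0.3
acts_concept_b = torch.randn(500, 512) - 0.3

steering_vector = acts_concept_a.median(dim=0).values - acts_concept_b.median(dim=0).values

def steer(hidden_state, alpha=1.0):
    """Shift a hidden state toward concept A by adding the scaled steering vector."""
    return hidden_state + alpha * steering_vector

h = torch.randn(512)
print(torch.cosine_similarity(steer(h) - h, steering_vector, dim=0))  # ~1.0 by construction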

Biography:

Emilia Kaczmarczyk is a Master's student at the Faculty of Physics, University of Warsaw. She has experience in quantitative trading and neuroscience and currently actively participates in research at the Auton Lab at Carnegie Mellon University and the MI2 Lab at Warsaw University of Technology.

Cezary Dołęga photo

Cezary Dołęga

Neurosoft Sp. z o.o.

Co-authors:

Paweł Mrówka, Michał Pietrasik

Poster 31: Practical Deep ANN Implementations on Embedded Devices

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

In the era of AI, embedded systems require high-performance, low-power solutions for real-time signal processing. This poster presents practical implementations of deep artificial neural networks (Deep ANN) on embedded platforms using Ambarella CV22s and Hailo-8™ neural coprocessors. We detail Ambarella CV2x’s energy-efficient video analytics (< 2 W) and Hailo-8’s up to 26 TOPS at 3 TOPS/W, along with their software stacks enabling seamless model deployment. Applications include Automatic Number Plate Recognition (ANPR) for tolling and enforcement, Make and Model Recognition (MMR) to detect vehicle attributes when plates are occluded, and vehicle tracking across camera networks for trajectory generation and analytics. Additionally, we explore multi-sensor fusion architectures that integrate data from 3D LiDAR, stereoscopic cameras, and inertial measurement units (IMU) to enhance perception capabilities in drone applications. Leveraging neural coprocessors for on-device sensor fusion enables real-time depth mapping, obstacle avoidance, and precise state estimation under strict power and latency constraints. We demonstrate a pipeline where point clouds from a miniaturized LiDAR and disparity maps from stereo vision are synchronized with IMU readings. This approach proves the feasibility of deploying advanced sensor fusion on edge devices for autonomous UAVs, enabling robust navigation and situational awareness even in GPS-denied or cluttered scenarios.

Biography:

Founder and co-owner of Neurosoft. Holds an M.Sc. in Electronics Engineering, graduating from Wrocław University of Science and Technology in 1990. Served as a research associate in the Faculty of Electronics at the Institute of Technical Cybernetics from 1990 to 1999. Since the company’s founding, he has been Vice President of the Management Board and Director of Research and Development.

Jeremy Cochoy photo

Jeremy Cochoy

Redtone Solution OU

Poster 32: Contrastive Forecasting: Latent-Space Prediction for Time Series via Joint Embedding

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

This talk introduces Contrastive Forecasting, an unsupervised method for time series prediction that learns entirely in latent space. The approach is grounded in contrastive divergence and uses a joint-embedding predictive architecture (JEPA) to align predicted future states with actual outcomes while distinguishing them from carefully selected negative samples. We will detail the model architecture, which combines RWKV or Transformer-based forecasters with Residual encoders. The training objective encourages accurate representation learning by pulling forecasted embeddings toward future targets and pushing them away from dissimilar contexts. The talk will cover practical aspects of training, the design of contrastive losses for temporal data, and the handling of multivariate and long-horizon forecasting challenges. This session is intended for researchers and practitioners interested in self-supervised learning, representation learning, and time series modeling.
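
To make the training objective concrete, the following PyTorch sketch shows an InfoNCE-style loss that pulls each predicted embedding toward its true future embedding and away from the other targets in the batch used as negatives. The temperature, shapes, and random data are assumptions; this is not the authors' exact loss.

# Sketch of an InfoNCE-style objective for latent forecasting (not the authors' exact loss).
import torch
import torch.nn.functional as F

def contrastive_forecast_loss(pred_emb, future_emb, temperature=0.1):
    """pred_emb, future_emb: (batch, dim); other rows in the batch act as negatives."""
    pred = F.normalize(pred_emb, dim=-1)
    fut = F.normalize(future_emb, dim=-1)
    logits = pred @ fut.t() / temperature          # similarity of every prediction to every target
    targets = torch.arange(pred.size(0))           # the matching future is the positive
    return F.cross_entropy(logits, targets)

pred = torch.randn(64, 256, requires_grad=True)    # stand-in forecaster outputs
future = torch.randn(64, 256)                      # stand-in encoded future windows
loss = contrastive_forecast_loss(pred, future)
loss.backward()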

Biography:

Jérémy Cochoy is an expert in technology with a strong academic background. Holding a PhD in Computer Science and Mathematics with a focus on Persistent Homology, he leveraged his expertise to co-found Symphonia, an app that creatively transforms voices into music. Currently, as CEO of Redstone Solutions, Cochoy applies his skills in deep learning to the field of financial market forecasting. His career is a testament to the fusion of advanced scientific knowledge and practical technological applications, underscoring his commitment to driving innovation in complex fields. Beyond his professional realm, Cochoy's interests in music and other artistic pursuits reflect a multifaceted personality, equally engaged in intellectual and creative endeavors.

Julia Kiczka photo

Julia Kiczka

University of Wroclaw

Poster 33: Beyond Reconstruction Error: Leveraging Latent Representations for Anomaly Detection in Time Series

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

(Ongoing Master's Thesis Work) Anomaly detection in time series data is critical in domains such as healthcare, finance, and system monitoring. While many popular methods rely on reconstruction error from models like Autoencoders, Transformers, or Diffusion Models, my thesis investigates whether the latent space itself can carry meaningful information for this task beyond what reconstruction error reveals. I explore how to modify and structure latent representations to make them more informative, for example through contrastive learning. I also test whether simple statistics in the embedding space, such as distances to nearest neighbors, can already provide useful anomaly scores. Another key aspect of my work is the definition of meaningful anomalies - cases that are not just outliers in amplitude or easily detectable with simple statistics, but represent subtle or structural deviations. To handle such cases, I propose techniques for anomaly injection and analysis that go beyond standard benchmarks. Finally, I address challenges in modeling multivariate, correlated time series by introducing masked training and separate decoders per feature dimension, which improve the model’s capacity to capture diverse signal characteristics.
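
The simple embedding-space statistic mentioned above (distance to nearest neighbors as an anomaly score) can be sketched as follows; the embeddings and the choice of k are placeholder assumptions for illustration.

# Sketch: distance to the k-th nearest neighbor in latent space as an anomaly score.
import numpy as np
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(0)
train_emb = rng.normal(size=(1000, 64))          # embeddings of normal windows
test_emb = rng.normal(size=(50, 64)) * 2.0       # embeddings of windows to score

k = 5
nn_index = NearestNeighbors(n_neighbors=k).fit(train_emb)
distances, _ = nn_index.kneighbors(test_emb)
anomaly_score = distances[:, -1]                 # distance to the k-th neighbor
print(anomaly_score[:5])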

Biography:

Julia Kiczka obtained a Bachelor’s degree in Individual Studies in Mathematics and Computer Science at the University of Wrocław and is currently pursuing a Master’s degree in Computer Science and Mathematics.

Alessandro Crimi photo

Alessandro Crimi

AGH University of Krakow

Co-authors:

Szymon Mazurek, Stephen Moore

Poster 34: Graph Attention Networks for Detecting Epilepsy from EEG Signals Using Accessible Hardware in Low-Resource Settings

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Nowadays, epilepsy is still heavily under-diagnosed in low-income countries due to the limited availability of neurologists and the high cost of diagnostic tools. To address this, in this study we propose a graph-based deep learning framework for detecting epileptic subjects using electroencephalogram (EEG) signals and artificial intelligence (AI) on low-cost, accessible hardware, testing recordings from Nigeria and Guinea-Bissau. The contribution focuses both on fair and accessible automatic assessment of epilepsy and on explainability, with the goal of shedding further light on biomarkers for epilepsy. These goals are achieved by modeling EEG signals as spatio-temporal graphs and by classifying them while identifying interchannel relationships and temporal dynamics using graph attention networks (GAT). We employ the GAT approach, which is inherently node-based, with a focus on edges so as to target connectivity biomarkers. In this context of explainable AI, we identified the most discriminant connections captured by the portable EEG device as lying within the frontal cortex and between the frontal and temporal cortex. In summary, we propose a combination of signal preprocessing techniques suitable for low-fidelity recordings and design a lightweight GAT architecture tailored for deployment on resource-constrained devices, with full training on Google Colab and deployment on Raspberry Pi devices. The approach achieves promising classification performance, outperforming a standard Random Forest classifier and graph convolutional networks in terms of accuracy and robustness over multiple sessions, while also highlighting specific connections in the fronto-temporal region. The results highlight the potential of GATs to provide insightful and scalable diagnostic support for epilepsy in underserved regions, paving the way for affordable and accessible neurodiagnostic tools. The innovative aspect for the ML community is the investigation of GAT and GCN for brain connectivity analysis, as we show what the attention weights represent in the brain.

Biography:

Alessandro Crimi, after completing his studies in engineering at the University of Palermo (Italy), obtained a Ph.D. in machine learning applied to medical imaging from the University of Copenhagen and an MBA in healthcare management from the University of Basel. He is currently a professor at AGH Krakow and has worked as a post-doctoral researcher at the French Institute for Research in Computer Science (INRIA), the Swiss Federal Institute of Technology (ETH Zurich), the Italian Institute of Technology (IIT), and the University Hospital of Zurich.

Jakub Adamczyk photo

Jakub Adamczyk

AGH University of Krakow

Poster 35: ML in agrochemistry and ecotoxicology

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Machine learning is widely used in the pharmaceutical industry, where molecular ML and chemoinformatics have supported safer and faster drug development for decades. In contrast, the equally important field of agrochemistry has received far less attention. Yet it has incredible potential for both predictive and generative models, e.g. predicting pesticide toxicity or generating novel, safer agrochemicals. With growing regulatory pressure and a shift toward more sustainable agriculture, there is a pressing need to accelerate the development of alternative pesticides, and ML can play a crucial role in meeting this challenge. In this talk, I will explore how and why machine learning can be applied in agrochemistry, with a particular focus on ecotoxicology - a critical and highly regulated aspect of modern agrochemical development. In particular, I will present ApisTox, a novel dataset on pesticide toxicity to honey bees, and demonstrate how the methods and workflows can be adapted to other molecular ML applications. We'll then review a range of predictive ML models suited to this kind of data, e.g. molecular fingerprints, graph kernels, and graph neural networks (GNNs). In particular, we will cover our insights for non-traditional molecular structures beyond typical pharmaceutical targets, such as pesticides. Finally, I will discuss potential opportunities for ML in agrochemistry and ecotoxicology.
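
As an illustration of the fingerprint-based models mentioned above, the Python sketch below trains a simple classifier on Morgan fingerprints. The SMILES strings and toxicity labels are made-up placeholders, not ApisTox data, and the model is only a generic baseline of the kind discussed.

# Sketch: molecular fingerprint baseline for toxicity prediction (placeholder molecules).
import numpy as np
from rdkit import Chem
from rdkit.Chem import AllChem
from sklearn.ensemble import RandomForestClassifier

smiles = ["CCO", "c1ccccc1", "CC(=O)Oc1ccccc1C(=O)O", "CCN(CC)CC"]   # placeholder SMILES
labels = [0, 1, 0, 1]                                                # placeholder toxicity labels

def morgan_fp(smi, radius=2, n_bits=2048):
    mol = Chem.MolFromSmiles(smi)
    fp = AllChem.GetMorganFingerprintAsBitVect(mol, radius, nBits=n_bits)
    return np.array(list(fp))

X = np.stack([morgan_fp(s) for s in smiles])
clf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, labels)
print(clf.predict_proba(X)[:, 1])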

Biography:

He is a PhD candidate in Computer Science at AGH University of Krakow and a member of the Graph ML and Chemoinformatics Group at the Faculty of Computer Science. His research focuses on fair evaluation, graph representation learning, graph classification, chemoinformatics, and molecular property prediction. He is also interested in time series, natural language processing (NLP), and MLOps, and teaches these subjects at AGH. He works at Placewise as a Data Science Engineer, where he addresses various machine learning challenges in tabular data, computer vision, and NLP, along with their end-to-end MLOps implementations. Outside of his professional work, he trains in Historical European Martial Arts (HEMA), specializing in messer and longsword, and enjoys reading and tabletop role-playing games.

Witold Drzewakowski photo

Witold Drzewakowski

ELLIS Unit Warsaw, University of Warsaw

Co-authors:

Bartosz Piotrowski, Konrad Staniszewski, Piotr Miłoś

Poster 36: Lightweight Latent Verifiers for Efficient Meta-Generation Strategies

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Verifiers are auxiliary models that assess the correctness of outputs generated by base large language models (LLMs). They play a crucial role in many strategies for solving reasoning-intensive problems with LLMs. Typically, verifiers are LLMs themselves, often as large (or larger) than the base model they support, making them computationally expensive. In this work, we introduce a novel lightweight verification approach, LiLaVe, which reliably extracts correctness signals from the hidden states of the base LLM. A key advantage of LiLaVe is its ability to operate with only a small fraction of the computational budget required by traditional LLM-based verifiers. To demonstrate its practicality, we couple LiLaVe with popular meta-generation strategies, like best-of-n or self-consistency. Moreover, we design novel LiLaVe-based approaches, like conditional self-correction or conditional majority voting, that significantly improve both accuracy and efficiency in generation tasks with smaller LLMs. Our work demonstrates the fruitfulness of extracting latent information from the hidden states of LLMs, and opens the door to scalable and resource-efficient solutions for reasoning-intensive applications.
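
How a verifier plugs into a meta-generation strategy such as best-of-n can be sketched in a few lines of Python. Here generate_fn and verifier_score are hypothetical stand-ins for sampling from the base LLM and for a lightweight correctness scorer; this is a sketch of the general strategy, not the LiLaVe implementation.

# Sketch of verifier-guided best-of-n selection; generate_fn and verifier_score are
# hypothetical placeholders, not the LiLaVe implementation.
import random

def generate_fn(prompt: str) -> str:
    """Stand-in for sampling one candidate answer from a base LLM."""
    return f"candidate answer {random.randint(0, 9)}"

def verifier_score(prompt: str, answer: str) -> float:
    """Stand-in for a lightweight verifier scoring correctness from hidden states."""
    return random.random()

def best_of_n(prompt: str, n: int = 8) -> str:
    candidates = [generate_fn(prompt) for _ in range(n)]
    scores = [verifier_score(prompt, c) for c in candidates]
    return max(zip(scores, candidates))[1]        # keep the highest-scoring candidate

print(best_of_n("Solve: 17 * 24 = ?"))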

Biography:

Witold Drzewakowski holds a Bachelor's degree in Computer Science and a Master's in Machine Learning from the University of Warsaw. He has gained research experience at IDEAS NCBR and Snowflake, with a focus on natural language processing. Recently accepted into the ELLIS PhD program, his research interests center on applying large language models to multiagent games and reasoning tasks. He also leads the NLP problem track of the Polish AI Olympiad and serves as a coach for the Polish team for the International Olympiad in AI.

Alicja Dobrzeniecka photo

Alicja Dobrzeniecka

NASK National Research Institute

Co-authors:

Bartłomiej Twardowski, Sebastian Cygert, Szymon Łukasik

Poster 37: Auxiliary Embedding Space Measures as Interventions for Improving Continual Multimodal Learning

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Machine learning models are usually trained using static data, which means updating them is expensive when new data or distribution shifts occur. Furthermore, many applications require models that can adapt continuously. Continual learning (CL) addresses this issue by enabling models to learn over time instead of being retrained from scratch. A central challenge in CL is balancing the ability to learn new information with the stability of retaining old knowledge, while also preventing catastrophic forgetting, whereby new learning erases prior knowledge. A common strategy in CL is to introduce regularisation through loss-based interventions to improve model accuracy. However, my analysis of some of these works revealed that many of these interventions only improve accuracy slightly, with the main improvements coming from different sources, such as replay mechanisms. This suggests that focusing solely on output metrics can lead to a failure to recognise deeper representational dynamics within the model. The structure of the embedding space, particularly in multimodal models such as CLIP, plays a crucial role in how knowledge is stored and updated over time. When tasks share a parameter space, their interactions can significantly influence this shared embedding. Previous research by [1] has shown that CLIP's embedding space tends to form a cone-like structure, with the vision and text modalities distributed at opposite ends; however, much of the space remains underutilised. Recent approaches (such as [2], [3] and [4]) have therefore proposed allocating tasks to distinct subspaces to reduce interference and improve continual learning performance. This demonstrates the potential benefits of focusing on embedding space aspects when designing new interventions. My research focuses on a task-agnostic approach, in which tasks use a shared parameter space within an additional adapter located above a frozen CLIP. My work focuses on improving our understanding of how CLIP’s embedding space evolves during continual learning, and on using this information to enhance the representation space — and ultimately, the model’s performance. Among other measures, I am utilising centroid distance, average pairwise distance and angle rotation. I am also extending the evaluation to include not only the training and test sets, but also out-of-distribution and retrieval tasks, to provide a more robust and useful assessment of model quality and generalization capabilities after training. Preliminary results demonstrate the potential of using auxiliary embedding space measures to control flexibility and stability in the CLIP model. [1] Ni, Z., Wei, L., Tang, S., Zhuang, Y., and Tian, Q. (2023). Continual vision-language representation learning with off-diagonal information. In Proceedings of the 40th International Conference on Machine Learning, ICML’23. JMLR.org. [2] Chaudhry, A., Khan, N., Dokania, P., & Torr, P. (2020). Continual learning in low-rank orthogonal subspaces. In Proceedings of the 34th International Conference on Neural Information Processing Systems. Curran Associates Inc.. [3] Guo, Y., Hu, W., Zhao, D., & Liu, B. (2022). Adaptive Orthogonal Projection for Batch and Online Continual Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 36(6), 6783-6791. https://doi.org/10.1609/aaai.v36i6.20634. [4] Kaushik Roy, Christian Simon, Peyman Moghadam, & Mehrtash Harandi. (2023). Subspace Distillation for Continual Learning.
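
Two of the auxiliary measures named above, centroid distance and average pairwise distance, can be computed directly on CLIP-style embeddings, as in the short PyTorch sketch below. The random, normalized embeddings are placeholder assumptions standing in for real image and text features.

# Sketch: auxiliary embedding-space measures on placeholder CLIP-style embeddings.
import torch

image_emb = torch.nn.functional.normalize(torch.randn(512, 768), dim=-1)
text_emb = torch.nn.functional.normalize(torch.randn(512, 768), dim=-1)

# Centroid distance between the two modalities (related to the modality gap).
centroid_distance = (image_emb.mean(dim=0) - text_emb.mean(dim=0)).norm()

# Average pairwise distance within one modality (a spread / space-utilisation measure).
pairwise = torch.cdist(image_emb, image_emb)
avg_pairwise_distance = pairwise.sum() / (pairwise.numel() - pairwise.size(0))

print(float(centroid_distance), float(avg_pairwise_distance))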

Biography:

Alicja Dobrzeniecka has been studying and researching AI for a number of years. She holds a Master of Science in Artificial Intelligence from Vrije Universiteit Amsterdam and a Bachelor of Arts in Philosophy from the University of Gdańsk. She has professional experience working as a data scientist, developing machine learning and deep learning models for businesses. She is currently a PhD student and researcher at the NASK National Research Institute. Her research focuses on a more trustworthy approach to Continual Learning for multi-modal models.

Maciej Stefaniak photo

Maciej Stefaniak

University of Warsaw

Co-authors:

Michał Krutul, Jan Ludziejewski, Kamil Adamczewski, Marek Cygan, Sebastian Jaszczur, Maciej Pióro, Jan Małaśnicki, Jakub Krajewski

Poster 38: Projected Compression: Trainable Projection for Efficient Transformer Compression

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Large language models have steadily increased in size to achieve improved performance; however, this growth has also led to greater inference time and computational demands. Consequently, there is rising interest in model size reduction methods. To address this issue, we propose Projected Compression, a novel model compression technique that reduces model weights by utilizing projection modules. Specifically, we first train additional trainable projection weights while preserving access to all the original model parameters. Subsequently, these projections are merged into a lower-dimensional product matrix, resulting in a reduced-size standard Transformer-based model. Unlike alternative approaches that require additional computational overhead, our method matches the base model's per-token computation step in FLOPs. Experimental results show that Projected Compression outperforms the comparable hard pruning and retraining approach on higher quality models. Moreover, the performance margin scales well with the number of tokens.
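
A rough PyTorch sketch of the core idea as described above: a frozen original weight matrix is composed with trainable projections, and after training the product is merged into one smaller matrix. All dimensions and initializations are assumptions for illustration; this is not the paper's code.

# Sketch of the projection idea described above (assumed dimensions, not the paper's code).
import torch
import torch.nn as nn

d_in, d_out = 4096, 4096          # original weight shape
r_in, r_out = 2048, 2048          # reduced dimensions after compression

W = torch.randn(d_out, d_in)                                     # frozen original weight
P_in = nn.Parameter(torch.randn(d_in, r_in) / d_in ** 0.5)       # trainable input projection
P_out = nn.Parameter(torch.randn(r_out, d_out) / d_out ** 0.5)   # trainable output projection

# During training the original parameters remain accessible through the projections;
# afterwards the product is merged into a single lower-dimensional matrix.
W_compressed = (P_out @ W @ P_in).detach()                        # shape (r_out, r_in)
print(W_compressed.shape)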

Biography:

Since 2024, I have been collaborating with the research team at IDEAS NCBR on computationally efficient methods for Large Language Models (LLMs), focusing particularly on scaling Mixture-of-Experts models and designing new Transformer-based architectures. I am a co-author of the publication Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient (ICML 2025). Currently, my research focuses on LLM compression methods, including structured pruning, knowledge distillation, and post-compression retraining techniques. Previously, I worked on a research project at Poznan University of Technology exploring how LLMs can be used to generate missing knowledge in Case-Based Reasoning systems. I also worked as a Research Scientist and Machine Learning Engineer at TIDK on R&D projects funded by the National Centre for Research and Development (NCBR), where I was responsible for implementing, training, and evaluating AI algorithms in cloud environments, as well as MLOps tasks. I am currently pursuing a Ph.D. in Computer Science at the University of Warsaw. Earlier, I completed an M.Sc. in Computer Science at Poznan University of Technology within the AITech program.

Aleksandra Krasnodębska photo

Aleksandra Krasnodębska

NASK

Co-authors:

Karolina Seweryn, Szymon Łukasik, Wojciech Kusa

Poster 39: PL-Guard: Benchmarking Language Model Safety for Polish

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Despite increasing efforts to ensure the safety of large language models (LLMs), most existing safety assessments and moderation tools remain heavily biased toward English and other high-resource languages, leaving the majority of global languages underexamined. To address this gap, we introduce a manually annotated benchmark dataset for language model safety classification in Polish. We also create adversarially perturbed variants of these samples designed to challenge model robustness. We conduct a series of experiments to evaluate LLM-based and classifier-based models of varying sizes and architectures. Specifically, we fine-tune three models: Llama-Guard-3-8B, a HerBERT-based classifier (a Polish BERT derivative), and PLLuM, a Polish-adapted Llama-8B model. We train these models using different combinations of annotated data and evaluate their performance, comparing it against publicly available guard models. Results demonstrate that the HerBERT-based classifier achieves the highest overall performance, particularly under adversarial conditions.

Biography:

Aleksandra Krasnodębska is a Senior NLP Specialist at NASK National Research Institute, where she focuses on large language model (LLM) alignment, safety evaluation—particularly for the Polish language—and data preparation for model training. She holds a Master’s degree in Mathematical Statistics from Warsaw University of Technology and has prior experience in NLP and AI research across various organizations.

Zofia Hendrysiak photo

Zofia Hendrysiak

University of Warsaw

Poster 40: Teaching Small Models Big Math: A Curriculum Learning Approach for Bielik

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Small Language Models (SLMs) often exhibit significant deficiencies in complex reasoning, limiting their utility in specialized domains like mathematics. This work addresses this limitation within the context of Polish language models by investigating curriculum learning (CL) as a data- and compute-efficient fine-tuning strategy. The research focuses on the Bielik-1.5B and Bielik-4.5B models, which have documented weaknesses in mathematical problem-solving. This poster posits that a structured curriculum, which presents training examples in order of increasing difficulty, enables more effective and efficient skill acquisition compared to standard fine-tuning on randomly shuffled data. The proposed methodology leverages the five distinct difficulty levels of the MATH dataset to construct a progressive learning curriculum. The Bielik models are fine-tuned using a fixed-pace schedule, completing a set number of epochs on each difficulty tier before advancing to the next. To establish a rigorous baseline for comparison, models of the same scale are fine-tuned on the identical dataset but presented in a conventional, non-sequential order. The evaluation protocol is designed for a multi-faceted analysis of performance and efficiency. First, mathematical proficiency is quantified on a held-out test split of the MATH dataset. Second, the generalization of reasoning skills is assessed using the English MT-Bench and its validated Polish adaptation, MT-Bench-PL. This dual-benchmark approach measures skill retention in non-mathematical domains and tests the cross-lingual transfer of logical-mathematical abilities from English-centric training data to Polish. Finally, the computational efficiency of CL is compared against the baseline by measuring time-to-convergence and total FLOPs. The findings provide a resource-efficient blueprint for specializing models on complex tasks, offering a practical path for developing advanced, domain-specific AI capabilities within the Polish language ecosystem.
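
The fixed-pace curriculum schedule described above can be outlined in a short Python sketch: examples are grouped by MATH difficulty level and the model is trained for a set number of epochs per tier, easiest first. Both train_epoch and the dataset record structure are hypothetical placeholders.

# Sketch of a fixed-pace curriculum over difficulty tiers; train_epoch and the
# dataset records are hypothetical placeholders.
from collections import defaultdict

def train_epoch(model, examples):
    """Stand-in for one fine-tuning epoch on a list of examples."""
    pass

def curriculum_finetune(model, dataset, epochs_per_level=2):
    # dataset: iterable of dicts like {"problem": ..., "solution": ..., "level": 1..5}
    by_level = defaultdict(list)
    for ex in dataset:
        by_level[ex["level"]].append(ex)
    for level in sorted(by_level):                # easiest tier first
        for _ in range(epochs_per_level):
            train_epoch(model, by_level[level])
    return model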

Biography:

Zosia is a Master's student in Cognitive Science within the Interfaculty Individual Studies in Mathematics and Science program at the University of Warsaw. Her academic journey is complemented by over two years of practical experience at a startup, where she contributes to the creation of a novel intelligent assistant. This work bridges the gap between theoretical cognitive principles and real-world machine learning applications. Outside of research, she is a passionate reader, pianist, and hiker.

Marek Jeliński photo

Marek Jeliński

NASK

Poster 41: Auditing Large Language Models for Intentional Manipulations

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Large Language Models (LLMs) are increasingly embedded in decision-making processes, raising concerns about intentional manipulations that go beyond known demographic or gender biases. These manipulations may include preferential promotion of brands, ideologies, or lifestyles, posing subtle but significant risks to users. Most existing approaches to detecting such manipulation rely on prior knowledge of biased datasets, limiting their ability to uncover previously unknown or novel manipulative behavior. To address this, we propose a new auditing method that does not require labeled biased data. Our approach compares an audited model to a trusted reference LLM by analyzing shifts in word embedding spaces. We construct a large semantic dictionary to quantify how word representations deviate between models, and we evaluate their alignment with curated positive and negative semantic subspaces. In parallel, we analyze token-level generation probabilities to capture distributional changes in outputs. We demonstrate the effectiveness of our approach through a case study auditing the Mistral 7B model for potential intentional manipulations.
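
A rough sketch of the embedding-comparison idea: align the two models' word embedding spaces and rank words by how far their representations move. This is an illustration under simplifying assumptions (a shared vocabulary and a simple orthogonal alignment), not the paper's full pipeline with curated semantic subspaces and token-level probabilities.

```python
# Sketch of comparing word representations between an audited and a
# reference model: align the two embedding spaces with orthogonal
# Procrustes, then rank words by how far their vectors move.
# The embedding-extraction step and the word list are placeholders.
import numpy as np

def procrustes_align(X, Y):
    """Find the rotation R minimizing ||X R - Y||_F (orthogonal Procrustes)."""
    U, _, Vt = np.linalg.svd(X.T @ Y)
    return U @ Vt

def embedding_shifts(emb_audited, emb_reference, words):
    X = np.stack([emb_audited[w] for w in words])
    Y = np.stack([emb_reference[w] for w in words])
    R = procrustes_align(X, Y)
    Xa = X @ R
    # cosine distance between aligned audited vectors and reference vectors
    cos = np.sum(Xa * Y, axis=1) / (np.linalg.norm(Xa, axis=1) * np.linalg.norm(Y, axis=1))
    return dict(zip(words, 1.0 - cos))

# Words with the largest shift are candidates for manual inspection,
# e.g. brand or ideology terms whose neighbourhoods changed most.
```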

Biography:

Marek Jeliński's research focuses on natural language processing, with a particular emphasis on large language models and AI safety. He actively engages with topics related to artificial intelligence security, concentrating on the development of secure and resilient LLM-based systems.

Jakub Chojnacki photo

Jakub Chojnacki

Independent Researcher

Co-authors:

Miłosz Gajowczyk, Kacper Kupczak, Karolina Szałata, Patryk Rygiel

Poster 42: MT-SAS: Multi-Task Structure Aware Segmentation of Anatomical Regions

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

The incorporation of automated analysis for anatomical structures, derived from medical imaging modalities such as magnetic resonance imaging (MRI) or computed tomography (CT), has become an indispensable component of contemporary radiological workflows. Although various methods exist, the current gold standard in automatic analysis is based on deep learning approaches that utilize the U-Net architecture to accomplish segmentation, regression, or classification tasks employing conventional supervised learning methodologies. However, by concentrating solely on a single task at a time, the methods' capacity to effectively utilize shared information is diminished, disregarding the mutually beneficial interaction between tasks. In response to the growing demand for scalable and precise diagnostic assistance, multi-task systems are emerging as a promising paradigm to exploit shared information between tasks. In this work, we present Multi-Task Structure Aware Segmentation (MT-SAS), a method tailored for concurrently segmenting and classifying regions within anatomical structures. Our approach is based on latent representation augmentation with region-specific information derived from different tasks. By leveraging the task-joint representation, accuracy and robustness are enhanced across diverse tasks. We evaluate the proposed method on spinal diagnosis by performing vertebral segmentation, measurements, and the generation of a comprehensive report that quantifies the segmented radiologically significant regions. The outcomes of experiments conducted on the SPIDER dataset demonstrate that our model exhibits superior performance in both segmentation and classification tasks compared to the current leading single-task learning methodologies.
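
The joint-training idea can be pictured with a generic shared-encoder network carrying one segmentation head and one classification head; MT-SAS's specific latent augmentation with region-derived information is not reproduced in this sketch, and the layer sizes are arbitrary.

```python
# Generic shared-encoder multi-task setup (one segmentation head, one
# classification head) illustrating why joint training can help.
import torch
import torch.nn as nn

class MultiTaskNet(nn.Module):
    def __init__(self, in_ch=1, feat=16, num_classes=4):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv3d(in_ch, feat, 3, padding=1), nn.ReLU(),
            nn.Conv3d(feat, feat, 3, padding=1), nn.ReLU(),
        )
        self.seg_head = nn.Conv3d(feat, 1, 1)         # voxel-wise mask
        self.cls_head = nn.Linear(feat, num_classes)  # region-level classification

    def forward(self, x):
        z = self.encoder(x)
        seg = self.seg_head(z)
        cls = self.cls_head(z.mean(dim=(2, 3, 4)))    # global average pooling
        return seg, cls

x = torch.randn(1, 1, 16, 32, 32)                     # toy CT/MRI patch
seg, cls = MultiTaskNet()(x)
print(seg.shape, cls.shape)
# joint loss: seg_loss(seg, mask) + lambda * cls_loss(cls, label)
```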

Biography:

An AI engineer and medical imaging researcher developing machine learning tools that help clinicians with disease assessment and treatment guidance. His projects range from coronary artery disease analysis based on CTA imaging—including vessel segmentation, stenosis quantification, calcium scoring, and fractional flow reserve estimation with physics-aware networks—to tracking brain changes across multiple MRI contrasts and detecting breast cancer lesions on ultrasound.

Lukasz Popek photo

Lukasz Popek

Warsaw University of Technology

Co-authors:

Julian Konowalski

Poster 43: Smart Flattening: AI-Assisted Unwrapping of 3D Guitar Body Geometry for Custom Graphic Application

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Applying 2D images onto 3D objects with complex shapes remains a major challenge in custom product design, especially in industries where shapes are detailed and unusual. Manually turning 3D objects exported from CAD software into accurate 2D grid layouts (called UV unwrapping) for texture mapping can be difficult and time-consuming. It often requires model remeshing to achieve more regular geometry and prevent image stretching. In this paper, we propose an AI-powered framework that automates and optimizes the UV unwrapping process of guitar body geometries. By leveraging the knowledge of diverse guitar body shapes and mapping techniques, our system finds the best way to unwrap the 3D model while keeping the proportions and minimizing texture stretching. This approach supports the designer’s workflow and contributes to a broader vision of automating the electric guitar production process. The approach was developed in collaboration with Ruf Guitars Ltd and will serve as a use case technology within the company's pipeline. It will allow for scalable manufacturing of custom instruments with high-fidelity graphic visualizations.

Biography:

Łukasz Popek is a PhD candidate at Warsaw University of Technology, with a dual Master’s background from WUT and the University of Warsaw. His research concerns applying generative AI in industrial processes, focusing on photorealistic texture generation for composite electric guitars. He has worked across startups and applied research projects, delivering AI-driven solutions in UAV safety, thermal vision, and creative automation.

Tomasz Ponitka photo

Tomasz Ponitka

Tel Aviv University

Co-authors:

Paul Duetting, Michal Feldman, Ermis Soumalias

Poster 44: The Pseudo-Dimension of Contracts

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Algorithmic contract design studies scenarios where a principal incentivizes an agent to exert effort on her behalf. In this work, we focus on settings where the agent's type is drawn from an unknown distribution, and formalize an offline learning framework for learning near-optimal contracts from sample agent types. A central tool in our analysis is the notion of pseudo-dimension from statistical learning theory. Beyond its role in establishing upper bounds on the sample complexity, pseudo-dimension measures the intrinsic complexity of a class of contracts, offering a new perspective on the tradeoffs between simplicity and optimality in contract design. Our main results provide essentially optimal tradeoffs between pseudo-dimension and representation error (defined as the loss in principal's utility) with respect to linear and bounded contracts. Using these tradeoffs, we derive sample- and time-efficient learning algorithms, and demonstrate their near-optimality by providing almost matching lower bounds on the sample complexity. Conversely, for unbounded contracts, we prove an impossibility result showing that no learning algorithm exists. Finally, we extend our techniques in three important ways. First, we provide refined pseudo-dimension and sample complexity guarantees for the combinatorial actions model, revealing a novel connection between the number of critical values and sample complexity. Second, we extend our results to menus of contracts, showing that their pseudo-dimension scales linearly with the menu size. Third, we adapt our algorithms to the online learning setting, where we show that a polynomial number of type samples suffices to learn near-optimal bounded contracts. Combined with prior work, this establishes a formal separation between expert advice and bandit feedback for this setting.
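
For readers less familiar with the setting, the sketch below illustrates the classic linear-contract trade-off on a toy discrete-action instance (background only; it does not implement the poster's learning algorithms or bounds): the agent best-responds to the share alpha, and the principal keeps the rest.

```python
# Background illustration of a linear contract: the agent is paid a share
# alpha of the reward, best-responds given action costs and expected rewards,
# and the principal keeps (1 - alpha) of the expected reward. Numbers are toy.
import numpy as np

costs = np.array([0.0, 0.3, 0.9])            # cost of each action
expected_reward = np.array([0.2, 1.0, 1.6])  # E[r | action]

def principal_utility(alpha):
    agent_utility = alpha * expected_reward - costs
    a_star = int(np.argmax(agent_utility))   # agent's best response
    return (1 - alpha) * expected_reward[a_star]

alphas = np.linspace(0, 1, 101)
utilities = [principal_utility(a) for a in alphas]
best = alphas[int(np.argmax(utilities))]
print(f"best linear share alpha = {best:.2f}, principal utility = {max(utilities):.3f}")
```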

Biography:

I am a PhD student working on Economics and Computation.

Tomasz Szczepański photo

Tomasz Szczepański

Sano Centre for Computational Medicine

Co-authors:

Szymon Płotka, Michał K. Grzeszczyk, Arleta Adamowicz, Piotr Fudalej, Przemysław Korzeniowski, Tomasz Trzciński, Arkadiusz Sitek

Poster 45: GEPAR3D: Geometry Prior-Assisted Learning for 3D Tooth Segmentation

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Tooth segmentation in Cone-Beam Computed Tomography (CBCT) remains challenging, especially for fine structures like root apices, which are critical for assessing root resorption in orthodontics. We propose GEPAR3D, a novel approach that unifies instance detection and multi-class segmentation into a single, geometrically-informed step. Our method combines a Statistical Shape Model as a geometric prior with a 3D energy-based watershed formulation, where each tooth is modeled as a separate energy basin defined by voxel-wise distances to boundaries. Our method is trained on a public CBCT dataset and evaluated across diverse external test sets from multiple centers, demonstrating strong generalization and robustness to variations in imaging protocols and patient anatomy, while consistently achieving state-of-the-art segmentation quality. A key application is the analysis of root resorption, a pathological shortening of tooth roots often induced by orthodontic treatment. As resorption detection relies on comparing sequential scans to a stable baseline, accurate apex segmentation is critical, since under-segmentation may obscure early signs of root loss. GEPAR3D’s improvements in root-level accuracy may support such clinical workflows and demonstrate the value of geometrical and anatomical priors for fine-grained 3D medical segmentation.

Biography:

Tomasz Szczepański holds an MSc in Computer Science (2022) and a BEng in Photonics Engineering, both from the Warsaw University of Technology (WUT). His master's thesis addressed the issue of data bias in chest X-ray datasets of COVID-19 patients. He is currently a PhD candidate jointly affiliated with the Sano Centre for Computational Medicine in Kraków and the Warsaw University of Technology (WUT). At Sano, he is a member of the Medical Imaging and Robotics group. His research interests include medical imaging, multimodal data integration, and geometric approaches to 3D image segmentation. His research has been presented at MICCAI ('24, '25) and ICCS ('22), and published in high-impact journals such as IEEE Transactions on Medical Imaging (TMI, '24) and Medical Image Analysis (MEDIA, '25).

Kacper Marzol photo

Kacper Marzol

Jagiellonian University

Co-authors:

Ignacy Kolton, Weronika Smolak-Dyżewska, Przemysław Spurek

Poster 46: VeGaSMedical - Spatiotemporal Gaussian Modeling for Ultrasound Interpolation and Anatomical Mesh Generation

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Medical imaging remains one of the most critical components in modern diagnostics. Among the various modalities, ultrasound (US) stands out for its safety, real-time feedback, and non-invasive nature. However, it is often limited in spatial and temporal resolution, and certain regions may be underrepresented due to operator error, occlusion, or rapid tissue movement. Traditional methods for enhancing ultrasound data often rely on heuristic filtering or deep learning models that lack physical consistency and generalizability across patient anatomies. We address this problem by introducing VeGaSMedical, a novel approach to medical image processing that applies recent advances in video representation to two challenges in medical imaging: expressive interpolation of sparse ultrasound data and high-fidelity 3D mesh generation from annotated medical images. This work builds upon the foundation of Gaussian Splatting, a revolutionary method in computer vision, and introduces modifications tailored specifically to the structure and demands of medical imaging workflows. By incorporating Folded Gaussians, VeGaS (Video Gaussian Splatting) demonstrates significantly improved capability in reconstructing and interpolating video data. It achieves this with a modified family of time-conditioned Gaussian functions designed to model nonlinear dynamics across video frames, created to capture dynamics in a sequence of images and to model frames by 2D Gaussians derived as conditional distributions. Medical imaging (ultrasound, CT scans, MRI) can also be considered as video, as it is a sequence of images. Hence, our work, VeGaSMedical, builds on these findings to perform precise image interpolation, while also enabling downstream 3D modeling with enhanced accuracy and anatomical consistency. We enhance the quality of the interpolated output by incorporating self-supervision during the training stage. We achieve this by using basic interpolation between ground truth frames as additional input to the model. Beyond interpolation, VeGaSMedical facilitates high-fidelity 3D mesh reconstruction from limited annotated slices. By generating intermediate frames between annotated images, our method increases volumetric data density, enabling more detailed and anatomically plausible 3D modeling through standard reconstruction techniques, such as marching cubes. This leads to the creation of anatomically faithful 3D meshes from just a few annotated slices, reducing the annotation burden while increasing geometric detail. Preliminary evaluations indicate strong potential, with full benchmarks forthcoming. VeGaSMedical offers a promising direction for future development in diagnostic visualization, temporal tracking, and surgical planning. By bridging sparse input data with anatomically coherent interpolation and enabling mesh generation from limited annotations, this framework addresses critical challenges in medical imaging workflows. This work creates a promising foundation for advancing real-time analysis, surgical planning, and accessible 3D visualization - especially in settings where high-resolution data or extensive labeling is limited.
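
The densify-then-mesh step can be illustrated with plain linear blending between annotated slices followed by marching cubes from scikit-image; the learned Gaussian-based interpolator and real ultrasound data are replaced by placeholders here.

```python
# Sketch of the densify-then-mesh idea: insert linearly blended slices
# between annotated slices to increase volumetric density, then extract a
# surface with marching cubes. Slice data and the learned interpolator
# are stand-ins.
import numpy as np
from skimage import measure

def densify(slices, factor=2):
    """Insert `factor - 1` linearly interpolated slices between neighbours."""
    dense = []
    for a, b in zip(slices[:-1], slices[1:]):
        for k in range(factor):
            t = k / factor
            dense.append((1.0 - t) * a + t * b)
    dense.append(slices[-1])
    return np.stack(dense)

slices = np.random.rand(10, 64, 64)          # placeholder annotated mask slices
volume = densify(list(slices), factor=4)
verts, faces, normals, values = measure.marching_cubes(volume, level=0.5)
print(volume.shape, verts.shape, faces.shape)
```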

Biography:

CS Master’s student at Jagiellonian University. Passionate about machine learning and computer vision, with a strong interest in Gaussian Splatting and its real-world applications. Always curious and eager to turn ideas into impactful solutions.

Szymon Płotka photo

Szymon Płotka

Jagiellonian University

Co-authors:

Gizem Mert, Maciej Chrabaszcz, Ewa Szczurek, Arkadiusz Sitek

Poster 47: Mamba Goes HoME: Hierarchical Soft Mixture-of-Experts for 3D Medical Image Segmentation

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

In recent years, artificial intelligence has significantly advanced medical image segmentation. However, challenges remain, including efficient 3D medical image processing across diverse modalities and handling data variability. In this work, we introduce Hierarchical Soft Mixture-of-Experts (HoME), a two-level token-routing layer for efficient long-context modeling, specifically designed for 3D medical image segmentation. Built on the Mamba state-space model (SSM) backbone, HoME enhances sequential modeling through sparse, adaptive expert routing. The first stage employs a Soft Mixture-of-Experts (SMoE) layer to partition input sequences into local groups, routing tokens to specialized per-group experts for localized feature extraction. The second stage aggregates these outputs via a global SMoE layer, enabling cross-group information fusion and global context refinement. This hierarchical design, combining local expert routing with global expert refinement, improves generalizability and segmentation performance, surpassing state-of-the-art results across datasets from the three most commonly used 3D medical imaging modalities and varying data quality.
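
The routing idea can be illustrated with a single-level soft expert-routing layer in PyTorch. This is a deliberately simplified stand-in: HoME's grouped two-stage routing and Mamba backbone are not reproduced, and the layer sizes are arbitrary.

```python
# Simplified soft expert-routing layer (single level, dense soft weights),
# meant only to illustrate the routing idea behind mixture-of-experts layers.
import torch
import torch.nn as nn

class SoftRouting(nn.Module):
    def __init__(self, dim, num_experts, hidden=256):
        super().__init__()
        self.router = nn.Linear(dim, num_experts)          # per-token routing logits
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, hidden), nn.GELU(), nn.Linear(hidden, dim))
            for _ in range(num_experts)
        )

    def forward(self, x):                                   # x: (batch, tokens, dim)
        weights = torch.softmax(self.router(x), dim=-1)     # (B, T, E)
        expert_out = torch.stack([e(x) for e in self.experts], dim=-1)  # (B, T, dim, E)
        return torch.einsum("btde,bte->btd", expert_out, weights)

x = torch.randn(2, 16, 64)
print(SoftRouting(64, num_experts=4)(x).shape)  # torch.Size([2, 16, 64])
```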

Biography:

Szymon Płotka obtained his PhD in Computer Science in 2024 from the Informatics Institute at the University of Amsterdam, where his research focused on applying deep learning to enhance prenatal care. Szymon’s research interests lie at the intersection of computer vision, machine learning, and deep learning-based algorithms for medical image analysis. He is particularly interested in developing innovative AI-driven solutions to improve diagnostic accuracy, integrate multimodal data sources, and optimise healthcare workflows. His work aims to bridge the gap between cutting-edge artificial intelligence and real-world clinical applications, contributing to more efficient and accessible medical imaging technologies.

Piotr Wyrwiński photo

Piotr Wyrwiński

PUT, PCSS

Co-authors:

Krzysztof Krawiec

Poster 48: Learning to Synthesize Expressions: Symbolic Regression via Iterative Graph Expansion

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Symbolic regression (SR) is a fundamental problem in machine learning and scientific discovery, where the objective is to recover an explicit analytical expression that fits observed data. Unlike conventional regression, SR does not assume a predefined model structure -- instead, it requires synthesizing both the form and parameters of a symbolic expression from scratch. In this sense, SR can be viewed as a constrained form of program synthesis from examples, where the desired output is a compact and interpretable program (in the form of a mathematical expression) that approximates an unknown target function. This task is notoriously difficult: the space of candidate programs grows exponentially with formula depth, and small syntactic changes can lead to abrupt and unpredictable changes in semantics, making the search space highly discontinuous and non-differentiable. As a result, gradient-based methods are ineffective, and even evaluating the quality of partially constructed programs can be unreliable due to their brittleness and poor generalization. These challenges mirror core issues in symbolic program synthesis, highlighting the need for learned, structure-aware guidance. To address this, we propose a neurosymbolic algorithm that frames symbolic regression as an iterative process of neural-guided graph expansion. The method incrementally expands a computational graph where each node represents a symbolic expression constructed so far, and edges reflect their compositional structure. A graph neural network (GNN) is queried throughout this process to identify which candidate subexpressions should be expanded, thereby guiding the bottom-up synthesis of increasingly complex models. To support effective guidance, each node is equipped with a learned embedding that represents the symbolic expression associated with that node. These embeddings capture both the syntactic structure of the expression (e.g., its composition and position in the graph) and its semantic behavior across training data (i.e., the outputs it produces on input examples). They are further enriched via a transformer-style attention module, which aggregates information across all input–output examples. This enables the model to form contextualized, task-aware representations of candidate expressions. The attention-guided message passing mechanism within the GNN allows the system to prioritize substructures that are not only structurally plausible but also exhibit behavior consistent with the target values. We evaluate the proposed method on a large synthetic benchmark of symbolic regression problems as well as the AI Feynman suite -- a collection of equations derived from physics. The results show that neural guidance dramatically improves success rates over unguided variants, demonstrating the benefits of solving symbolic regression iteratively and compositionally, rather than in a single step as in many purely neural approaches.
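
The iterative, bottom-up expansion can be sketched with a toy enumerative synthesizer: start from terminals, combine surviving subexpressions with binary operators, and keep a small beam scored on the training data. In the poster, a GNN with attention replaces the plain MSE scorer used here; everything below is illustrative only.

```python
# Toy bottom-up expression synthesis: candidates carry a readable form and
# their outputs on the training inputs; each round combines candidates with
# binary operators and keeps the best-scoring beam.
import numpy as np

OPS = {"+": np.add, "-": np.subtract, "*": np.multiply}

def synthesize(X, y, n_rounds=3, beam=20):
    pool = [("x", X), ("1", np.ones_like(X))]              # terminal expressions
    for _ in range(n_rounds):
        new = []
        for fa, va in pool:
            for fb, vb in pool:
                for name, op in OPS.items():
                    new.append((f"({fa}{name}{fb})", op(va, vb)))
        pool = sorted(pool + new, key=lambda c: np.mean((c[1] - y) ** 2))[:beam]
    return pool[0]

X = np.linspace(-2, 2, 50)
y = X * X + X                                              # target: x^2 + x
form, vals = synthesize(X, y)
print(form, float(np.mean((vals - y) ** 2)))
```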

Biography:

Piotr Wyrwiński is a PhD student at Poznan University of Technology and a Machine Learning Researcher at the Poznan Supercomputing and Networking Center. His research explores neurosymbolic learning, program synthesis, and graph-based deep learning. At PCSS, he develops and applies deep learning models in areas such as weather prediction, medical imaging, and satellite data analysis.

Joanna Kaleta photo

Joanna Kaleta

SANO Poland; Warsaw University of Technology

Co-authors:

Weronika Smolak-Dyżewska, Dawid Malarz, Diego Dall'Alba, Przemyslaw Korzeniowski, Przemysław Spurek

Poster 49: PR-ENDO: Physically Based Relightable Gaussian Splatting for Endoscopy

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Endoluminal endoscopic procedures are essential for diagnosing colorectal cancer and other severe conditions in the digestive tract, urogenital system, and airways. 3D reconstruction and novel-view synthesis from endoscopic images are promising tools for enhancing diagnosis. Moreover, integrating physiological deformations and interaction with the endoscope enables the development of simulation tools from real video data. However, constrained camera trajectories and view-dependent lighting create artifacts, leading to inaccurate or overfitted reconstructions. We present PR-ENDO, a novel 3D reconstruction framework leveraging the unique property of endoscopic imaging, where a single light source is closely aligned with the camera. Our method separates light effects from tissue properties. PR-ENDO enhances 3D Gaussian Splatting with a physically based relightable model. We boost the traditional light transport formulation with a specialized MLP capturing complex light-related effects while ensuring reduced artifacts and better generalization across novel views. PR-ENDO achieves superior reconstruction quality compared to baseline methods on both public and in-house datasets. Unlike existing approaches, PR-ENDO enables tissue modifications while preserving a physically accurate response to light, making it closer to real-world clinical use.

Biography:

Joanna Kaleta is a Computer Science graduate from Warsaw University of Technology who is currently a Ph.D. student at Sano - Center for Computational Medicine. Her research interest lies in the exploration of innovative Computer Vision applications for computer-assisted surgery and diagnosis.

Michał Sala photo

Michał Sala

University of Warsaw

Co-authors:

Karol Kuźniak

Poster 50: PAST LIME: Explaining Code Generation via AST-Based Perturbations

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Code-generation models such as Stable-Code-3B have shown strong performance in tasks like code completion, bug fixing, and refactoring. However, their black-box nature limits users’ ability to understand the factors that influence the model’s output. This lack of interpretability poses challenges in software engineering, where trustworthy and semantically coherent explanations are essential. We propose PAST LIME (Perturbed Abstract Syntax Tree LIME), a model-agnostic interpretability method that identifies which code fragments affect the probability assigned to a fixed continuation. Most existing approaches either operate at the token level without considering code structure, or attempt to align token probabilities with semantic representations without using perturbation-based analysis. In contrast, PAST LIME assigns importance scores to code fragments -- corresponding to a subset of nodes in the abstract syntax tree -- indicating whether the presence of each fragment increases or decreases the likelihood of the given continuation. PAST LIME perturbs semantically meaningful, non-overlapping code segments and queries the model’s conditional probability distribution, without requiring access to model gradients or internal parameters. Its output is independent of the sampling strategy used during generation and reflects the sensitivity of the continuation probability to structural changes in the code. Despite operating locally, the method scales to large codebases and remains computationally efficient. Beyond standard explainability, PAST LIME enables analysis of arbitrary continuations, offering new opportunities for uncovering hidden biases, evaluating model behavior, and supporting model debugging. Through experiments on Stable-Code-3B, we show that PAST LIME yields explanations more aligned with code semantics than token-level baselines, providing developers with interpretable insights into the model’s decision-making process.
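
A rough sketch of the perturbation loop, assuming a placeholder `continuation_logprob` function in place of a real query to the code model: top-level statements act as the non-overlapping fragments, each keep/drop pattern is re-scored, and a linear surrogate yields per-fragment importance weights.

```python
# Sketch of AST-level perturbation for explaining a fixed continuation:
# drop top-level statements, re-score the continuation, and fit a linear
# surrogate over presence indicators (LIME-style).
import ast
import numpy as np

def continuation_logprob(prefix_code: str, continuation: str) -> float:
    # Placeholder: in practice, sum the model's token log-probabilities of
    # `continuation` conditioned on `prefix_code`.
    raise NotImplementedError

def explain(prefix_code: str, continuation: str):
    tree = ast.parse(prefix_code)
    stmts = list(tree.body)                        # non-overlapping fragments
    masks, scores = [], []
    for mask in range(1 << len(stmts)):            # all keep/drop subsets (small inputs)
        kept = [s for i, s in enumerate(stmts) if mask >> i & 1]
        code = ast.unparse(ast.Module(body=kept, type_ignores=[]))
        masks.append([mask >> i & 1 for i in range(len(stmts))])
        scores.append(continuation_logprob(code, continuation))
    # least-squares surrogate: one importance weight per statement
    A = np.column_stack([np.ones(len(masks)), np.array(masks)])
    coefs, *_ = np.linalg.lstsq(A, np.array(scores), rcond=None)
    return dict(zip((ast.unparse(s) for s in stmts), coefs[1:]))
```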

Biography:

A human being. Interested in ML, distributed systems, programming languages & formal verification.

Pawel Struski photo

Pawel Struski

University of Warsaw

Poster 51: Competitive Market Behaviour of LLMs

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

This study explores the competitive market behaviour of Large Language Models (LLMs). We find that LLM agents exhibit systematic biases in economic decision-making tasks. In a simulated competitive market, we find that LLM buyer agents frequently bid their maximum reservation price, despite being instructed to buy at the lowest possible price. This leads to transaction prices above the competitive market equilibrium – a striking divergence from human behaviour in the same setting. We arrive at this result by replicating Smith’s (1962) classic behavioural economics experiment using LLM agents (GPT-4.1-mini-2025-04-14). We simulate a market for a fictitious good, populated by two types of agents: buyers and sellers, each with a predefined reservation price. Buyers are tasked with purchasing at the lowest possible price (but not above their reservation price), and sellers with selling at the highest possible price (but not below their reservation price). Reservation prices are structured so as to create a downward-sloping demand curve and an upward-sloping supply curve. Economic theory, supported by Smith’s original experiments, predicts that such a market should converge to a competitive equilibrium where supply equals demand. In our experiments, however, transaction prices and traded quantities consistently converge to levels above the competitive market equilibrium. Our preliminary analysis indicates that this deviation is primarily driven by non-competitive behaviour among buyer agents. Despite being instructed to maximise profits, buyers frequently bid their maximum price when asked to submit a bid, thus violating the standard assumption that agents seek to maximise utility (profits). Interestingly, this behaviour is not observed among sellers, despite near-identical (role-adjusted) prompts. This asymmetry suggests that the LLM encodes a role-based behavioural bias: associating sellers with competitive behaviour and buyers with cooperative behaviour. When this bias is accounted for, the observed price deviation aligns with theoretical predictions from economics. To our knowledge, this is the first study to apply economic theory as a tool for analysing LLM behaviour. Economic theory provides objective, consistent and tractable predictions about equilibrium outcomes in multi-agent systems. This offers a new lens for diagnosing and understanding LLM biases and decision-making patterns. Looking ahead, we plan to extend our analysis to other models (including open-source models) and apply mechanistic interpretability techniques to identify the internal representations that drive this behaviour. Ultimately, this research could inform the design and control of LLM-powered multi-agent systems, guided by principles from mechanism design – a field at the intersection of economics and game theory.

Reference: Smith, V. L. (1962). An Experimental Study of Competitive Market Behavior. Journal of Political Economy, 70(2), 111–137. https://doi.org/10.1086/258609
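
For reference, the competitive equilibrium that such a market is expected to reach can be computed directly from the induced supply and demand curves. The reservation prices below are made up for illustration, and the LLM bidding loop itself is not shown.

```python
# With a downward-sloping demand curve (sorted buyer reservation prices) and
# an upward-sloping supply curve (sorted seller reservation prices), the
# equilibrium quantity is the number of pairs with positive surplus, and any
# price in the resulting interval clears the market.
def competitive_equilibrium(buyer_values, seller_costs):
    buyers = sorted(buyer_values, reverse=True)   # demand curve
    sellers = sorted(seller_costs)                # supply curve
    q = 0
    while q < min(len(buyers), len(sellers)) and buyers[q] >= sellers[q]:
        q += 1
    if q == 0:
        return 0, None
    lo = sellers[q - 1] if q == len(buyers) else max(sellers[q - 1], buyers[q])
    hi = buyers[q - 1] if q == len(sellers) else min(buyers[q - 1], sellers[q])
    return q, (lo + hi) / 2                       # midpoint of the clearing interval

q, p = competitive_equilibrium([10, 9, 8, 7, 6], [5, 6, 7, 8, 9])
print(q, p)   # 3 trades at a clearing price of 7.5 for these toy values
```

Observed transaction prices above this benchmark are what the abstract describes as the deviation driven by buyer agents bidding their reservation price.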

Biography:

Pawel Struski is a PhD student at the University of Warsaw. His research lies at the intersection of economics and machine learning. He holds an MPhil degree in Economic Research from the University of Cambridge (2019) and a BSc degree in Economics from UCL (2018). He has previously worked as a research assistant at the Institute for Fiscal Studies and as an economist in the financial sector.

Katarzyna Woźnica photo

Katarzyna Woźnica

Warsaw University of Technology

Co-authors:

Piotr Wilczyński, Przemysław Biecek

Poster 52: SeFNet: Linking tabular datasets with semantic feature nets

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Tabular datasets play a significant role in a wide range of machine learning applications. However, although they often address similar problems, tabular datasets are typically treated as standalone tasks. The opportunities for using previously solved problems are limited due to the lack of structured contextual information about their features and the lack of understanding of the relations between them. To overcome this limitation, we propose a new methodology called Semantic Feature Net (SeFNet), capturing the semantic meaning of the analyzed tabular features. By leveraging existing ontologies and domain knowledge, SeFNet opens up new opportunities for sharing insights between diverse predictive tasks. One such opportunity is the Dataset Ontology-based Semantic Similarity (DOSS) measure, which quantifies the similarity between datasets using relations across their features. In this paper, we present an example of SeFNet’s application prepared for a collection of predictive tasks in healthcare, with the features’ relations derived from the SNOMED-CT ontology. The proposed SeFNet methodology and the accompanying DOSS measure address the issue of limited contextual information in tabular datasets. By incorporating domain knowledge and establishing semantic relations between features, we enhance the potential for meta-learning and enable valuable insights to be shared across different predictive tasks.
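
As a toy illustration of turning feature-level ontology similarities into a dataset-level score, one can aggregate best-match similarities in both directions. This is only an illustration of the general idea; it is not the DOSS definition from the paper.

```python
# One simple aggregation of pairwise feature similarities into a dataset-level
# score: symmetrized average best-match similarity (illustration only).
import numpy as np

def dataset_similarity(sim_matrix):
    """sim_matrix[i, j] = ontology-based similarity between feature i of
    dataset A and feature j of dataset B, each in [0, 1]."""
    a_to_b = sim_matrix.max(axis=1).mean()   # how well A's features are covered by B
    b_to_a = sim_matrix.max(axis=0).mean()   # and vice versa
    return (a_to_b + b_to_a) / 2

sim = np.array([[1.0, 0.2, 0.1],
                [0.3, 0.8, 0.0]])            # toy 2x3 feature similarity matrix
print(round(dataset_similarity(sim), 3))
```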

Biography:

Katarzyna is a researcher at Warsaw University of Technology with a PhD in Computer Science. Her work centers on AutoML, hyperparameter optimization, and meta-learning, with a focus on human-in-the-loop methods and Automated Data Science. She collaborates with physicians to develop machine learning solutions for medical applications.

Stanisław Pawlak photo

Stanisław Pawlak

Warsaw University of Technology

Co-authors:

Jan Dubiński, Daniel Marczak, Bartłomiej Twardowski

Poster 53: Backdoor Vectors: a Task Arithmetic View on Backdoor Attacks and Defenses

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Model merging (MM) recently emerged as an effective method for combining large deep learning models. However, it poses significant security risks. Recent research shows that it is highly susceptible to backdoor attacks, which introduce a hidden trigger into a single fine-tuned model instance that allows the adversary to control the output of the final merged model at inference time. In this work, we propose a simple framework for understanding backdoor attacks by treating the attack itself as a task vector. A Backdoor Vector (BV) is calculated as the difference between the weights of a fine-tuned backdoored model and a fine-tuned clean model. BVs reveal new insights into understanding attacks and provide a more effective framework for measuring their similarity and transferability. Furthermore, we propose a novel method, dubbed Sparse Backdoor Vector (SBV), that enhances backdoor resilience through merging by combining multiple attacks into a single one. We identify the core vulnerability behind backdoor threats in MM: inherent triggers that exploit adversarial weaknesses in the base model. To counter this, we propose Injection BV Subtraction (IBVS) -- an assumption-free defense against backdoors in MM. Our results show that SBVs surpass prior attacks and are the first method to leverage merging to improve backdoor effectiveness. At the same time, IBVS provides a lightweight, general defense that remains effective even when the backdoor threat is entirely unknown.
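
The Backdoor Vector definition from the abstract translates directly into code: subtract the clean fine-tuned weights from the backdoored ones, compare vectors by cosine similarity, and subtract an identified vector from a merged model as a defensive step. Model loading and the sparsification used in SBV/IBVS are omitted; the tensors below are placeholders.

```python
# Backdoor Vector as a task-arithmetic difference over model state dicts,
# plus a simple cosine-similarity comparison and a subtraction step.
import torch

def backdoor_vector(backdoored_state, clean_state):
    return {k: backdoored_state[k] - clean_state[k] for k in clean_state}

def flat(vec):
    return torch.cat([v.flatten() for v in vec.values()])

def cosine(v1, v2):
    a, b = flat(v1), flat(v2)
    return torch.dot(a, b) / (a.norm() * b.norm())

def subtract(state, vec, scale=1.0):
    """Remove a (scaled) backdoor vector from a merged model's weights."""
    return {k: state[k] - scale * vec.get(k, torch.zeros_like(state[k])) for k in state}
```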

Biography:

Stanisław Pawlak is a Ph.D. student and AI researcher working at Warsaw University of Technology. He received an M.Sc. degree in data science and a B.Sc. in applied computer science from the Warsaw University of Technology. Stanisław has coauthored multiple publications at world top-tier AI conferences, including NeurIPS 2023 and CVPR 2024. He also worked as a programmer, an AI engineer building ML-powered applications, and an AI consultant. His research focuses on continual learning, generative models, and ML security. His latest efforts aim to understand backdoors in the model merging learning paradigm.

Bartosz Jezierski photo

Bartosz Jezierski

Warsaw University of Technology

Co-authors:

Mateusz Jarosz, Vladimir Zaigrajew

Poster 54: Comparing Different Jailbreak Detection Methods

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

As Large Language Models (LLMs) are increasingly deployed in production, their vulnerability to novel 'jailbreak' attacks poses a significant security risk. Popular safeguards often fail to generalize to these unseen threats, creating a critical security gap. We evaluate this generalization failure by comparing prominent open-source safeguards against targeted, data-efficient fine-tuning of an encoder-only (ModernBERT) and a decoder-only (Gemma 3) model. Our experiments on the established benchmark indeed confirm the critical generalization gap against unseen jailbreaks. Guardrails like Llama Guard 2 and 3 achieve less than 55% accuracy on the unseen RedTeams2K dataset. We find that resource-efficient fine-tuning yields superior performance compared to baseline open-source safeguards. While even simple head-tuning of ModernBERT surpasses the baseline, more comprehensive LoRA fine-tuning on Gemma yields the best results. However, the largest performance gain, which boosts detection accuracy to over 95%, comes from introducing a small, curated set of just 200 similar jailbreak examples. By breaking down performance by category, our analysis exposed a significant, shared blind spot. We found that detecting prompts designed to violate privacy was a critical challenge that affected every model we tested. Our findings demonstrate that effective LLM safety is found not in large, static guardrails, but in an agile strategy that prioritizes rapidly fine-tuning smaller, general-purpose models on new, similar attack patterns as they appear.
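
A minimal sketch of the head-tuning baseline mentioned above: freeze a pretrained encoder and train only a small classification head on its pooled output. The checkpoint name, toy prompts, and label scheme are assumptions; LoRA fine-tuning would additionally adapt the backbone.

```python
# Head-tuning sketch: frozen encoder, trainable linear head on pooled output.
import torch
import torch.nn as nn
from transformers import AutoTokenizer, AutoModel

checkpoint = "answerdotai/ModernBERT-base"       # assumed encoder checkpoint
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
encoder = AutoModel.from_pretrained(checkpoint)
for p in encoder.parameters():
    p.requires_grad = False                       # backbone stays frozen

head = nn.Linear(encoder.config.hidden_size, 2)   # benign vs. jailbreak
optimizer = torch.optim.AdamW(head.parameters(), lr=1e-3)

prompts = ["What's the weather like?", "Ignore all previous instructions and ..."]
labels = torch.tensor([0, 1])
batch = tokenizer(prompts, padding=True, truncation=True, return_tensors="pt")

for _ in range(10):
    with torch.no_grad():
        hidden = encoder(**batch).last_hidden_state[:, 0]  # [CLS]-style pooling
    loss = nn.functional.cross_entropy(head(hidden), labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```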

Biography:

Data Science student at Warsaw University of Technology.

Paulina Kaczyńska photo

Paulina Kaczyńska

IPPT PAN

Co-authors:

Nazanin Amirinejad, Tomasz Lipniacki

Poster 55: Immunogenic cell death detection and classification with Machine Learning

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Viral infection can lead to regulated cell deaths that may have either a non-immunogenic (apoptosis) or an immunogenic character (pyroptosis or necroptosis). As immunogenic deaths allow adaptive immune responses to develop, differentiating and describing these processes would give us important insights into the development of immunogenic responses during various viral infections. Current tools predominantly detect apoptosis or general cell death, with relatively few methods capable of distinguishing between specific types. In particular, pyroptosis remains underexplored. Some recent approaches attempt to differentiate between apoptosis and ferroptosis or necrosis, but these typically operate at the image level rather than at single-cell resolution. Moreover, most existing methods rely on fluorescent markers that either translocate to dying cells or respond to caspase activity specific to a given death pathway. This fluorescence-based approach is constrained by the practical limit of using only a few simultaneous markers, reducing the ability to visualize other critical aspects of virus-cell interactions. To address these limitations, we developed a dataset for detecting and classifying apoptosis and pyroptosis based on cell morphology in brightfield microscopy images. Pyroptotic cells were manually annotated by experts using morphological criteria, while propidium iodide staining provided reference information on cell death. The dataset supports two use cases: image classification (single cells and their immediate surroundings) and object detection with bounding boxes. All data were collected from live-cell time-lapse microscopy, enabling temporal analysis. Quantitative features extracted from the cells are also available in tabular format. We evaluate the performance of deep convolutional neural networks on this dataset—first in the classification setting and then in the object detection task. Finally, we explore several strategies for incorporating temporal information to improve detection and classification performance over time.

Biography:

Paulina is currently in the first year of her PhD at the Institute of Fundamental Technological Research of the Polish Academy of Sciences (IPPT PAN). Her research focuses on modeling regulatory networks at single-cell resolution using machine learning methods. She holds a Bachelor's degree in Physics and a Bachelor's degree in Cognitive Science, both obtained through the College of Inter-Faculty Individual Studies in Mathematics and Natural Sciences (MISMaP) at the University of Warsaw. She later completed a Master's degree in Machine Learning at the University of Warsaw, where her thesis explored the visualization of node features using Accumulated Local Effects in Graph Neural Networks.

Ryszard Staruch photo

Ryszard Staruch

Adam Mickiewicz University

Co-authors:

Filip Graliński, Daniel Dzienisiewicz

Poster 56: Adapting LLMs for Minimal-edit Grammatical Error Correction

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Decoder-only large language models have shown superior performance in fluency-edit English Grammatical Error Correction (GEC), but their adaptation for minimal-edit English GEC is still underexplored. To improve their effectiveness in the minimal-edit approach, we explore error rate adaptation and propose a novel training schedule method. Our experiments set a new state-of-the-art result for a single-model system on the BEA-test set. We also detokenize the most common English GEC datasets to match the natural way of writing text. During the process, we find that there are errors in them. Our experiments analyze whether training on detokenized datasets impacts the results and measure the impact of the usage of the datasets with corrected erroneous examples. To facilitate reproducibility, we have released the source code used to train our models.

Biography:

Ryszard is a PhD student at Adam Mickiewicz University and a machine learning researcher at the Center for Artificial Intelligence AMU. His main research areas are grammatical error correction and information retrieval. He is the author of papers presented at international NLP conferences and the winner of the multilingual grammatical error correction shared task MultiGEC-2025.

Łukasz Niedźwiedzki photo

Łukasz Niedźwiedzki

University of Warsaw/Taxus IT

Co-authors:

Abigail Turner, Mononito Goswami, Artur Dubrawski

Poster 57: Exploiting Spatial Structure with Time Series Foundation Models

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Multivariate time series data is pervasive in real-world applications, from climate monitoring and traffic forecasting to clinical diagnostics. Despite this, current foundation models like MOMENT often fall short in capturing the complex structure inherent to such data. In particular, MOMENT processes each channel independently during operations like patching, which limits its ability to model the rich inter-variable relationships that are often critical in real-world settings. Many domains exhibit strong spatial or functional dependencies between variables - for example, between nearby sensors in a physical environment - which are ignored by treating channels in isolation. This project addresses these limitations by integrating graph neural networks with MOMENT to explicitly model relationships between channels. By embedding spatial and inter-variable dependencies into the model architecture, we aim to better capture the underlying structure of multivariate time series data. This integration introduces structural inductive biases that encourage the model to learn more meaningful representations and to generalize more effectively across different datasets and domains. We conduct extensive evaluations to assess both the predictive performance and the internal representations learned by the model. Through a variety of analyses, we investigate how incorporating graph-based relational modeling impacts the way MOMENT encodes and utilizes cross-variable information, shedding light on the mechanisms that drive improved performance.
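
One way to picture the proposed integration is a single message-passing step over a channel graph, applied to per-channel embeddings. The embeddings and adjacency below are placeholders; in the project they would come from a frozen MOMENT encoder and from known spatial or functional relations between variables.

```python
# Minimal message-passing step over a channel graph: each channel's embedding
# is mixed with its neighbours' via a row-normalized adjacency matrix.
import torch
import torch.nn as nn

class ChannelGraphMixer(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.lin = nn.Linear(dim, dim)

    def forward(self, channel_emb, adj):            # (C, dim), (C, C)
        adj = adj + torch.eye(adj.size(0))           # add self-loops
        deg = adj.sum(dim=1, keepdim=True)
        messages = (adj / deg) @ channel_emb         # neighbourhood aggregation
        return torch.relu(self.lin(messages))

channels, dim = 5, 32
emb = torch.randn(channels, dim)                     # placeholder channel embeddings
adj = torch.tensor([[0, 1, 0, 0, 1],
                    [1, 0, 1, 0, 0],
                    [0, 1, 0, 1, 0],
                    [0, 0, 1, 0, 1],
                    [1, 0, 0, 1, 0]], dtype=torch.float)
print(ChannelGraphMixer(dim)(emb, adj).shape)        # torch.Size([5, 32])
```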

Biography:

Łukasz Niedźwiedzki is a Master's student at the Faculty of Physics, University of Warsaw. He also works as a Machine Learning Engineer at Taxus IT and actively participates in research at the Auton Lab at Carnegie Mellon University and the MI2 Lab at Warsaw University of Technology.

Hubert Plisiecki photo

Hubert Plisiecki

Polish Academy of Sciences

Co-authors:

Paweł Lenartowicz, Artur Pokropek, Maria Flakus

Poster 58: Words Apart: Mapping Psychological Differences Through Semantic Space

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

This study introduces a new method for analyzing how word meanings shift across psychological groups, using 1300 essays from participants who also completed measures of attitudes like collective narcissism and trust in science. We examine how people high versus low on these psychological dimensions use language differently when discussing topics such as national identity, migration, and climate change. Our word2vec-based technique identifies how key concepts take on different meanings between contrasting groups by comparing word associations in their respective language samples. By retro-fitting target words based on an external pretrained word2vec model, we bypass the common word2vec data sparsity problem, allowing us to capture meaningful shifts in language use. The method provides two valuable insights for psychological research: (1) statistical inference (p-values) of semantic differences between groups, and (2) visualization of how concepts relate differently across psychological divides by examining nearby words in each group's semantic space. This method expands the applications of machine-learning-based NLP methods within the realm of social sciences, offering a new window into how individual differences manifest in language use, but can also serve as inspiration for repurposing older NLP methods to new tasks.
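
A simplified sketch of the group-comparison step using gensim, without the retrofitting against an external pretrained word2vec model described in the abstract: train one model per group and compare a target word's nearest neighbours.

```python
# Train a word2vec model per group and compare a target word's neighbourhood;
# the essay corpora are placeholders and hyperparameters are arbitrary.
from gensim.models import Word2Vec

def group_model(tokenized_essays):
    return Word2Vec(sentences=tokenized_essays, vector_size=100,
                    window=5, min_count=2, epochs=20, seed=0)

def neighbourhood_shift(model_a, model_b, word, topn=10):
    near_a = {w for w, _ in model_a.wv.most_similar(word, topn=topn)}
    near_b = {w for w, _ in model_b.wv.most_similar(word, topn=topn)}
    jaccard = len(near_a & near_b) / len(near_a | near_b)
    return 1.0 - jaccard, near_a - near_b, near_b - near_a

# high_group / low_group: lists of tokenized essays from participants scoring
# high vs. low on a given psychological scale (placeholders).
# shift, only_high, only_low = neighbourhood_shift(
#     group_model(high_group), group_model(low_group), "migration")
```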

Biography:

Hubert Plisiecki is an AI researcher specializing in the application of machine learning and NLP to research in social sciences, particularly in modeling psychological phenomena in text. Currently completing his PhD at the Polish Academy of Sciences, Hubert has authored multiple publications on innovative machine learning approaches in social sciences, including articles in journals such as Behavior Research Methods. He is also a co-founder of the Society for Open Science, promoting research quality and transparency in Poland and abroad.

Dawid Plaskowski photo

Dawid Plaskowski

Auton Lab @ Carnegie Mellon University / Łukasiewicz Research Network

Co-authors:

Willa Potosnak, Michał Williński, Artur Dubrawski

Poster 59: Beyond Context Limits: Retrieval-Augmented Time Series Forecasting with Prior-data Fitted Networks and Time Series Foundation Models

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

In-context learning has emerged as a powerful paradigm in large language models, enabling few-shot generalization from carefully chosen examples. We are interested in investigating this capability beyond language, focusing on the generalization of Prior-data Fitted Networks (PFNs), pretrained only on synthetic data for tabular tasks, to time series. Prior work on tabular foundation models such as TabPFN, applied to time series forecasting tasks, shows that on GIFT-eval only ~40% of the context window is effectively used, and adding more in-context examples can sharply degrade performance. This motivates the question: can retrieval better utilize context, matching or exceeding performance with fewer examples and extending the effective context window through targeted selection? We propose a retrieval-augmented forecasting framework for tabular foundation models such as TabPFN and Mitra, adapting principles from Retrieval-Augmented Generation (RAG) in LLMs. Our approach uses MOMENT, a time-series foundation model, as the retrieval module, drawing from both domain-specific sources and large-scale corpora such as TimeSeriesPile to select semantically relevant time series at test time, which are then incorporated into the model’s in-context examples.
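
The retrieval step can be sketched as plain nearest-neighbour search in an embedding space; the summary-statistics encoder below is only a stand-in for a time-series foundation model such as MOMENT, and the candidate pool is synthetic.

```python
# Retrieve the most similar candidate series for a query window by cosine
# similarity in an embedding space; retrieved series become in-context examples.
import numpy as np

def embed(series_batch):
    # Placeholder encoder: summary statistics instead of a foundation model.
    return np.stack([[s.mean(), s.std(), s.min(), s.max()] for s in series_batch])

def retrieve(query, pool, k=3):
    q = embed([query])[0]
    P = embed(pool)
    sims = P @ q / (np.linalg.norm(P, axis=1) * np.linalg.norm(q) + 1e-9)
    top = np.argsort(-sims)[:k]
    return [pool[i] for i in top], sims[top]

rng = np.random.default_rng(0)
pool = [rng.normal(loc=m, scale=1.0, size=64) for m in range(10)]
query = rng.normal(loc=3.2, scale=1.0, size=64)
examples, scores = retrieve(query, pool, k=3)
print(scores)    # retrieved examples are then placed in the model's context
```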

Biography:

I am an aspiring researcher exploring the inner workings of neural networks through interpretability and neural scaling laws. My experience spans applied machine learning in both academic and industrial settings, including two research internships at Carnegie Mellon University’s Robotics Institute Summer Scholars program, work at the Łukasiewicz Research Network on natural language processing projects, and contributions at a drone technology startup.

Julian Kędys photo

Julian Kędys

Poznan Supercomputing and Networking Center, PAS

Co-authors:

Cezary Mazurek

Poster 60: Interpretable Coupling Structure Beyond Deep Learning: Probabilistic and Energy-Based Modelling of Multivariate (Neural) Dynamics

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Many scientific problems present high-dimensional, noisy and small-N observations where deep nets can be data-hungry and opaque. I will present a set of methods that combines energy-based models and probabilistic/Bayesian inference to model multivariate dynamics with interpretable coupling structure. Concretely, we use the pairwise maximum-entropy model (PMEM) - the Ising/Boltzmann machine without hidden units - as an undirected Markov random field over binary (or binarised) variables, giving a principled, minimum-assumption distribution that matches empirical first- and second-order statistics. Methodologically, I will cover three complementary estimators that trade off bias, variance and compute: (i) exact maximum likelihood via full state enumeration for small systems; (ii) a scalable pseudo-likelihood optimiser with safeguarded line search and L2 regularisation; and (iii) a variational Bayes formulation that places Gaussian priors on fields and couplings to yield posterior means/precisions and credible intervals, which are useful when data are limited. Sampling-based MCMC (Numba-accelerated Metropolis, adaptive multi-chain with R-hat) serves both for model checking and for deriving kinetic summaries. Beyond fitting, the methodology turns parameters into mechanistic diagnostics. We compute phase-diagram analyses in (μ,σ) space of the coupling matrix to place subjects/groups relative to critical regimes (a qualitative characterisation of how the system's dynamics are organised); derive energy landscapes and disconnectivity graphs that expose stable states and transition barriers; and construct state-to-state transition matrices, dwell times and transition pathways using single-spin-flip kinetics, providing a kinetic extension of the equilibrium model. Bootstrap + ridge shrinkage give robust control baselines, while balanced objective weights stabilise subject positioning. Strengths include (a) interpretability by design (explicit couplings and energy), (b) small-data suitability (max-entropy constraints, VB uncertainty, PL scalability), and (c) generative capability (the model defines p(s); MCMC draws synthetic trajectories). I will also outline practical limitations - e.g., binarisation assumptions, scaling to very large N, and equilibrium vs. real-world dynamics - and discuss mitigations (regularisation, block bootstrap uncertainty, kinetic extensions, and when deep learning is preferable). Although the talk will be illustrated on resting-state neural dynamics in neurodevelopmental cohorts - mapping altered basins, abnormal couplings between regions, and temporal signatures - the methodology is domain-agnostic. It applies to any system with meaningful binary (or discretised) states: genomics (gene on/off), proteomics contacts, ecology, and complex systems in physics or social science. The focus is on the general principles and reusable methods, not domain specifics. The session will include a short illustration of the visual analytics (phase diagrams; probability fit indices; barrier distributions) and several engineering improvements (adaptive samplers; robust contour-based positioning) that make these classical models practical for modern hypothesis-testing and ML-enabled research workflows. Overall, the talk advocates a probabilistic and energy-based alternative - complementary to deep learning - whenever interpretability, uncertainty, and mechanistic structure are primary goals, including in scientific contexts.
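
For concreteness, a minimal pairwise maximum-entropy (Ising) energy and a single-spin-flip Metropolis sampler are sketched below; the exact-ML, pseudo-likelihood, and variational Bayes estimators discussed in the abstract are not reproduced, and the fields and couplings are random placeholders.

```python
# Pairwise maximum-entropy (Ising) energy and a basic Metropolis sampler.
import numpy as np

def energy(s, h, J):
    """E(s) = -h.s - 0.5 * s.J.s for spins s in {-1, +1}^N (J symmetric, zero diagonal)."""
    return -h @ s - 0.5 * s @ J @ s

def metropolis(h, J, n_steps=10000, rng=None):
    rng = rng or np.random.default_rng(0)
    n = len(h)
    s = rng.choice([-1, 1], size=n)
    samples = []
    for _ in range(n_steps):
        i = rng.integers(n)
        delta = 2 * s[i] * (h[i] + J[i] @ s)     # energy change of flipping spin i
        if delta <= 0 or rng.random() < np.exp(-delta):
            s[i] = -s[i]
        samples.append(s.copy())
    return np.array(samples)

rng = np.random.default_rng(1)
n = 8
J = rng.normal(0, 0.2, (n, n)); J = (J + J.T) / 2; np.fill_diagonal(J, 0.0)
h = rng.normal(0, 0.1, n)
samples = metropolis(h, J)
print(samples.mean(axis=0))                       # empirical magnetizations
```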

Biography:

Julian earned a B.Sc. in Artificial Intelligence from Poznań University of Technology in 2024, where he was an active member of the GHOST (“Group of Horribly Optimistic STatisticians”) Student Scientific Group. During the programme, he spent two semesters at the University of Luxembourg, where he also completed a three-month research placement in applied reinforcement learning and computer vision. Since graduating, Julian has been building on his machine-learning background by pivoting into computational neuroscience. In late 2024, he undertook a three-month research internship at the International Research Center for Neurointelligence (IRCN), University of Tokyo, studying abnormal brain dynamics in neurodevelopmental conditions, including autism spectrum disorder (ASD), through latent-representation analysis and mathematical modelling with tools drawing on statistical physics and machine learning. Since December 2024, he has been affiliated with the Poznań Supercomputing and Networking Center (PSNC), PAS, Department for Digital Medicine. His work spans computational neuroscience, deep-learning applications for computational biology and medicine, and exploratory analyses of quantum-computing applications in the life sciences and engineering.

Kajetan Dymkiewicz photo

Kajetan Dymkiewicz

University of Cambridge

Co-authors:

Ivan Vulic, Helen Yannakoudakis, Eilam Shapira, Roi Reichart

Poster 61: Unpacking the Potential: Refining and Evaluating Language Models Across Dimensions

Friday / 17 October 12:15 - 13:45 (Poster Session 2)

Abstract:

Large Language Models (LLMs) demonstrate strong performance across tasks and languages, but the mechanisms of cross-dimensional transfer (how gains in one capability, task, or language influence others) remain poorly understood. In this work, we consider four such dimensions - language, task type, model family, and model size - and systematically evaluate transfer patterns across them. We evaluate seven state-of-the-art models from the LLaMA 3 and Qwen 2.5 families, spanning sizes from 0.5B to 8B parameters. Using multilingual benchmarks covering reasoning, factuality, coding, fairness, and math, we fine-tune models on individual task–language pairs and evaluate their performance across the entire evaluation space. Our analysis reveals a consistent asymmetry: within-task, cross-lingual transfer is broadly beneficial, while cross-task transfer frequently incurs collateral harm. Transfer is structured by donor–recipient roles: certain hub languages (e.g., Turkish, Dutch) and tasks (e.g., Factuality, Fairness) export substantial gains but often degrade others, whereas Coding and Math act as comparatively benign donors yet brittle recipients. The results suggest that effective fine-tuning strategies must explicitly account for these dynamics, balancing on-task improvements with the risk of off-task degradation to unlock the full potential of LLMs across languages and capabilities.

Biography:

Kajetan is a PhD student at the Language Technology Lab at University of Cambridge. He obtained his MSc in AI at King's College London and a BSc in Computer Science at Wrocław University of Science and Technology. His research focuses on the intersection of LLM efficiency and AI Safety.

/ Student Research Workshop (SRW) Talks

Aleksandra Gwiazda photo

Aleksandra Gwiazda

Warsaw University of Technology

Co-authors:

Paulina Szymczak, Ewa Szczurek

SRW Talk 1: Attribute-Regularized VAEs for Controlled and Interpretable Antimicrobial Peptide Design

Wednesday / 15 October 8:10 - 8:30 Main Hall (Student Research Workshop)

Abstract:

Antibiotic resistance is a growing global health threat, driving the need for alternative therapeutics. Antimicrobial peptides (AMPs) offer a promising solution due to their broad-spectrum activity and lower propensity to induce resistance. While deep learning–based generative models, such as variational autoencoders (VAEs), have been explored for designing novel AMPs, their limited controllability and interpretability hinder practical application. In this work, we present the Attribute-Regularized VAE (AR-VAE), a generative model that explicitly incorporates key peptide attributes directly into the training objective. A regularization loss is applied to structure the latent space, enabling controlled traversal along interpretable directions corresponding to charge, length, and hydrophobicity. Experimental results demonstrate that AR-VAE achieves improved latent space disentanglement and enhanced performance on metrics of interpretability, modularity, and predictability. The model effectively captures and manipulates properties correlated with antimicrobial activity, enabling targeted AMP generation. AR-VAE offers a principled and interpretable approach to AMP design, addressing the urgent problem of antimicrobial resistance (AMR).
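
One common way to implement such an attribute-regularization term (a sketch of the general sign/tanh formulation, not necessarily the exact loss used in this work) encourages a chosen latent dimension to order samples consistently with the attribute:

```python
# Attribute-regularization sketch: pairwise differences along one latent
# dimension should agree in sign with pairwise differences of the attribute.
import torch

def attribute_regularization(z_dim, attribute, delta=10.0):
    """z_dim: (batch,) values of the regularized latent dimension;
    attribute: (batch,) attribute values (e.g. charge, length, hydrophobicity)."""
    dz = z_dim.unsqueeze(0) - z_dim.unsqueeze(1)          # pairwise latent differences
    da = attribute.unsqueeze(0) - attribute.unsqueeze(1)  # pairwise attribute differences
    return torch.nn.functional.l1_loss(torch.tanh(delta * dz), torch.sign(da))

# total_loss = reconstruction + beta * kl + gamma * sum of one such term
# per (latent dimension, attribute) pair.
```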

Biography:

Aleksandra Gwiazda obtained her Bachelor's degree in Biomedical Engineering from the Warsaw University of Technology in February 2023. She is currently pursuing a specialization in Artificial Intelligence as part of the Informatics program at WUT. Aleksandra is collaborating with Prof. Ewa Szczurek's research group at Helmholtz.

Łukasz Sztukiewicz photo

Łukasz Sztukiewicz

Independent Researcher

Co-authors:

Ignacy Stępka, Michał Wiliński, Jerzy Stefanowski

SRW Talk 2: DetoxAI - Python Package for Debiasing Neural Networks

Wednesday / 15 October 8:30 - 8:50 Main Hall (Student Research Workshop)

Abstract:

Despite growing awareness of fairness in machine learning, the lack of dedicated debiasing techniques and practical software frameworks remains a major barrier - especially in vision tasks. To address this, we introduce DetoxAI, a Python-based framework for post-hoc debiasing of neural networks in image classification. Built with deep learning workflows in mind, DetoxAI combines interventions, fairness metrics, and visualization tools into a single, production-ready package. Our method uses post-training adaptations to reduce bias without degrading model accuracy. By operating on high-level semantic features, DetoxAI tackles the unique challenge of vision models, where sensitive attributes like race or gender are rarely explicitly represented. The toolkit offers a modular and accessible interface, making it suitable for real-world use across different domains. Through experiments, we show that DetoxAI consistently improves the fairness-accuracy trade-off over standard models. Attribution map analyses further confirm that DetoxAI reduces reliance on protected features. Parts of this work were published and presented at XAI Late-Breaking Work 2025, KDD UMC 2025, and the ECML Demo Track 2025.

Biography:

Łukasz Sztukiewicz holds a Bachelor of Science degree in Artificial Intelligence from Poznan University of Technology. He was a research fellow at Carnegie Mellon University and worked as a machine learning engineer at Molecule.one.

Mikołaj Janusz photo

Mikołaj Janusz

Jagiellonian University

Co-authors:

Adam Wróbel, Bartosz Zieliński, Dawid Rymarczyk

SRW Talk 3: OMENN: One Matrix to Explain Neural Networks

Wednesday / 15 October 8:50 - 9:10 Main Hall (Student Research Workshop)

Abstract:

Deep Learning (DL) models are often black boxes, which makes their decision-making processes difficult to interpret. This lack of transparency has driven advancements in eXplainable Artificial Intelligence (XAI), a field dedicated to clarifying the reasoning behind DL model predictions. Among these, attribution-based methods such as LRP and GradCAM are widely used, though they rely on approximations that can be imprecise. To address these limitations, we introduce One Matrix to Explain Neural Networks (OMENN), a novel post-hoc method that represents a neural network as a single, interpretable matrix for each specific input. This matrix is constructed through a series of linear transformations that represent the processing of the input by each successive layer in the neural network. As a result, OMENN provides locally precise, attribution-based explanations of the input across various modern models, including ViTs and CNNs. We present a theoretical analysis of OMENN based on the dynamic linearity property and validate its effectiveness with extensive tests on two XAI benchmarks, demonstrating that OMENN is competitive with state-of-the-art methods. The work is available on arXiv.
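
The "one matrix" idea is easiest to see for a ReLU network without biases, where each input activates a fixed pattern of neurons and the whole network collapses to a single input-dependent linear map. The sketch below only illustrates that property under this simplifying assumption; it is not the OMENN method itself.

    import torch
    torch.manual_seed(0)

    # Toy bias-free ReLU MLP: f(x) = W3 relu(W2 relu(W1 x))
    W1, W2, W3 = torch.randn(8, 4), torch.randn(8, 8), torch.randn(3, 8)

    def forward(x):
        return W3 @ torch.relu(W2 @ torch.relu(W1 @ x))

    x = torch.randn(4)
    h1 = torch.relu(W1 @ x)
    h2 = torch.relu(W2 @ h1)
    # The ReLU activation pattern acts as an input-dependent diagonal mask
    D1 = torch.diag((h1 > 0).float())
    D2 = torch.diag((h2 > 0).float())
    # A single matrix that exactly reproduces the network's output for this input
    M = W3 @ D2 @ W2 @ D1 @ W1
    print(torch.allclose(M @ x, forward(x), atol=1e-5))  # True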

Biography:

Mikołaj Janusz is a Computer Science M.Sc. student at the Jagiellonian University and a Student Researcher at GMUM (gmum.net). During his studies, he conducted research on XAI for drug discovery as a student participant in the FIRST TEAM FENG program (grant "Interpretable and Interactive Multimodal Retrieval in Drug Discovery"), various topics in computer vision, and pruning regimes. Beyond his research, he gained practical software engineering experience in distributed systems and compute infrastructure during internships at Google, Meta, Amazon, and the systematic-trading firm Quadrature. Currently, he's mainly interested in topics of compute, applied machine learning, and software engineering.

Tomasz Dądela photo

Tomasz Dądela

Jagiellonian University

Co-authors:

Adam Kania, Przemysław Spurek, Maciej Rut

SRW Talk 4: From Blurry to Sharp: Enabling High-Frequency Detail in Implicit Neural Representations

Wednesday / 15 October 9:10 - 9:30 Main Hall (Student Research Workshop)

Abstract:

Implicit Neural Representations have recently gained attention as a powerful approach for continuously representing signals – such as images, videos, and 3D shapes – using multilayer perceptrons (MLPs). However, MLPs are known to exhibit a low-frequency bias, which limits their ability to capture high-frequency details accurately. Our aim is to find optimal parameters using a compute-efficient method, without the need to perform multiple rounds of representation learning. Such a configuration should enable the model to learn the details of the representation more quickly. We explore the relationship between model performance and the frequency of the embedding, the frequency of the target image, as well as key MLP design choices such as the number of features and layers. Alongside experiments with real-world images, we created a synthetic dataset to identify individual factors that determine the performance of a particular model.
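
One standard way to give an MLP access to high-frequency detail is a Fourier-feature (positional) embedding of the input coordinates, where the maximum embedding frequency is exactly the kind of knob studied in this work. The sketch below is a generic illustration with arbitrary sizes, not the authors' experimental setup.

    import torch
    import torch.nn as nn

    class FourierFeatures(nn.Module):
        # Map 2-D coordinates to sines/cosines of increasing frequency; a higher
        # max_freq lets the downstream MLP fit sharper image detail.
        def __init__(self, num_freqs=16, max_freq=64.0):
            super().__init__()
            self.freqs = torch.linspace(1.0, max_freq, num_freqs)

        def forward(self, xy):                                    # xy: (N, 2) coords in [0, 1]
            angles = 2 * torch.pi * xy[..., None] * self.freqs   # (N, 2, num_freqs)
            return torch.cat([angles.sin(), angles.cos()], dim=-1).flatten(1)

    enc = FourierFeatures()
    mlp = nn.Sequential(nn.Linear(2 * 2 * 16, 256), nn.ReLU(), nn.Linear(256, 3))
    coords = torch.rand(1024, 2)        # sampled pixel locations
    rgb = mlp(enc(coords))              # predicted colours at those pixels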

Biography:

Tomasz Dądela is a final year Machine Learning M.Sc. student at the Jagiellonian University. Currently, he is actively involved in projects with the Group of Machine Learning Research (GMUM) at the Jagiellonian University.

Rafał Malcewicz photo

Rafał Malcewicz

Carnegie Mellon University, GHOST Day

Co-authors:

Ignacy Stępka, Abby Turner, Artur Dubrawski

SRW Talk 5: Semantic Label Reconstruction: How to breach privacy in Federated Learning

Wednesday / 15 October 9:50 - 10:10 Main Hall (Student Research Workshop)

Abstract:

Federated Learning enables clients to collaboratively train models by sharing gradients with a central server or with each other, offering both computational efficiency and enhanced privacy compared to sharing raw data. However, prior research has demonstrated that gradient inversion attacks can partially reconstruct the original input data from shared gradients. Unfortunately, this approach faces significant limitations, particularly in high-batch-size settings, due to the inherent information loss during gradient computation, which severely degrades reconstruction quality. In this work, we propose a shift in focus from input reconstruction to task reconstruction, specifically targeting the recovery of semantic labels from shared gradients. While this form of attack does not reveal full input data, it still constitutes a meaningful privacy breach. Importantly, semantic label reconstruction remains more robust under increasing batch sizes compared to full data reconstruction, making it a viable and concerning threat vector in practical federated learning scenarios.

Biography:

Rafał Malcewicz is pursuing a Bachelor of Science and Technology at Aalto University. He has research experience at the Auton Lab, Carnegie Mellon University, where he worked on gradient inversion attacks in the federated learning setting. Currently, he serves as the Project Leader of GHOST Day AMLC (Applied Machine Learning Conference).

Nikodem Świerkowski photo

Nikodem Świerkowski

Wrocław University of Science and Technology

Co-authors:

Daniel Borkowski, Mikołaj Jastrzębski

SRW Talk 6: Filling in the Blanks: Redefining Inpainting with a Novel Geometrical Latent Space VAE Architecture

Wednesday / 15 October 10:10 - 10:30 Main Hall (Student Research Workshop)

Abstract:

Image inpainting — reconstructing missing parts of an image — is a crucial task in computer vision with applications in photo restoration, image editing, and beyond. This task requires generative models that not only produce visually coherent outputs but also reason about the uncertainty and semantic context inherent in incomplete data. While lightweight convolutional autoencoders and GANs are commonly used due to their simplicity and training speed, they lack a probabilistic treatment of uncertainty and limit the interpretability and exploration of latent space. In contrast, Variational Autoencoders (VAEs) offer a more expressive framework for modeling the data distribution and generating diverse, plausible reconstructions. We compare four VAE-based models: the standard VAE, VQ-VAE (which uses latent space quantization), VAE-GAN (incorporating an adversarial component), and TreeVI — a model proposed at NeurIPS 2024 that captures inter-sample correlations using a tree-structured latent space built via a Minimum Spanning Tree (MST). While TreeVI improves reconstruction quality, its computational overhead and non-differentiable graph structure make it difficult to scale in practice. To address these limitations, we introduce GeoVAE — a novel model that treats the latent space as a graph, where edges represent correlations between dimensions. Unlike TreeVI, GeoVAE learns these correlations dynamically through two graph convolutional networks (GNNs): GNNinst, which captures inter-sample relationships, and GNNdim, which estimates the importance of individual latent dimensions. The reparameterization trick is extended into an adaptive mechanism that allows for scalable, fully differentiable correlation modeling. This enables learning on true correlation structures, rather than approximating them through trees, showcasing the potential of integrating geometric learning into standard deep generative models. We evaluate all models on a dense inpainting task involving 64×64 RGB images with randomly placed square masks of varying size and location. The models are trained to reconstruct the missing regions conditioned on the visible context. GeoVAE achieves the lowest MSE across all models, with competitive SSIM and PSNR. Notably, its per-epoch training time is only ~1.5× that of the base VAE and significantly faster than TreeVI, while delivering better semantic and perceptual coherence.

Biography:

Nikodem Świerkowski is a Master's student in Artificial Intelligence at Wrocław University of Science and Technology. He holds a Bachelor's degree in Applied Computer Science from the same university. His current research focuses on enhancing the effectiveness of state-of-the-art video plagiarism detection methods. He is an enthusiast of probabilistic graphical models, explainable AI, and cooking :)

Paulina Hładki photo

Paulina Hładki

Poznan University of Technology

Co-authors:

Marek Justyna, Maciej Antczak, Marta Szachniuk

SRW Talk 7: GraphaFold: Graph Neural Network for predicting non-canonical RNA base pairing

Wednesday / 15 October 10:30 - 10:50 Main Hall (Student Research Workshop)

Abstract:

Recent breakthroughs in biomedicine, such as mRNA vaccines and gene-editing technologies, have highlighted the critical role of RNA molecules in health and disease. RNA (ribonucleic acid) is not just a carrier of genetic information; it also regulates cellular processes, assembles complex molecular machines, and can even act as a biological catalyst. The function of RNA depends largely on its three-dimensional structure, which is formed through networks of hydrogen bonds - similar to magnets holding different parts of the molecule together. These bonds create base pairs, which come in two types: canonical (making up about 70% of all base pairs in RNA) and non-canonical (alternative, which can constitute up to 30%). While most computational tools focus on canonical base pairs, non-canonical interactions are crucial for shaping RNA's structure, especially in flexible regions and in contacts that determine how RNA interacts with proteins and other molecules. However, predicting these non-canonical base pairs from sequence data remains a major challenge, as they are highly variable and less well-represented in existing training data. In this work, we introduce GraphaFold, a graph neural network (GNN)-based approach designed to predict non-canonical RNA base pairs. Our method starts with known canonical interactions - either from experiments or existing software - and represents the RNA molecule as a graph, where each nucleotide is a node and possible interactions are edges. We use advanced neural embeddings, generated by a pretrained RNA language model, to provide rich input features. The GNN learns to recognize complex patterns associated with non-canonical base pairing by leveraging features of the sequence, structural context, and graph connectivity. To address the problem of class imbalance (since non-canonical pairs are relatively rare), we use a reweighted loss function during training. We created a comprehensive dataset of RNA fragments from high-resolution 3D structures, capturing a wide variety of non-canonical motifs, to train and rigorously test the model. Benchmarks show that GraphaFold outperforms current deep learning tools in predicting non-canonical interactions, while scaling well to different RNA types and sizes. The method offers a practical solution for advancing RNA structure modeling, as GraphaFold’s predictions can enrich 2D structural maps and serve as valuable constraints for more detailed 3D modeling.

Funding: This research was supported by grant 2024/53/B/ST6/02789 from the National Science Centre, Poland.

Biography:

Paulina Hładki is a master’s student in computer science with a specialization in artificial intelligence at Poznan University of Technology. She is currently working on deep learning models for RNA structure prediction. With experience in both academic research and international IT environments, she bridges theoretical insight with practical machine learning applications.

Łukasz Janisiów photo

Łukasz Janisiów

Jagiellonian University

Co-authors:

Marek Kochańczyk, Bartosz Zieliński, Tomasz Danel

SRW Talk 8: Enhancing Chemical Explainability Through Counterfactual Masking

Wednesday / 15 October 10:50 - 11:10 Main Hall (Student Research Workshop)

Abstract:

Molecular property prediction is a crucial task that guides the design of new compounds, including drugs and materials. While explainable artificial intelligence methods aim to scrutinize model predictions by identifying influential molecular substructures, many existing approaches rely on masking strategies that remove either atoms or atom-level features to assess importance via fidelity metrics. These methods, however, often fail to adhere to the underlying molecular distribution and thus yield unintuitive explanations. In this work, we propose counterfactual masking, a novel framework that replaces masked substructures with chemically reasonable fragments sampled from generative models trained to complete molecular graphs. Rather than evaluating masked predictions against implausible zeroed-out baselines, we assess them relative to counterfactual molecules drawn from the data distribution. Our method offers two key benefits: (1) molecular realism underpinning robust and distribution-consistent explanations, and (2) meaningful counterfactuals that directly indicate how structural modifications may affect predicted properties. We demonstrate that counterfactual masking is well-suited for benchmarking model explainers and yields more actionable insights across multiple datasets and property prediction tasks. Our approach bridges the gap between explainability and molecular design, offering a principled and generative path toward explainable machine learning in chemistry.

Biography:

Łukasz Janisiów is a first-year doctoral student in the Group of Machine Learning Research at Jagiellonian University and a participant in the ELLIS PhD Program. His research focuses on the application of explainable artificial intelligence (XAI) in drug discovery.

/ Tutorials

Oleksii Furman photo

Oleksii Furman

Wrocław University of Science and Technology

Łukasz Lenkiewicz photo

Łukasz Lenkiewicz

Wrocław University of Science and Technology

Marcel Musiałek photo

Marcel Musiałek

Wrocław University of Science and Technology

Tutorial 1: Counterfactual Explanations: From Theory to Implementation

Saturday / 18 October 9:00 - 13:00

Description:

This tutorial demystifies Counterfactual Explanations, a powerful technique for making black-box AI models transparent and actionable. Unlike traditional XAI methods that simply highlight important features, counterfactuals provide practical, user-friendly answers to the essential question: “What needs to change to get a different outcome?” In this tutorial, participants will progress from theoretical foundations to implementing counterfactual methods and using them. You’ll learn how to generate explanations that help end-users understand not just why they were denied a loan, but precisely what they need to change to get approved next time. The tutorial balances theory with extensive coding exercises. We’ll cover both basic implementations and advanced techniques using modern methods. By the end, you’ll understand how to use counterfactual explanations for your AI systems to improve transparency. You’ll leave with ready-to-use code and practical implementation strategies.
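
As a flavour of what the coding exercises cover, here is a deliberately naive sketch of the question “What needs to change to get a different outcome?”: train a classifier and randomly search for a nearby input that flips its prediction. The dataset, model, and search strategy are illustrative assumptions; the tutorial covers principled methods and dedicated libraries.

    import numpy as np
    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier

    X, y = make_classification(n_samples=500, n_features=5, random_state=0)
    clf = RandomForestClassifier(random_state=0).fit(X, y)

    x = X[0]
    pred = clf.predict(x.reshape(1, -1))[0]
    target = 1 - pred                                 # the outcome we want instead
    rng = np.random.default_rng(0)

    cf, best_p = x.copy(), clf.predict_proba(x.reshape(1, -1))[0][target]
    for _ in range(500):                              # naive random-search counterfactual
        cand = cf + rng.normal(scale=0.1, size=x.shape)
        p = clf.predict_proba(cand.reshape(1, -1))[0][target]
        if p > best_p:                                # accept small moves toward the target class
            cf, best_p = cand, p

    print("original prediction:      ", pred)
    print("counterfactual prediction:", clf.predict(cf.reshape(1, -1))[0])
    print("required feature changes: ", np.round(cf - x, 2))
    # Real methods additionally penalize distance to x and enforce plausibility.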

Knowledge Prerequisites

  • Intermediate Python programming skills (comfortable with functions, classes, and common data structures)
  • Basic understanding of machine learning concepts (classification, features, training/testing)
  • Familiarity with common libraries like NumPy, Pandas, and Matplotlib
  • Basic knowledge of classification models (particularly decision trees and random forests)
  • Exposure to the concept of model explainability (though expertise is not required)

Technical Setup: Participants should have the following installed prior to the tutorial: Python 3.10+ environment (Anaconda distribution recommended) and Jupyter Notebook or JupyterLab.

Biography:

Oleksii Furman is a Ph.D. student in Artificial Intelligence at Wrocław University of Science and Technology, specializing in generative models and explainable AI (XAI). His research develops innovative approaches to enhance the transparency and interpretability of black-box machine learning models, particularly through counterfactual explanations. He aims to bridge the gap between complex AI systems and human understanding, making machine learning more accessible and trustworthy. He has co-authored papers at prominent machine learning conferences.

Łukasz Lenkiewicz is pursuing his Ph.D. in Artificial Intelligence at Wrocław University of Science and Technology, where he investigates new directions in explainable AI. His doctoral research focuses on developing counterfactual explanations at the local, global, and group-wise level, aiming to capture both individual decisions and broader model behavior. Beyond tabular data, he explores explainability in computer vision, developing techniques that clarify how visual recognition models operate and can be put to use. His research highlights practical integration, focusing on turning theoretical advances in explainability into real-world applications.

Marcel Musiałek is a third-year undergraduate student at Wrocław University of Science and Technology, with a strong interest in computer vision, particularly in the medical domain, and in explainable AI (XAI). He actively develops his research interests through scientific projects and the university’s science club. His focus is on expanding his knowledge and building solid foundations for future education and research.

Maura Pintor photo

Maura Pintor

University of Cagliari

Giorgio Piras photo

Giorgio Piras

University of Cagliari

Tutorial 2: Where ML Security is Broken and How to Fix it

Saturday / 18 October 9:00 - 13:00

Description:

Adversarial robustness is a critical concern for modern machine learning systems, yet reliably evaluating model resilience to adversarial attacks remains a major challenge. In practice, robustness is often measured using gradient-based methods that optimize perturbations to simulate worst-case inputs. These empirical evaluations might provide an inaccurate picture of model security, as small flaws in the attack setup can lead to overly optimistic results. Without systematic testing and diagnostic tools, even well-intentioned evaluations risk repeating past mistakes.

This tutorial will guide participants through the fundamentals of adversarial robustness evaluation, emphasizing the importance of rigorous and reproducible methods. We will present a comparison framework for benchmarking gradient-based attacks that helps identify optimization failures, standardize evaluation protocols, and support fair comparisons across models and datasets. Through practical examples, attendees will learn how to design, test, and debug adversarial attacks in a principled way. Looking forward, we discuss how these challenges extend to the evaluation of foundation models and multimodal systems, where conventional adversarial techniques may be adapted and still produce significant impact. As machine learning models continue to grow in scale and complexity, there is a growing need for robust and efficient evaluation methodologies. We conclude with a discussion of open problems and future research directions aimed at strengthening the reliability and trustworthiness of machine learning under adversarial conditions.
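
For orientation, the simplest gradient-based attack of the kind benchmarked in the tutorial is the one-step FGSM shown below. The toy model, data, and epsilon are placeholders; the tutorial uses trained models and a dedicated comparison framework, and stresses that naive setups like this one are exactly where evaluations go wrong.

    import torch
    import torch.nn as nn

    def fgsm(model, x, y, eps):
        # One gradient step that maximally increases the loss within an L-infinity budget eps
        x = x.clone().detach().requires_grad_(True)
        loss = nn.functional.cross_entropy(model(x), y)
        loss.backward()
        return (x + eps * x.grad.sign()).clamp(0, 1).detach()

    # Toy stand-ins for a trained image classifier and a batch of images in [0, 1]
    model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))
    x = torch.rand(8, 1, 28, 28)
    y = torch.randint(0, 10, (8,))

    x_adv = fgsm(model, x, y, eps=8 / 255)
    clean_acc = (model(x).argmax(1) == y).float().mean()
    robust_acc = (model(x_adv).argmax(1) == y).float().mean()
    print(clean_acc.item(), robust_acc.item())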

Prerequisites: Participants should have a basic understanding of machine learning and deep learning concepts, including neural network training and evaluation. Familiarity with PyTorch is strongly recommended, as examples and hands-on exercises will be based on it. Some background in adversarial machine learning (e.g., knowledge of adversarial examples or gradient-based attacks) is helpful but not strictly required, as essential concepts will be introduced during the tutorial.

Software Requirements: All hands-on exercises will be conducted using Google Colab, so no local installation is necessary. Participants only need a Google account and a modern web browser. We will provide pre-configured Colab notebooks with all dependencies already listed, and they will be installed during the tutorial. Access to a GPU runtime in Colab is recommended for optimal performance but not strictly required.

Biography:

Maura Pintor (PhD 2022, honors) is an Assistant Professor at the PRA Lab, University of Cagliari, Italy. Her research focuses on optimizing and debugging adversarial robustness evaluations. She has held visiting positions at the University of Tuebingen (Germany, 2020), SCCH (Austria, 2021), and the Computer Vision Center (Spain, 2024). Maura serves as Area Chair for NeurIPS, Associate Editor for Pattern Recognition, and regularly reviews for top-tier conferences and journals, including ACM CCS, ICLR, ECCV, and ICCV. She is co-chair of the ACM Workshop on Artificial Intelligence and Security (AISec), co-located with ACM CCS, and contributes to several EU Horizon projects, including ELSA, Sec4AI4Sec, and CoEvolution. She is the main maintainer of the open-source SecML-Torch library.

Giorgio Piras (PhD 2025, honors) is a Postdoctoral Researcher at the PRA Lab, University of Cagliari, Italy. His research interests broadly cover adversarial machine learning, with a particular focus on adversarial pruning methods and LLM security. During his PhD, he was a visiting student at the Karlsruhe Institute of Technology, Germany. He regularly serves as a reviewer for the journals Pattern Recognition, Neurocomputing, and IEEE TIFS, and for the AAAI, USENIX, and ACM CCS conferences. He is now involved, with the University of Cagliari, in the EU Horizon projects Sec4AI4Sec and CoEvolution.

Maciej Żelaszczyk photo

Maciej Żelaszczyk

Samsung AI Center Warsaw

Tutorial 3: [Generative AI] The Noise, the flow, the images.

Saturday / 18 October 9:00 - 13:00

Description:

A brief introduction to flow matching. We will cover the idea and intuition behind flow matching, how it relates to diffusion models, and what training a flow matching model looks like in practice. We will introduce the flow matching setup for continuous data, show how this setup can be modified with the use of optimal transport and close out with a discrete flow matching example.
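
As a taste of the hands-on part, the following is a minimal sketch of conditional flow matching with straight (linear) probability paths on toy 2-D data: a small network is trained to predict the velocity that transports noise to data, and samples are drawn by integrating that velocity field. All sizes and the data distribution are illustrative assumptions.

    import torch
    import torch.nn as nn

    # Velocity network v_theta(x_t, t) for 2-D toy data
    net = nn.Sequential(nn.Linear(3, 64), nn.ReLU(), nn.Linear(64, 2))
    opt = torch.optim.Adam(net.parameters(), lr=1e-3)

    for step in range(1000):
        x1 = torch.randn(256, 2) * 0.5 + 2.0    # stand-in for samples from the data
        x0 = torch.randn(256, 2)                # noise samples
        t = torch.rand(256, 1)
        xt = (1 - t) * x0 + t * x1              # point on the straight path from noise to data
        target_v = x1 - x0                      # velocity of that path
        loss = ((net(torch.cat([xt, t], dim=1)) - target_v) ** 2).mean()
        opt.zero_grad(); loss.backward(); opt.step()

    # Sampling: integrate dx/dt = v_theta(x, t) from t = 0 to t = 1 with Euler steps
    with torch.no_grad():
        x = torch.randn(512, 2)
        for i in range(100):
            t = torch.full((512, 1), i / 100)
            x = x + 0.01 * net(torch.cat([x, t], dim=1))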

Prerequisites: Basic knowledge of linear algebra, probability theory, vector fields, physics, and neural networks. Familiarity with Python, PyTorch, Jupyter Notebook.

Biography:

Maciej is a Senior Research Scientist at the Samsung AI Center Warsaw, where he researches topics related to safety and alignment in neural networks. He holds a Ph.D. in computer science from the Warsaw University of Technology with a dissertation on representation learning. Prior to joining Samsung, he worked in a variety of fields, including a stint as a Software Engineer at Cardinal Cryptography, a Senior Data Scientist position at Yosh.AI focused on recommendation systems, and a Machine Learning Engineer role at Daftcode, where he developed a matchmaking system for mobile games. Apart from that, he also has experience in finance, particularly quantitative investments.

Patryk Wielopolski photo

Patryk Wielopolski

Independent Researcher, AI Safety Poland

Taras Kutsyk photo

Taras Kutsyk

Jagiellonian University, AI Safety Poland

Tutorial 4: From Superposition to Sparse Autoencoders: Understanding Neural Feature Representations

Saturday / 18 October 9:00 - 13:00

Description:

Modern neural networks often learn representations that are difficult to interpret, with individual neurons responding to multiple unrelated features - a phenomenon called polysemanticity. This workshop explores the theoretical foundations and practical implications of how neural networks represent features through the lens of Anthropic’s influential “Toy Models of Superposition” paper. Participants will gain hands-on experience with the fundamental concepts of mechanistic interpretability by building and analyzing simple neural networks that demonstrate superposition - the ability of models to represent more features than they have dimensions. Through interactive exercises attendees will train toy models with varying sparsity levels and observe how networks organize features into geometric structures like antipodal pairs and pentagons when forced to compress high-dimensional feature spaces. The workshop concludes with an exploration of Sparse Autoencoders (SAEs) as a promising solution for disentangling features. Participants will implement and train their own SAEs on the toy models, visualizing how dictionary learning techniques can recover interpretable feature representations from seemingly incomprehensible neural activations. This workshop bridges theory and practice, providing both the mathematical intuition behind superposition phenomena and practical tools for investigating real neural network behavior - essential knowledge for anyone interested in AI safety, interpretability research, or understanding the inner workings of modern ML systems.
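
To preview the final part, here is a minimal sparse autoencoder of the kind participants will train: an overcomplete hidden layer with an L1 penalty that encourages a sparse dictionary of features. The activation data, sizes, and penalty weight are illustrative assumptions.

    import torch
    import torch.nn as nn

    class SparseAutoencoder(nn.Module):
        def __init__(self, d_model=32, d_hidden=256):    # overcomplete: d_hidden >> d_model
            super().__init__()
            self.enc = nn.Linear(d_model, d_hidden)
            self.dec = nn.Linear(d_hidden, d_model)

        def forward(self, x):
            z = torch.relu(self.enc(x))                  # sparse feature activations
            return self.dec(z), z

    sae = SparseAutoencoder()
    opt = torch.optim.Adam(sae.parameters(), lr=1e-3)
    acts = torch.randn(1024, 32)     # stand-in for activations recorded from a toy model
    for _ in range(500):
        recon, z = sae(acts)
        loss = ((recon - acts) ** 2).mean() + 1e-3 * z.abs().mean()   # reconstruction + L1 sparsity
        opt.zero_grad(); loss.backward(); opt.step()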

Prerequisites: Participants should have a basic understanding of deep learning and PyTorch. For this tutorial, we will be using Google Colab, so only a web browser with internet access is required.

Biography:

Patryk Wielopolski is an AI researcher with a Ph.D. in probabilistic modeling and publications at AAAI, ECAI, and TPAMI. He previously served as Solution Innovation Leader at DataWalk, leading the company’s AI research agenda on Knowledge Graphs, Large Language Models, and unstructured data processing. As an active member of Poland’s AI community, he co-founded the genwro.ai research group and contributed to the Polish AI Olympics. At previous ML in PL conferences, he presented on knowledge graphs (2024) and TreeFlow (2023), receiving the Best Contributed Talk Award for the latter. Patryk is currently transitioning his research focus to AI Safety.

Taras Kutsyk is a Ph.D. student specializing in AI Interpretability and Safety. He began his career in machine learning with research on satellite imagery enhancement, before shifting his focus to AI Safety after completing the AI Safety Fundamentals course. Since then, he has completed research internships in mechanistic interpretability of language models, including the MATS program under Neel Nanda and the AI Safety Camp (AISC). His work has led to front-page–promoted blog posts on new insights into Sparse Autoencoders (SAEs). Taras is currently investigating model organisms of misalignment and developing mechanistic approaches to prevent it.

Anna Kozak photo

Anna Kozak

Warsaw University of Technology

Katarzyna Woźnica photo

Katarzyna Woźnica

Warsaw University of Technology, Systems Research Institute, Polish Academy of Sciences

Antoni Zajko photo

Antoni Zajko

Warsaw University of Technology

Tutorial 5: Introduction to Automated Machine Learning (AutoML)

Saturday / 18 October 9:00 - 13:00

Description:

AutoML aims to democratise machine learning by automating key stages of the model development pipeline - making it accessible to non-experts and domain specialists. This tutorial offers a comprehensive overview of the AutoML field, including its motivations, history, and technical foundations. We will explore the major components of an AutoML system: algorithm selection, hyperparameter optimisation, meta-learning, and ensemble methods. The human expert’s role in the AutoML pipeline and challenges such as interpretability and monitoring will also be discussed. The practical component of the tutorial includes hands-on experience with popular open-source AutoML frameworks in Python, such as AutoGluon and MLJAR. Participants will build complete AutoML pipelines, compare tools, and understand their strengths and limitations. This tutorial is ideal for ML practitioners, researchers, and data scientists who want to understand how AutoML works under the hood and how to use it effectively in real-world projects.

Prerequisites: Basic knowledge of machine learning concepts (supervised learning, model evaluation, overfitting). Familiarity with Python programming and standard ML libraries (scikit-learn, pandas). Participants should install the following Python-based AutoML libraries in a virtual environment: auto-gluon, mljar. A Jupyter notebook environment (e.g., Anaconda, JupyterLab, or Google Colab) is recommended for hands-on exercises.
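
As a quick preview of the hands-on part, a typical AutoGluon run fits, tunes, and ensembles many models behind a single call. The dataset and time budget below are illustrative assumptions, not the tutorial's exact exercises.

    from sklearn.datasets import load_breast_cancer
    from autogluon.tabular import TabularPredictor

    data = load_breast_cancer(as_frame=True).frame        # small tabular classification dataset
    train = data.sample(frac=0.8, random_state=0)
    test = data.drop(train.index)

    # One call runs model selection, hyperparameter tuning, and ensembling
    predictor = TabularPredictor(label="target").fit(train, time_limit=120)
    print(predictor.leaderboard(test))
    print(predictor.evaluate(test))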

Biography:

Anna Kozak is a data scientist with over eight years of experience. She conducts research in Automated Machine Learning at the Warsaw University of Technology, and lectures on data visualisation, machine learning, and statistics.

Katarzyna Woźnica holds a PhD in Computer Science with a specialization in Machine Learning. Her work centers on AutoML, hyperparameter optimization, and meta-learning, with a focus on human-in-the-loop methods and Automated Data Science. She has experience applying machine learning methods in medical research and clinical practice.

Antoni Zajko is a PhD student at Warsaw University of Technology with both research and commercial experience in machine learning.

Maciej Draguła photo

Maciej Draguła

Tenstorrent

Artem Yerofieiev photo

Artem Yerofieiev

Tenstorrent

Tutorial 6: Next Generation Hardware for ML - Hands on with Tenstorrent🤘

Saturday / 18 October 9:00 - 13:00

Description:

This workshop offers a guided, hands‑on introduction to running AI workloads on Tenstorrent hardware. Participants will connect to provided environments, experiment with example models, and explore profiling tools. By the end, they’ll have a practical overview of tt-metal and know how to continue developing on Tenstorrent platforms.

This tutorial is free of charge but requires separate registration. Please register here.

Biography:

Maciej Draguła is a Senior Engineer at Tenstorrent, where he develops tt-train, a framework for training large language models on Tenstorrent’s cutting-edge AI hardware, with a focus on optimising operations to reduce training time. Previously, he worked on computer vision applications for avionics at Daedalean, and his experience includes internships at CERN, Google, and Goldman Sachs. He holds a Bachelor’s degree in Computer Science from the University of Wrocław.

Artem Yerofieiev is a Senior Principal at Tenstorrent, where he leads software engineering for TT-NN, the open-source, scale-out-native, tile-based ML framework for Tenstorrent’s next-gen accelerators. He is currently building autonomous AI systems to generate software and to support hardware-software co-design. Previously, he led AR-creation tools at Snap and analytical software in a defense R&D lab.

Tudor Coman photo

Tudor Coman

Adobe

Tutorial 7: Migrating Python AI Prototypes to Cross-Platform Solutions

Saturday / 18 October 14:00 - 18:00

Description:

This tutorial helps active and aspiring developers and engineers understand how to migrate Python-based AI proof-of-concepts into production-ready, cross-platform systems. It focuses on practical strategies for translating NumPy-based computations, model inference, and LLM integrations into Java/Kotlin environments using technologies such as ONNX, Multik, LangChain4j, pgvector and Kotlin coroutines. Learners will gain skills to preserve performance, ensure interoperability, and future-proof AI-driven applications.
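
One concrete step in such a migration is exporting the Python prototype to ONNX so the JVM side can run it with ONNX Runtime. The minimal sketch below covers only the Python half (export plus a sanity check); the model and file name are placeholders.

    import torch
    import torch.nn as nn
    import onnxruntime as ort

    model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2)).eval()
    dummy = torch.randn(1, 4)

    # Export the PyTorch prototype to a portable ONNX graph
    torch.onnx.export(model, dummy, "model.onnx",
                      input_names=["input"], output_names=["output"])

    # Sanity-check the exported graph before handing it to the Java/Kotlin side
    sess = ort.InferenceSession("model.onnx")
    out = sess.run(["output"], {"input": dummy.numpy()})[0]
    print(torch.allclose(torch.from_numpy(out), model(dummy), atol=1e-5))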

Prerequisites:

  • Proficiency in Python and NumPy
  • Familiarity with AI model inference and APIs
  • Some experience with Java (or Kotlin as a bonus)
  • Software installed: Python 3 with an editor of their choice, Java 21+, IntelliJ IDEA Community Edition (preferably), Gradle, Kotlin

Biography:

Tudor Coman is a Machine Learning Engineer at Adobe with more than seven years of experience. His work spans reinforcement learning, natural language processing, and large language models, alongside deep expertise in developing web services and big data infrastructures that support advanced AI use cases. His current focus includes production-level integration of multi-armed bandit algorithms in experimentation use cases, as well as generative content analysis and suggestions for A/B tests.

Mateusz Olko photo

Mateusz Olko

University of Warsaw, IDEAS NCBR

Mateusz Gajewski photo

Mateusz Gajewski

Poznań University of Technology, IDEAS NCBR

Tutorial 8: Modern Causal Discovery

Saturday / 18 October 14:00 - 18:00

Description:

Complex real-world systems, such as the human body, the global climate, or an economy, can often be decomposed into simpler components that interact with one another, typically in sparse and structured relationships. These interaction patterns, known as causal relationships, form the basis for modeling such systems more effectively. The field of causal discovery focuses on recovering these underlying interaction patterns from observational or experimental data and offers a principled solution to this challenge. Contrary to traditional statistical models, causal discovery allows distinguishing correlation from causation, leading to robust conclusions and deepened scientific insight. Causal discovery methods are becoming increasingly central to scientific inquiry across disciplines such as medicine, biology, economics, and climate science, where they help reveal underlying causal mechanisms, inform experimental design, and support more effective decision-making. This tutorial introduces the theoretical foundations and recent advances in causal discovery, with a focus on scalable, modern approaches. We begin with an accessible overview of the theory of causality, emphasizing the distinction between correlation and causation, key assumptions, and common frameworks such as structural causal models and graphical representations. We then review foundational techniques in causal discovery before progressing to state-of-the-art algorithms that integrate deep learning methods for increased efficiency and flexibility. The tutorial concludes with a discussion of current challenges, open research questions, and practical considerations for applying causal discovery in real-world settings. Attendees will gain a broad yet detailed understanding of the field, equipping them to apply causal discovery techniques to scientific and applied problems across domains.
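
The correlation-versus-causation distinction at the heart of the tutorial can be illustrated with a two-variable toy simulation: observationally, X and Y are merely correlated, but interventions reveal the direction of the mechanism. This NumPy sketch is only a warm-up example, not one of the discovery algorithms covered.

    import numpy as np
    rng = np.random.default_rng(0)

    # Ground-truth structural causal model: X -> Y
    n = 100_000
    x = rng.normal(size=n)
    y = 2.0 * x + rng.normal(size=n)

    # Correlation alone is symmetric and cannot tell X -> Y from Y -> X
    print(np.corrcoef(x, y)[0, 1])

    # Interventions break the symmetry: do(Y := 3) leaves X's mechanism untouched,
    # while do(X := 3) shifts Y through the causal mechanism.
    x_after_do_y = rng.normal(size=n)
    y_after_do_x = 2.0 * np.full(n, 3.0) + rng.normal(size=n)
    print(x_after_do_y.mean(), y_after_do_x.mean())   # ~0 vs ~6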

Prerequisites: Participants should have a working knowledge of undergraduate-level calculus and linear algebra. Familiarity with basic probability and statistics is recommended. Basic Python programming experience is expected, including use of standard scientific libraries such as NumPy and SciPy.

Biography:

Mateusz Olko is a doctoral researcher at the University of Warsaw and IDEAS NCBR. He earned both his Bachelor’s and Master’s degrees in Computer Science from the University of Warsaw, specializing in machine learning. He explores topics in causal machine learning and causal discovery, with a particular interest in how they can be connected to deep learning. More broadly, he is interested in computational reasoning and how learning systems can better capture structure and support decision-making. His research has been recognized at top-tier AI conferences, including NeurIPS and ICML.

Mateusz Gajewski is a PhD student in the Intelligent Algorithms and Data Structures Research Group at Poznań University of Technology and IDEAS NCBR. His primary research interests focus on causality, particularly causal discovery and the application of causal methods in small data settings. He is also interested in explainability, especially approaches involving game theory, as well as the intersection between causality and explainability.

Barbara Klaudel photo

Barbara Klaudel

TheLion.AI / Gdańsk University of Technology

Aleksander Obuchowski photo

Aleksander Obuchowski

Medalion Technology / TheLion.AI

Tutorial 9: Creating AI Tools for Healthcare

Saturday / 18 October 14:00 - 18:00

Description:

This tutorial explores the development of AI tools tailored for healthcare applications, drawing from real-world projects like the Eskulap ecosystem—a suite of Polish-language models for medical natural language processing. We will dive into key components: large language models (LLMs) for tasks such as medical question-answering and summarization, text encoders for generating embeddings to support retrieval-augmented systems, image encoders for analyzing medical visuals like scans, and automatic speech recognition (ASR) models for transcribing clinical conversations. Participants will learn Parameter-Efficient Fine-Tuning (PEFT) techniques, including Low-Rank Adaptation (LoRA) to mitigate catastrophic forgetting during continual training, Task-Specific Denoising Autoencoders (TSDAE) for robust representation learning, and Contrastive Learning for improving embeddings through positive-negative pair discrimination. We will also cover data acquisition strategies, emphasizing synthetic data generation to address scarcity in medical datasets, alongside cleaning and augmentation methods from sources like scientific publications and Wikipedia. The tutorial balances theoretical foundations with practical implementations, highlighting challenges like data privacy and model efficiency in healthcare. Attendees will gain hands-on experience building modular AI systems, with examples from ongoing research. This session underscores the importance of AI in advancing equitable healthcare, particularly in underrepresented languages and domains.
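
As a small taste of the PEFT part, the sketch below attaches LoRA adapters to a language model with Hugging Face's peft library, so only a small set of low-rank weights is trained while the base model stays frozen. The gpt2 checkpoint and target modules are placeholders for illustration; the tutorial works with Polish-language medical models instead.

    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, get_peft_model

    # Placeholder base model; in the tutorial this would be a Polish medical LLM
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    # LoRA: freeze base weights and train small low-rank adapters, which helps
    # mitigate catastrophic forgetting during continual/domain fine-tuning
    config = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05,
                        target_modules=["c_attn"])      # attention projection in GPT-2
    model = get_peft_model(model, config)
    model.print_trainable_parameters()   # only a small fraction of parameters is trainable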

Prerequisites: Participants should have a basic understanding of machine learning concepts, including neural networks, supervised/unsupervised learning, and familiarity with Python programming. Prior exposure to deep learning frameworks like PyTorch or Hugging Face Transformers is recommended but not mandatory, as the tutorial will include introductory overviews.

Biography:

Barbara Klaudel is a co-founder of an interdisciplinary research group, TheLion.AI, devoted to creating AI-based open-source solutions for healthcare. She leads the project UMIE, which standardizes medical imaging datasets and releases medical imaging encoders. She works as a chief research officer at Medalion Technology. She lectures at the Gdańsk University of Technology. She was recognized in the Forbes 25 Under 25 list.

Aleksander Obuchowski is a co-founder of an interdisciplinary research group, TheLion.AI, devoted to creating AI-based open-source solutions for healthcare. He leads the project Eskulap, which releases a set of tools for Polish medical NLP (you will learn more about them during our tutorial). He works as a chief technology officer at Medalion Technology. He was recognized in the Forbes 25 Under 25 list.

Tomasz Steifer photo

Tomasz Steifer

Institute of Fundamental Technological Research, Polish Academy of Sciences

Przemysław Andrzej Wałęga photo

Przemysław Andrzej Wałęga

Queen Mary University of London

Tutorial 10: Navigating the landscape of transformers' expressivity

Saturday / 18 October 14:00 - 18:00

Description:

Why do transformer-based models perform well on some tasks and not so well on others? What kinds of problems are easy for transformers, and which are hard or even impossible? In recent years, there has been significant progress in the theoretical study of transformers’ expressivity. But there has also been some misunderstanding. For instance, are transformers really Turing-complete, or are they rather weak, basically unable to recognize more than regular languages? This tutorial will give an overview of basic results in the area. During these introductory lectures, we will try to understand what is really known about transformers, what the open questions are, and, especially, what kinds of simplifying assumptions are made by some theoreticians of transformers.

Prerequisites: We don’t assume any knowledge beyond a basic understanding of linear algebra and some material from first-year undergraduate CS courses. However, the tutorial will generally be easier to follow for people who already have some basic knowledge of the building blocks of the transformer, in particular what an attention layer and a multi-layer perceptron/feedforward neural network are.

Biography:

Tomasz Steifer is an assistant professor (adiunkt) at the Institute of Fundamental Technological Research of the Polish Academy of Sciences and an external collaborator at the Centro Nacional de Inteligencia Artificial (Chile). After obtaining his PhD from the Institute of Computer Science of the Polish Academy of Sciences, he was a postdoc at the Pontificia Universidad Católica de Chile and, through various fellowships, spent time at the Laboratoire Bordelais de Recherche en Informatique, at the University of California, Berkeley, and at the University of Bristol. He works on the theoretical foundations of machine learning and AI, including learning theory and the expressivity of the transformer architecture, as well as on topics in mathematical logic and computational social choice.

Przemysław Wałęga is an Associate Professor (Senior Lecturer) at Queen Mary University of London, in the Centre for Fundamental Computer Science. He was previously a Senior Researcher at the University of Oxford's Department of Computer Science and received his PhD in logic from the Institute of Philosophy at the University of Warsaw. His research is devoted to designing formal logical languages for AI, studying their computational properties, and developing efficient reasoning algorithms for them. He is especially interested in methods for complex reasoning about time, a topic that brings together computer scientists, mathematical logicians, and philosophers.

Mieszko Rutkowski photo

Mieszko Rutkowski

Allegro

Tutorial 11: Machine Translation in the LLM Era: A Practical Guide

Saturday / 18 October 14:00 - 18:00

Description:

In the LLM era, one might think that the problem of Machine Translation is solved. But it’s certainly not, especially at scale and in domain-specific contexts. This tutorial provides a practical overview of modern MT. I will start with core concepts like parallel data, system architectures, and evaluation methods, and then move on to more advanced topics like quality-aware decoding or applications of reinforcement learning to MT. During the hands-on session, we will implement and evaluate state-of-the-art MT techniques on our own!
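
For reference, running an off-the-shelf MT model with the transformers library (listed in the prerequisites) takes only a few lines; the OPUS-MT checkpoint below is just one publicly available example, not the system built during the session.

    from transformers import pipeline

    translator = pipeline("translation", model="Helsinki-NLP/opus-mt-en-pl")
    result = translator("Machine translation is far from solved at scale.")
    print(result[0]["translation_text"])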

Prerequisites: No prior knowledge of Machine Translation is required, but familiarity with basic deep learning concepts is beneficial. The hands-on session will be run in a Jupyter notebook. To actively participate in the tutorial, you need to have Python 3 installed together with the transformers and trl libraries.

Biography:

Mieszko Rutkowski is a Research Engineer at Allegro, where he contributes to developing Allegro Translation Engine. His interests include Machine Translation, especially with reinforcement learning and multimodal methods. Previously he worked on risk management at UBS and on astrophysics at the Jagiellonian University.

This tutorial is free of charge but requires separate registration. Please register here.

Wojtek Czekalski photo

Wojtek Czekalski

CEO at Dialo

Patryk Wojnarowski photo

Patryk Wojnarowski

Dialo

Adam Cholewiński photo

Adam Cholewiński

Dialo

Tutorial 12: End "It Works On My Machine": A Practical Guide to Nix and uv

Saturday / 18 October 14:00 - 18:00

Description:

Welcome to dependency hell; this workshop is your way out. Learn to replace the chaos of pip, apt, and docker with modern tools used by elite firms like Jane Street and Anduril to build reliable and isolated environments. To achieve this, we’ll first go back to basics, teaching you the Linux fundamentals—how PATH, environment variables, and linking really work—that cause these problems in the first place. Armed with that knowledge, you will learn to use Nix to declaratively define and build perfectly self-contained environments from a single file. The result is true isolation, giving you the power to reliably set up any project and switch between conflicting toolchains instantly. Finally, you’ll turbocharge your Python workflows with the uv package manager and see how these principles scale up, from ensuring dev/prod parity to managing entire, reproducible operating systems with NixOS.

This tutorial is free of charge but requires separate registration. Please register here.