Data Management Header Image

 

Data Management
データ管理

Harness Data to Accelerate Discoveries and Advancements in Life Sciences
データを活用し、ライフサイエンスにおける発見と進歩を加速

2025年4月2日〜4日 米国東部標準時(EDT)

ライフサイエンス研究者の間で計算能力に対する需要が高まるにつれ、膨大で多様なデータセットの管理がますます困難になっています。何十億ものデータポイントやファイルを効率的に処理するには、スケーラブルなデータストレージインフラが不可欠です。データの統合、アクセシビリティ、共有、リンク、分析、継続的なメンテナンスなど、データの効果的な管理が重要です。このトラックでは、データメッシュやデータファブリックなどの新興テクノロジーが、これらの課題にどのように対処しているかについて掘り下げ、複雑なデータセットから実用的なインサイトを引き出す戦略を探ります。主なトピックには、FAIR原則、データ再利用、ガバナンス、リテラシー、データフェデレーション、キュレーション、ハーモナイゼーションなどがあります。さらに、堅牢なデータ基盤がAIのデータ準備をどのようにサポートし、人工知能と機械学習モデルの効果的な展開を保証するかについても検討します。これらの革新的なアプローチが、データ管理を強化し、ライフサイエンスの重要な進歩を促進する方法をご覧ください。

4月2日(水)

Registration Open8:00 am

Recommended Pre-Conference Workshops and Symposia*9:00 am

On Wednesday, April 2, 2025, Cambridge Healthtech Institute is pleased to offer five pre-conference Workshops scheduled across two time slots (9:00 am–12:00 pm and 1:15–4:15 pm) and three Symposia from 9:00 am–4:20 pm. All are designed to be instructional, interactive, and provide in-depth information on a specific topic. They allow for one-on-one interaction and provide a great way to explain more technical aspects that would otherwise not be covered during the main conference tracks that take place Thursday–Friday.

*Separate registration required. See details on the Symposia here and details on the Workshops here.

4:40 pm

Organizer's Remarks

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

4:45 pm Talk Title to be Announced

Speaker to be Announced, CLOVERTEX

4:55 pm PLENARY KEYNOTE PRESENTATION:

From Bytes to Breakthroughs: Generative AI Driving the Future of Life Sciences and Healthcare

Sofia Guerra, Vice President, Bessemer Venture Partners

Subha Madhaven, Vice President and Head, AI/ML, Quantitative and Digital Sciences, Global Metrics and Data Management, Pfizer Inc.

Generative AI has the potential to transform life sciences and deliver unprecedented insights, automation, and efficiency. But is it? This keynote panel brings together leaders from biopharma, healthcare, and emerging tech who are leveraging AI to advance drug discovery, diagnostics, and patient care. Panelists will share their own case studies and real-world applications and discuss how they’ve tackled challenges—both technical and cultural. Look beyond the hype curve to see how this technology is really being used now and where the next opportunities lie.

Welcome Reception in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)6:10 pm

The Bio-IT Kickoff Reception is a reunion—reconnect with friends, explore cutting-edge research, and celebrate innovation! Enjoy poster presentations, networking, and vote for the Best of Show and Poster awards.

Close of Day7:25 pm

4月3日(木)

Registration Open7:00 am

8:00 am

Organizer's Remarks

Allison Proffitt, Editorial Director, Bio-IT World and Clinical Research News

8:05 am Talk Title to be Announced

Speaker to be Announced, Snowflake Computing Inc

AI-POWERED PLATFORMS IN DRUG DISCOVERY: TACKLING ANTIBIOTIC RESISTANCE AND AGING THERAPEUTICS
創薬におけるAI活用のプラットフォーム:抗生剤耐性と老化治療への取り組み

8:15 am PLENARY KEYNOTE PRESENTATION:

Deep Learning for Antibiotic Discovery

James J. Collins, PhD, Termeer Professor, Medical Engineering & Science, Massachusetts Institute of Technology

In this presentation, we highlight the Antibiotics-AI Project, which is a multi-disciplinary, innovative research program that is leveraging MIT's strengths in artificial intelligence, bioengineering, and the life sciences to discover and design novel classes of antibiotics. The Antibiotics-AI Project is focused on developing, integrating, and implementing deep learning models and chemogenomic screening approaches: (1) to predict novel antibiotics from expansive chemical libraries with diverse properties, (2) to design de novo novel antibiotics based on learned structural and functional properties of existing and newly discovered antibiotics, and (3) to identify, using explainable deep learning models, the chemical structures and molecular mechanisms underlying the newly discovered and/or designed antibiotics. With these deep learning approaches, we are utilizing multi-scale computation to embrace and harness the complexity of biology and chemistry, so as to discover, design, and develop new classes of antibiotics, up through preclinical studies. Our platform has been designed so that it can be utilized and applied in a rapid fashion to emerging and re-emerging bacterial pathogens, including multidrug-resistant (MDR) bacteria and extensively drug-resistant (XDR) bacteria.

8:45 am PLENARY KEYNOTE PRESENTATION:

Generative AI, Aging Research and Robotics as a Platform for Drug Discovery: From Hype to Clinical Efficacy

Alex Zhavoronkov, PhD, Founder & CEO, Insilico Medicine

9:15 amSession Q&A

Coffee Break in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)9:30 am

Start your morning with coffee, connections, and cutting-edge research! Enjoy poster presentations, network in the Exhibit Hall, vote for awards, and a chance at a fabulous raffle prize!

Organizer's Welcome Remarks10:15 am

TRANSFORMING DATA MANAGEMENT IN LIFE SCIENCES: STRATEGIES FOR INNOVATION AND COMPLIANCE
ライフサイエンスにおけるデータ管理の変革:イノベーションとコンプライアンス戦略

Chairperson's Remarks (Sponsorship Opportunity Available)10:20 am

10:25 am

From Documents to Data: Collaborative Strategies for Data-Centric Submissions

Sophie Bailes, PhD, R&D IT Principal Business Partner, AstraZeneca

As the pharmaceutical industry embraces data-driven CMC submissions, AstraZeneca pioneers collaboration between science and IT. Recognizing the pivotal role of data, the industry lacks standardized practices for generating FAIR at source data. Regulatory authorities are increasingly adopting in silico modeling and machine learning for evaluations, paving the way for faster drug approvals and improved patient access. This talk provides an illustrative example of AstraZeneca’s integrated FAIR Data Hub, which automates data practices for advanced analytics and regulatory compliance. This partnership between science and technology addresses industry challenges of data complexity while expediting processes to enhance patient access to new medicines.

10:45 am

Building an Approachable Cost-Effective Data Management Platform

Kory Draughn, Chief Technologist, iRODS Consortium, RENCI Renaissance Computing Institute

Long-term data management is best executed when policies are clear and infrastructure is abstracted and swappable. iRODS has a desire to be normal and boring for the administrator and approachable and powerful for the user. This talk will cover recent advances and interfaces which allow companies to sustain FAIR data practices, enforce consistency and reproducibility, and realize cost-savings through open-source software.

11:05 am

Implementation of NextGen ELN and ELN Data Products

Amrik Mahal, PhD, Global Head, IT Research, AstraZeneca

Discover the immediate benefits of implementing a NextGen ELN, including unlocking valuable legacy ELN data and enhancing research capabilities. This talk will explore AstraZeneca's large-scale deployment to over 3,500 scientists, sharing best practices and lessons learned. Attendees will gain insights into how NextGen ELN integration accelerates decision-making by connecting research communities and creating data products that drive scientific innovation and discovery.

11:25 am

Enabling R&D and Its AI Ambitions through Data Products

Kiran Kodali, MBA, Head of R&D Data Strategy & Governance & Data Foundations, Sanofi

As Sanofi strives to become the first biopharma company powered by AI at scale, data is essential to realizing this vision. The data capabilities being developed are critical for implementing AI and GenAI solutions, providing actionable insights to decision-makers across various organizational levels, and driving transformative advancements in life sciences. Join me to explore best practices, strategies, and real-world use cases that navigate data management complexities, foster collaboration, and unlock value from data assets.

11:45 amSession Q&A with Speakers
11:55 am

Harnessing AI to Identify Causal Relationships and Enhance Research and Scientific Validation in Pharma

Peter Doerr, Director, Presales, metaphacts

This talk discusses how AI methods can help find gaps between curated knowledge in knowledge graphs and unstructured knowledge in scientific texts. We provide examples of how databases like OpenTargets can be enriched by using AI to identify causal relationships in scientific documents. With Knowledge Graph technology, these relationships are used to augment existing databases, allowing users to compare, spot gaps, and, crucially, find the relevant literature to ensure scientific validation.

12:10 pm

Improving FAIRness of Omics Data through Metadata Harmonization

Sehyun Oh, Assistant Professor, Research Foundation, City University of New York

National efforts have established comprehensive biological data repositories, but cross-study analysis is limited by heterogeneous metadata. This lack of harmonization impedes the findability and AI/ML application of high-throughput omics data. The OmicsMLRepo project harmonizes metadata through schema consolidation and ontology incorporation, improving the FAIRness and AI/ML-readiness of metagenomics and cancer genomics datasets through R/Bioconductor packages for researchers.

12:25 pm Talk Title to be Announced

Speaker to be Announced, Glencoe Software Inc

Presentation to be Announced (Sponsorship Opportunity Available)12:40 pm

Session Break and Transition to Lunch12:55 pm

1:05 pm Talk Title to be Announced

Speaker to be Announced, DNAnexus, Inc.

Refreshment Break in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)1:35 pm

Bio-IT's hall is bigger than ever—one break won’t cut it! Enjoy dessert and coffee after lunch, explore booths and posters, vote for awards, and participate in our raffle for a chance to win a prize!

STREAMLINING MULTI-MODAL DATA INTEGRATION IN RESEARCH: INNOVATIONS IN LIMS AND DISCOVERY DATA MANAGEMENT
研究におけるマルチモーダルなデータ統合の合理化:LIMSと発見データ管理におけるイノベーション

Chairperson's Remarks (Sponsorship Opportunity Available)2:25 pm

2:30 pm Reimagining Data Commons, Lakes, and Warehouses in Life Sciences

Speaker to be Announced, BioTeam, Inc.

Scientific organizations have long needed a space to house different types of research data and manage interoperability and accessibility across an organization. With the rise of Al/ML, it is more imperative than ever that data be organized and annotated in a way that allows it to be FAIR (Findable, Accessible, Interoperable, and Reusable). Join us as we discuss key principles around designing a robust, scalable system to meet these growing needs and a new vision for scientific data platforms.

3:00 pm

OligoLake: A Fast and Efficient Solution for Integrating Multi-Modal Research Data

Alexander Wyss, Data Engineer, Roche

FAIRifying discovery data is a highly complex and very time-consuming endeavor posing significant challenges across pharma. We have developed the OligoLake to integrate multimodal data from oligonucleotide discovery projects utilizing various existing pRED systems for compound and project registration, in vitro assays, in vivo studies, in silico predictions, and high-dimensional omics. We employed a minimalistic approach combining efficient cloud storage (AWS S3), transparent data modeling and documentation (DBT), easy-to-maintain orchestration (GitLab CI), and a light-weight database solution (DuckDB) offering both Python and R packages as well as a GUI for easy and user-friendly interaction with the harmonized data. As a result, within several weeks we were able to successfully develop an MVP which proved reusable and scalable for other data products at pRED.

3:30 pm

Configuration-Driven LIMS Management: A Code-First Approach for Adapting to Dynamic Lab Workflows

Nirmit Damania, Senior Software Engineer, Dyno Therapeutics

Staying ahead of a constantly changing landscape of scientific process and experimentation is a challenge for any ELN/LIMS engineer. This is compounded by the increasing demand for the integration of LIMS data with downstream data systems. Learn how to stay ahead of change by applying best practices from Data Engineering and software CI/CD to your LIMS system. This talk walks you through the practical steps to achieve a single source of truth for schema configurations synchronized across multiple LIMS tenants. We will explore the decisions we made to develop a clear change management process and tooling landscape that ensures timely and accurate access to laboratory data integrated across the company’s data platform.

4:00 pm Talk Title to be Announced

Speaker to be Announced, Quilt Data Inc

Best of Show Awards Reception in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)4:30 pm

Unwind with colleagues at our lively reception! Explore posters, vote for the best, network with exhibitors, enjoy a drink, and try to win a raffle prize. Celebrate Best of Show winners!

Close of Day5:45 pm

4月4日(金)

Registration Open7:00 am

Quick Bytes & Networking Breakfast—Lifted Rooftop Restaurant & Bar (Sponsorship Opportunity Available)7:00 am

Start your morning with ‘Quick Bytes & Networking’! Enjoy a cozy restaurant-style setting, quick bites, and speed networking. Connect, converse, and energize your Bio-IT experience before the plenary keynote!

8:00 am

Organizer's Remarks

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

8:05 am

Innovative Practices Awards: Excellence in Technological Innovation

Allison Proffitt, Editorial Director, Bio-IT World and Clinical Research News

The Innovative Practices Awards recognizes and celebrates technology innovation in the life sciences. Bio-IT World is currently accepting entries for the 2025 Innovative Practices Awards, a competition designed to recognize partnerships and projects pushing our industry forward. Winners will be announced in mid-March 2025, recognized during the Friday, April 4 Plenary Keynote Program, and scheduled to give a podium presentation about their project during the conference. For more details about the Awards and to submit an application, visit www.bioitworldexpo.com/innovativepractices.

8:20 am Talk Title to be Announced

Speaker to be Announced, Illumina

ADVANCING DRUG DISCOVERY AND HEALTHCARE THROUGH DATA-DRIVEN INNOVATION: FROM GENOMICS TO THERAPEUTICS
データドリブンのイノベーションによる創薬とヘルスケアの進歩:ゲノミクスから治療まで

8:30 am PLENARY KEYNOTE PRESENTATION:

Scaling Genomic Medicine: Transforming Newborn Screening through Informatics and Innovation

Robert C. Green, MD, MPH, Professor and Director of Genomes2People Research, Mass General Brigham, Broad Institute, Ariadne Labs and Harvard Medical School

The BabySeq Project has pioneered the integration of genomic sequencing into newborn and childhood screening, uncovering unexpected risk variants and transforming healthcare delivery. This keynote explores the groundbreaking progress in genomic medicine, featuring real-world stories of families impacted by these discoveries. Learn about the informatics challenges and innovative solutions required to scale genomic screening for national and global implementation, reshaping the future of precision medicine.

9:00 am PLENARY KEYNOTE PRESENTATION:

Unlocking the Power of Machine Learning and Data-at-Scale to Deliver with Speed the Best Therapeutic Candidates

Justin M. Scheer, PhD, Vice President In Silico Discovery & Head, Molecular Computational Team, Johnson & Johnson Innovative Medicine

The challenges of high costs, lengthy timelines, and significant attrition have prompted our industry to integrate AI/ML into all aspects of the business. This presentation highlights J&J's strategic investments in AI/ML technologies to enhance the drug discovery processes, including molecule design and optimization. By investing in these technologies with a modality agnostic approach, J&J aims to tackle the hardest targets in drug discovery, ultimately increasing the success rate of delivering better molecules faster.

9:30 amSession Q&A

Coffee Break in the Exhibit Hall with Poster Competition Winners Announced (Sponsorship Opportunity Available)9:45 am

Bio-IT is all about connections! Explore booths, award-winning posters, and network with clients, colleagues, and exhibitors. Grab coffee, build relationships, and stay for a chance to win a raffle prize!

Organizer's Remarks10:30 am

BEST PRACTICES IN TECHNOLOGY INNOVATION
テクノロジーイノベーションにおけるベストプラクティス

10:35 am

Chairperson's Remarks

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

10:40 am

Innovative Practices Awards: Excellence in Technological Innovation

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

Since 2003, Bio-IT World has hosted an elite awards program with the goal of highlighting outstanding examples of how technology innovations and strategic initiatives are being applied to advance life sciences research. Winners of the 2025 Bio-IT World Innovative Practices Awards, recognized during the morning plenary keynote session, will give podium presentations during this session. For more details about the Awards and to submit an application, visit www.bioitworldexpo.com/innovativepractices.

Presentation to be Announced (Sponsorship Opportunity Available)12:10 pm

Session Break and Transition to Lunch1:10 pm

1:20 pm Talk Title to be Announced

Speaker to be Announced, ZONTAL Inc

Refreshment Break in the Exhibit Hall with Last Chance for Poster Viewing (Sponsorship Opportunity Available)1:50 pm

Feeling tired? Recharge during the final Networking Exhibit Hall break! Visit booths, explore posters, connect with peers, and turn in your Game Cards for a chance to win a raffle prize.

Chairperson's Remarks (Sponsorship Opportunity Available)2:30 pm

TRENDS FROM THE TRENCHES: BRIDGING TRADITIONAL INSIGHTS WITH INNOVATIVE ADVANCEMENTS
現場のトレンド:従来の見識と革新的な進歩の橋渡し

2:35 pm

Transforming Big Data into Actionable Insights: Leveraging the Sequence Read Archive (SRA) for Life Sciences and Public Health

J. Rodney Brister, PhD, Acting Program Head, Sequence Read Archive, NCBI, NLM, NIH

As the world's largest publicly available repository of raw sequence data, the Sequence Read Archive (SRA) plays a pivotal role in advancing public health and life sciences research. This presentation highlights state-of-the-art tools and strategies for managing and analyzing the SRA’s massive datasets, showcasing its impact on infectious disease surveillance, genomic epidemiology, and precision medicine. Discover how innovative informatics solutions are transforming raw data into actionable insights for global health challenges.

3:05 pm

Refactoring Research Systems with AI: Modernizing Code for Life Sciences Innovation

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute; Additional Speakers to be Confirmed

With AI taking center stage in coding, research teams are leveraging AI-enabled IDEs to refactor legacy systems into modern, efficient languages. This talk presents a balanced view of generative AI-supported coding, cutting through the hype to focus on real-world applications. Discover practical strategies and examples of how AI is transforming outdated research systems into scalable, maintainable, and future-ready platforms for life sciences.

3:35 pm

Trends from the Trenches

Ari E. Berman, PhD, CEO, BioTeam, Inc.

Since 2010, “Trends from the Trenches” has been a cornerstone of the Bio-IT program, delivering candid and occasionally blunt assessments of the most impactful and overhyped IT technologies in life sciences. This talk will provide a deep dive into computing, storage, cloud, data science, machine learning, and more, with a focus on supporting data-intensive science. Looking ahead, this talk will share forward-thinking predictions about emerging technologies and trends poised to shape the future of life sciences innovation, offering actionable insights for navigating the next wave of IT evolution.

Close of Conference4:05 pm

* 不測の事態により、事前の予告なしにプログラムが変更される場合があります。

Choose your language
Traditional Chinese
Simplified Chinese
Korean
English


会議の詳細はこちらをご参照ください