ouellette elixir 2017

Reflections on Life Science Data Infrastructures in Canada Rome, March 20, 2017 B.F. Francis Ouellette CSO / VP Scientific Affairs, Génome Québec Montréal, QC, Canada [email protected]

Upload: genome-quebec

Post on 13-Apr-2017


Page 1: Ouellette elixir 2017

Reflections on Life Science Data Infrastructures in Canada

Rome, March 20, 2017
B.F. Francis Ouellette
CSO / VP Scientific Affairs, Génome Québec
Montréal, QC, Canada
[email protected]

Page 2: Ouellette elixir 2017

Disclaimers

• I am an employee of Génome Québec, which is part of the Genome Canada family.
• I do not (and will not) profit in any way, shape or form from any of the brands, products or companies I may mention.
• I am a big proponent of Open Access, Open Source, Open Data and Open Courseware.
• I am on the SAB of many NIH-funded projects (SGD, Galaxy, GenomeSpace, H3ABionet, and HMP2), in addition to Elixir.

Page 3: Ouellette elixir 2017

In Memory of Anna Tramontano

Page 4: Ouellette elixir 2017

https://goo.gl/UMBW8f

Page 5: Ouellette elixir 2017

Outline of my Reflections

• The Parameters: “Made in Canada”
• Building a National Bioinformatics Strategy for Canada
• The Cancer Genome Collaboratory
• Lessons Learned

Page 6: Ouellette elixir 2017

Funding Landscape in Canada

• 4.5 time-zones, 35 million people, bilingual country, 10 provinces, 3 territories https://en.wikipedia.org/wiki/Canada
• Tri-Council:
  • CIHR http://www.cihr-irsc.gc.ca/
  • NSERC http://www.nserc-crsng.gc.ca/
  • SSHRC http://www.sshrc-crsh.gc.ca/
• Genome Canada (GBC-GAlta-GP-OG-GQ-GAtl) https://www.genomecanada.ca/
• CFI / MSI https://www.innovation.ca/awards/major-science-initiatives-fund
• Compute Canada https://www.computecanada.ca/
• CANARIE https://www.canarie.ca/language/
• Networks of Centres of Excellence http://www.nce-rce.gc.ca/
• Many provincial funding bodies, with budgets more or less proportional to their population size.

Page 7: Ouellette elixir 2017
Page 8: Ouellette elixir 2017
Page 9: Ouellette elixir 2017

… and there is also lots of DATA to integrate

• ICGC is in the 10-15 PB scale
• Healthcare is as well
• Biology is more complex than particle physics

Page 10: Ouellette elixir 2017

Why a strategy?

• Genome Canada
• CIHR
• NSERC
• SSHRC

Page 11: Ouellette elixir 2017

The stakeholders need it

• Life science and bioinformatics research communities
• Public and private funding bodies
• Infrastructure providers
  • HPC
  • Ultra high-speed digital networks
• Alliances working to coordinate research data
• The Canadian population

Page 12: Ouellette elixir 2017

Why a strategy?

• Genome Canada
• CIHR
• NSERC
• SSHRC
• CFI
• Compute Canada
• CANARIE
• Universities
• Private sector

Page 13: Ouellette elixir 2017

http://www.elixir-europe.org/

Page 14: Ouellette elixir 2017

18 years in the making

1999 1st Canadian Bioinformatics Workshop delivered

2000 Genome Canada is created

2001 GC & CIHR host 1st Bioinformatics strategic workshop

2003 completion of 1 human genome project

2011 GC & CIHR host 2nd Bioinformatics strategic workshop

2014 GC & CIHR put together a strategic plan working group

2016 Strategic plan working group delivers final document

2017 Plan is integrated into some of the partners’ plans, and made public

Page 15: Ouellette elixir 2017

Small expert steering committee (SESC)

[Process diagram. Box labels, in extracted order: SESC + community workshop; SESC + stakeholder workshop; Larger community consult; SESC + community workshop; Stakeholder & community consult; SESC + community workshop; Finished document (2015); SESC + community workshop; SESC + writer > Strategic Plan]

Page 16: Ouellette elixir 2017

Strategic Objective 1: Networking and Coordination

Page 17: Ouellette elixir 2017

Networking and Coordination

Step 1: Organization of a Canadian B/CB meeting

Step 2: Creation of a Canadian B/CB Society

Step 3: Position the B/CB community for future funding opportunities

Page 18: Ouellette elixir 2017
Page 19: Ouellette elixir 2017

http://www.nce-rce.gc.ca/

Page 20: Ouellette elixir 2017

Strategic Objective 2: Strengthening and Sustaining the B/CB Research Enterprise

Page 21: Ouellette elixir 2017

Strengthening and Sustaining the B/CB Research Enterprise

Step 1: Organization of a workshop to bring together the B/CB researchers funded in the ongoing B/CB-focused initiatives

Step 2: Development of a five-year coordinated plan among funding agencies and infrastructure providers

Step 3: Development of activities/opportunities to support the integration of B/CB professionals and hardware providers into large-scale life sciences projects generating big data and requiring significant data storage and analysis

Page 22: Ouellette elixir 2017

Strategic Objective 3: Building Capacity

Page 23: Ouellette elixir 2017

Building Capacity: Connect, Coordinate and Train

Step 1: The launch of innovative new graduate and postdoctoral training programs

Step 2: Creation of new training opportunities and salary awards embedded in ongoing large-scale projects

Step 3: Development and promotion of new opportunities for undergraduates

Step 4: Support the bioinformatics.ca series

Page 24: Ouellette elixir 2017
Page 25: Ouellette elixir 2017
Page 26: Ouellette elixir 2017

Bioinformatics.ca workshops Content

http://bioinformatics-ca.github.io/

https://goo.gl/CGu13q

Page 27: Ouellette elixir 2017

Cancer Genome Collaboratory

• Making a sustainable infrastructure for cancer genome research.

• A place to compute and collaborate on human cancer genome data in a secure way.

Page 28: Ouellette elixir 2017

https://www.cancercollaboratory.org/

Page 29: Ouellette elixir 2017

Cancer Genome Collaboratory

https://goo.gl/nLlVKf

Page 30: Ouellette elixir 2017

Cancer Genome Collaboratory

https://www.cancercollaboratory.org/

Page 31: Ouellette elixir 2017

Cancer Genome Collaboratory

https://www.cancercollaboratory.org/

Page 32: Ouellette elixir 2017

Cancer Genome Collaboratory

https://www.cancercollaboratory.org/

Page 33: Ouellette elixir 2017

Cancer Genome Collaboratory

https://www.cancercollaboratory.org/about-collaboratory

Page 34: Ouellette elixir 2017

( ICGC …

https://icgc.org/ http://icgc.org/icgc/media

Page 35: Ouellette elixir 2017

( ICGC …

• ICGC to collect:
  • DNA, RNA, methylomes and clinical data from 50 different tumour types
  • 500 tumour/normals per tumour type
  • 25,000 (50,000) genomes
  • 1:10 (whole genome: exome)
  • SNV, CNV, SV, germline
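
The collection targets above can be checked with a line of arithmetic. The sketch below assumes that "25,000 (50,000) genomes" means 25,000 tumour genomes, or 50,000 once each matched normal is counted. That reading and the variable names are assumptions, not something the slide states.

```python
# Assumed reading of the slide: 50 tumour types, 500 tumour/normal pairs each.
TUMOUR_TYPES = 50
PAIRS_PER_TYPE = 500

# One tumour genome per pair.
tumour_genomes = TUMOUR_TYPES * PAIRS_PER_TYPE
# Counting the matched normal as well doubles the total.
all_genomes = 2 * tumour_genomes

print(f"tumour genomes: {tumour_genomes:,}")
print(f"tumour + normal genomes: {all_genomes:,}")
```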

Page 36: Ouellette elixir 2017
Page 37: Ouellette elixir 2017

…ICGC)

https://dcc.icgc.org/

Page 38: Ouellette elixir 2017
Page 39: Ouellette elixir 2017

Deliverables for PCAWG will include:

• 1st PANCANCER analysis on > 2,800 cancer tumours from a WGS perspective
• RNA, SSM, CNV, Methylation analysis & germline
• Published (executable) pipelines
• Docker / Dockstore
• Multiple cloud access to data
• Multiple portal access to data
• Many papers (being written & submitted now!)

Page 40: Ouellette elixir 2017

PCAWG

Page 41: Ouellette elixir 2017

Cancer Genome Collaboratory

Page 42: Ouellette elixir 2017

Vincent Ferretti https://goo.gl/cSW7bD

Page 43: Ouellette elixir 2017

Vincent Ferretti https://goo.gl/cSW7bD

Page 44: Ouellette elixir 2017
Page 45: Ouellette elixir 2017

http://dockstore.org

Page 46: Ouellette elixir 2017

PCAWG pipelines on Dockstore

Page 47: Ouellette elixir 2017

Training available: https://goo.gl/tGmSYW

Page 48: Ouellette elixir 2017

Lessons Learned (1/2)

• Be patient
• You need to publish your “stuff” (“to make publicly or generally known” https://goo.gl/SgSV6R).
• Publish your tools, SOPs, workflows, pipelines.
• Virtualization of services, tools and resources
• Shared APIs
• Good infrastructure is critical, but good data even more so.

Page 49: Ouellette elixir 2017

Lessons Learned (2/2)

• Important to establish great tools and databases, but even more important to maintain them long term.
• Lack of funding in Canada for maintenance of a resource (database) and the maintenance of a tool (service).
• Training is critical, and you cannot have enough of it. We all need to do it (every country, every language).
• Long term support
• Do all this, and then tweet about it!

Page 51: Ouellette elixir 2017

Acknowledgements

B/CB Advisory Committee

• Gary Bader, U of Toronto
• Robert Beiko, Dalhousie U.
• Guillaume Bourque, McGill U.
• Fiona Brinkman, SFU
• Michael Brudno, U of Toronto
• Liz Conibear, UBC
• Bill Crosby, U Windsor
• Mark Dietrich, Compute Canada
• Francis Ouellette, OICR
• Peter Wilenius, CANARIE

Cancer Genome Collaboratory

• Lincoln Stein, U of Toronto
• Guillaume Bourque, McGill U.
• Paul Boutros, U of Toronto
• Khaled el Emam, U of Ottawa
• Vincent Ferretti, OICR
• Bartha Knoppers, McGill U.
• Francis Ouellette, U of Toronto
• Cenk Sahinalp, SFU
• Sohrab Shah, UBC

https://goo.gl/3wsGui