Career paths and projections in Biocuration: Panel discussion from the Biocuration2021 virtual conference

By Nicole Vasilevsky and Sabrina Toro

The ISB hosted the second session for the Virtual Biocuration Conference on June 15, 2021. The session, chaired by Peter Uetz, Ph.D. from the Virginia Commonwealth University, focused on career paths and projections in Biocuration and hosted three panelists: Pankaj Jaiswal, Ph.D, Professor in Plant Genomics at Oregon State University (OSU) in Corvallis, Oregon; Tanya Berardini, Ph.D co-founder and Chief Scientific Officer at Phoenix Bioinformatics in Newark, California; and Nicola Mulder, Ph.D, Professor of Computational Biology at the University of Cape Town in South Africa. The session recording is available here.

Panelist paths in Biocuration

Dr. Tanya Berardini entered the biocuration field after completing a Ph.D. and a post-doc when she joined the Arabidopsis Information Resource (TAIR) as a curator. When TAIR underwent a funding crisis after many years of serving the plant genome community, Dr. Berardini and her colleagues founded the non-profit Phoenix Bioinformatics which developed a sustainable model to support the TAIR database through subscriptions and has subsequently expanded into assisting other databases and resources to address funding issues, through subscription and membership models. Dr. Beradini’s career path is unique, as she initially performed database curation for a single resource, TAIR, and now also works in an entrepreneurial position. She has learned various aspects about running a business (such as Human Resources, insurance requirements, contract negotiation), as well as curation in additional domains outside of plant biology. Dr. Beradini noted that her detailed-oriented curation skills and experience with databases were very transferable to the business world

Dr. Pankaj Jaiswal’s work on sequencing plant molecules (his initial training was in biochemistry and plant molecular biology) prompted his interest in bioinformatics analyses and genome biology curation. He currently runs a wet lab (“on the bench”) and a dry lab (“at the computer”) at OSU in the Comparative Plant Genomics department. Dr. Jaiswal leads the curation efforts for the Gramene database and the Planteome projects, which require the creation of ontologies for the standardization of plant characteristics such as gene function, phenotypes, pathways, and gene expression. Dr. Jaiswal started curating during his basic science training as he read papers learned about specific subjects and synthesized information to address biological questions. His efforts to facilitate the synthesis of information and ease of interpretation, search, and access, included networking with peers, including Gene Ontology and Model Organism Database curators, and brought him to the field of biocuration.  Dr. Jaiswal currently trains his students, post-docs, and researchers to apply data standards and learn the curation process to build upon the foundations laid by the biocuration community.
Dr. Nicola Mulder holds a Ph.D. from the University of Cape Town in South Africa, where she did basic science research and studied molecular biology of infectious diseases, which ultimately led her to bioinformatics. She became a curator at European Bioinformatics Institute (EBI),  first at SwissProt, then as part of the InterPro project, which she went on to lead. Dr. Mulder currently leads the Pan African Bioinformatics Network for the Human Heredity and Health in Africa (H3Africa) in Cape Town, which supports bioinformatics and genomic analysis in Africa. Her team brought together a global community of experts, including clinicians, biocurators, and ontologists, which led to the development of the Sickle Cell Disease Ontology (SCDO) in response to the need to standardize information around Sickle Cell Disease, and the Hearing Impairment Ontology. Dr. Mulder and her team’s curation efforts include standardizing phenotype data for research cohorts and curating genomic data for African relevance, such as curating single nucleotide polymorphism (SNPs) from African populations and curating diseases that are relevant to Africans.

Becoming a Biocurator

The field of biocuration is still relatively new and small; colleges and universities do not typically offer a degree in biocuration. Therefore, the path to becoming a biocurator rarely follows a straightforward trajectory like many other fields, as many biocurators are subject matter experts in various subdomains of biology who completed a Ph.D. in a biological area or have a background in some aspect of computer science or semantic technologies, and have an interest in standardizing data. Our panelists shared some suggestions for those interested in joining the field:

  • Draw on your area of expertise: Most databases focus on specific subject areas and expert community contributions (such as contribution to open biomedical ontologies, and all of the OBO Foundry ontologies) are always needed, welcomed, and greatly appreciated. If you notice missing information or content in a database, reach out and share your knowledge.  
  • As a researcher, curate your data before it is published: Work with the databases to make sure your data is prepared in a proper format for completeness and efficiency before you publish. Dr. Berardini mentioned that over 10,000 labs work on Arabidopsis, creating a massive backlog of papers to curate. Structuring data before and at the time of publication dramatically assists with the curation process.
  • Volunteer at databases: If you have expertise in a particular field, contact the databases directly to discuss opportunities to contribute. Volunteering can be beneficial to build your experience, provide contributions to biocuration efforts, and provide networking opportunities within the community. In addition, volunteering can reveal whether the field is right for you.  Biocuration requires a particular personality, including attention to detail and a desire to organize. While some people derive extreme satisfaction from it, others can find it quite tedious. Dr. Berardini noted, “if through volunteering, you find biocuration brings you joy, this is the right career for you.”
  • Participate in hackathons, data jamborees, biomedical competitions: these events bring together researchers across various career stages, from junior biologists to practicing clinicians, and are opportunities to network, build your CV, and contribute to impactful work. Examples are biomedical competitions like Dream Challenges, and hackathons, data jamborees, face-to-face meetings, and online events hosted by Dr. Mulder to facilitate community curation of H3Africa projects. 
  • Do as much training as you can:  Courses are available, such as massively open online courses (MOOCs), college courses, and the newer Post-Graduate Certificate in Biocuration offered by the University of Cambridge.
  • Build your skill set: Search for job advertisements to determine what qualifications are needed, and work towards enhancing your skill set and competencies that meet job requirements. As an outcome of the Careers in Biocuration Workshop at the Biocuration 2018 conference, we created a generic position description for a biocuration profession, which is available here.

Biocuration career opportunities

A lot of opportunities exist in the biocuration field: biocuration in academia, which may entail biocuration for grant-funded database projects and ontology development, such as the work of Dr. Jaiswal; community-based bioinformatics and curation projects, such as those led by Dr. Mulder; and biocuration in a non-profit business setting, as Dr. Berardini’s work at Phoenix Bioinformatics. Biocuration opportunities are also available in the industry as companies are recognizing the importance of curating and standardizing data (for example, standardizing clinical trial data),  in government agencies; and even as independent consultants

The skills gained as biocurators, such as attention to detail, the ability to take in and synthesize data, and computational skills, are very valuable and can be translated to different areas, such as other areas of science or technologies.

Biocuration is a growing field and we anticipate that, as the amount of biological data being generated increases, so will the demand for curators. The ISB aims to promote the field and support our community through offering dissemination of job openings (see regular posts on our website here.), training opportunities, and networking. The ISB also promotes collaborations and exchanges between biocuration groups and offers funding for exchange fellowships. This fellowship will fund members to visit another laboratory or organization for training or knowledge sharing; more information is available here.

Researchers have the opportunity to better structure their datasets, share their data in repositories, and better structure the content that they publish, however, they are often unaware of the career opportunities in biocuration. We have not only an opportunity to promote the biocuration field, but also the responsibility to train the future generations, provide knowledge transfer, and have succession plans for those coming up after us. 

The Future of Biocuration: Panel discussion from the Biocuration2021 virtual conference

By: Nicole Vasilevsky and Jane Lomax

Like all in-person gatherings in this past year, the annual International Society for Biocuration conference went virtual in 2021. At the inaugural session on April 13, 2021, a group of panelists discussed ‘the future of biocuration’. The panel was moderated by Rama Balakrishnan, who has served on the ISB Executive Committee since 2017, and is the co-chair (along with Susan Bello from the Jackson Laboratory) of the Biocuration2021 conference. Rama was joined by four panelists from various roles in academia and industry to discuss what is in store for our community. The recording is available here.

What is curation: Distilling knowledge from information

Rama initiated the discussion with the fundamental and relevant question, ‘what does the word curation mean to you?’ Working in the biocuration field, many curators can probably relate to this question, a question that is frequently asked by people who are outside this field. The role of a curator at a museum, for example, may be more familiar, but biocuration is a less well-understood field. Rama, who has held varying roles as a curator (academic and industry), tried to get after how the actual task of curation may differ amongst us. Sandra Orchard, from EBI shared a classical definition of ‘turning unstructured data into structured searchable data’, but recognized this is not always true as, whilst some curation tasks involve making data more structured, text-minable and machine-readable, the outcome of data curation does not always result in completely structured data. Carol Bult from MGI defined curation as “applying semantic standards to ensure data findability and aggregation.” 

Coming from the industry perspective, both Kambiz Karimi (Myriad Women’s Health) and James Malone (SciBite) agreed. Curation involved meaning-based capture and structuring of content using controlled vocabularies. Data curation can also include data cleaning, which is often a pre-curation task. Curation can help improve and enrich data interpretability and ultimately add value. It allows for enhanced search, querying, semantic integration and meta-analysis. 

How can we ensure quality?

Given that the panelists all agreed on a high level definition of curation, Rama then asked about ensuring data quality. What does good quality mean and what are metrics to assess quality? Different quality control (QC) and quality assurance (QA) processes apply, depending on the type of curation that is being done, whether you are curating tax forms (as James did in a summer job long ago) or curating the mouse biology literature. Some processes that were discussed by Carol and others  included intercurator checks, crowdsourcing feedback from downstream users, practices to ensure collaboration, regression testing to ensure continuity and consistency across datasets. Sandra pointed out that curators cannot be all things to everything, and stressed the importance of specialist databases with curators who are domain experts who can take the first pass at the curation, and build re-processing pipelines or scoring mechanisms to export high quality subsets to other data resources.

James and Rama noted how detecting outliers can assist with quality checks. However, it may not always be easy to detect the outliers without the expert knowledge in a specific area. For example, Rama curates patient data at Genentech, and once came across a data reporting a patient had a 100℃ fever (rather than 100℉), which was easy to spot as an error. However, in a more complicated clinical use case, detecting erroneous data points may not be so obvious and require more specialized knowledge.

Kambiz shared that Myriad has several QC approaches, including a peer review process, a spot checking program to have curators spot check each other’s work and a quality check process that compares their classification to previous classifications from the community. 

Sandra also noted the importance of researchers collaborating with curators prior to publication. She shared an anecdote where an author published a paper with an erroneous dataset, a simple mistake where a row in a spreadsheet had been accidentally deleted, causing nonsensical results. The curator picked this up and contacted the author, who was able to correct it, but this speaks to the importance of pre-submitting data to the database before publication and the important role a curator can play with the research community. 

Opportunities with Machine Learning and Automation 

While a lot of biocuration is done manually, more and more processes and workflow are being automated, with text mining, machine learning (ML), natural language processing (NLP) and AI.  The panel was asked their opinion on how AI and ML will affect the work of biocurators? Sandra assured us that machine learning will enhance our work, but is not concerned that it will replace human curation. Data is too messy, the literature is too unstructured, and human review and curation is going to be needed in the foreseeable future. James echoed her sentiments in saying, “[Machine Learning] will become an assistant, it will not replace subject matter experts who are biologists, scientists, curators. It will play a role in helping us.” James sees it as an opportunity for biocuration, where we should work to exploit advances in deep learning, noting the importance of biocuration is more pronounced now than ever. We can train AI to aid in biocuration and we can work together. In addition, quality Machine Learning/AI requires training sets that have been human-curated, and the advances of these technologies will require more curators; this is a new opportunity for this community. Carol agreed, but brought up the point that there may be the perception that these technologies are advanced to the point where curators can be replaced. This is causing challenges with funding for biocuration due to the notion that machine learning can do all or most of what human curators do. While machine learning can assist with making biocuration scalable, we need to do better as a community at communicating how these things interrelate and feed off each other.

“Biocuration has never been more valuable than it is now and yet under appreciated.” It’s something the Society can help us tackle: this perception and articulate how manual and machine learning biocuration can go hand and hand. – Carol Bult

Approaching authors

An audience member inquired whether database curators approached authors for clarification about their published data, and whether authors were responsive. Kambiz shared that they did approach authors when there was ambiguity with the content or data in an article. Sandra concurred, and alluded to the challenge with time dependencies; if a paper was recently published (1 year – 18 months ago), they frequently got a response. If a paper is over 3 years old, in general, they were less likely to get a reply, as the first author may have moved on and the PI is unfamiliar with the details of the data. 

This may speak to an opportunity to better train researchers in becoming familiar with curation methods and standards, to allow for unambiguous reporting in their publications. Requirements to share data at the time of publication will also help address this need.

Getting the journals involved

This led to the next question about working with the journals to publish data in a more structured way. Carol has had some experience working with journals in the mouse community, who are careful about publishing mouse names with the accepted terminology and nomenclature. She did mention that sometimes there is push back as to whether the recommended standard is the accepted standard, and whether this is going to evolve or change in the future. We all may be familiar with the situation below.

Source: https://xkcd.com/927/

This is an opportunity for a systematic community approach, the ISB should promote standards adoption to the journals.

Sandra pointed out that a challenge with approaching journals to use our standards, is the sheer number of journals. A more targeted approach may be more appropriate. For example, the proteomics community was successful in getting a restricted number of journals in their field to require data sharing to ProteomeXchange (http://www.proteomexchange.org/) prior to publication.

Sandra also recommended that we first talk amongst ourselves as a community and define our needs, and what standards to adopt and promote, and then approach the journals.

The elephant in the room: Funding

In recent years, NIH funding has decreased to various databases. How do we sustain our own careers, and train the next generation of curators? 

Kambiz felt it is easier to justify the need for curation due to the regulatory aspect of his industry. Even if there are NLP based processes to extract gene to disease relationships,  manual review will always be needed. He foresees  automated processes will assist with manual curation going forward.

Carol emphasized that we need to promote how important curation is to data science. Data science is recognized as an important field, therefore we should frame curation within its role in data science. We have to be better about explaining return on investment in curation – what can we do when data is curated, and we wouldn’t be able to do, if it wasn’t? She pointed out that the reality that biocuration is considered infrastructure, which is largely ignored, until it is broken. As a Society, can we demonstrate the impact that biocuration has on advancing data science?

Sandra reiterated that we need to make ourselves more visible, we need people outside the community to understand what we do. We need to work together as a community efficiently to not duplicate efforts, we need to align on standards, use specialist databases for initial analysis and data cleaning, and use the baseline resources like accession numbers, and show good examples of good curation.

Continue the conversation on Slack.

Do you have topics you’d like to discuss in a future panel, or suggested speakers? Please let us know (intsocbio@gmail.com).

Biocuration 2020 Meeting is Cancelled

From the Biocuration 2020 conference organizers:

After continuous monitoring of the evolving situation with the coronavirus disease 2019 (COVID-19), the Biocuration 2020 conference organizers in conjunction with the International Society for Biocuration made the decision to cancel the 2020 meeting that was scheduled to take place May 17-20 in Bar Harbor, Maine, USA.

Cancelling this meeting was a difficult decision made after careful thought and deliberation. This evidence-based decision was made after a thorough review of all factors impacting the conference, including the U.S. government’s enforcement of restrictions on international travelers to enter the U.S.; the imposition of travel restrictions issued by U.S. government agencies, cancer centers, academic institutions, and pharmaceutical and biotech companies; the directives of infectious disease experts and the World Health Organization (WHO). However, our primary concern was the health and safety of the attendees, the venue staff and the local community.

We recognize that the presentation of new data, exchange of information, and opportunities for collaboration offered by the chance to meet colleagues in this field are highly valued and we are investigating options for rescheduling this fall or for virtual presentations.

A great deal of work went into planning this meeting, and we wish to thank the members of the scientific committee, the organizing committee, the Jackson Laboratory Courses and Conferences office, the International Society for Biocuration, and the Harborside Hotel and Bar Harbor Club for their time and effort.

We will contact all registered attendees about the refund of registration and conference dinner fees, and we will also contact all sponsors about return of fees associated with contracts. All attendees should cancel their own hotel and travel reservations; please check the cancellation policy for your hotel, airline, etc. We have notified the hotels listed on the meeting website to expect cancellations.

Anyone with urgent and immediate concerns should email biocuration2020@gmail.com. Any additional updates concerning this meeting will be posted onto the meeting website and on social media.

The Biocuration 2020 conference organizers wish to sincerely thank everyone for their patience throughout this decision-making process.

Biocuration 2019 – Workshop Reports

GREEKC

The COST Action GREEKC held a workshop inviting community feedback on its work to align efforts to curate, standardize, archive and share information about the regulation of gene expression. A status report was presented by the Work Group leaders, and feedback on the organisation of the next events was received. Some of the feedback that GREEKC needs can still be given through these surveys: “The Work of Curators” and “The Experience of Curators”. One of the main discussion points concerned a re-design of the Sequence Ontology, and a comprehensive set of term requests necessary to annotate the regulatory genome are now being worked on with the SO team (Eilbeck group, Utah). The status of the SO was further discussed with a much wider group of users within the Biocuration community, at an impromptu lunch discussion later during the Biocuration 2019 main event.  We hope to be able to present a significantly updated SO at our upcoming workshops, the first week of November 2019.

The IMEx Consortium of Molecular Interaction databases

The IMEx Consortium is a collaboration between interaction databases willing to share data and curation effort. This workshop focused on the coordination and further definition of curation practices. Topics covered were curation coordination tools such as IMExcentral and targeted curation practices, glycan-related physical interactions, nucleic acid-protein interactions and the influence of variation upon interaction outcome. In a joint session with the GREEKC community, transcription factor-target gene interactions and causal relationships were also discussed, developing already active areas of collaboration between the two communities on the representation of this type of data. If you are interested in contributing to the work of IMEx, contact us on intact-help@ebi.ac.uk

Practical ontology applications, tooling and interoperability best practices for FAIRification

This workshop provided an interactive introduction to FAIRification and interoperability best practices in the context of ontology services and semantic web technologies such as the OBO Foundry, ontology service suite at the EMBL-EBI and ELIXIR’s Recommended Interoperability Resources (RIRs). The day started with a general introduction to interoperable data management and FAIR principles before a series of talks and practical demonstrations on resources including the OBO Foundry in general, and specifically OBO core, the single cell expression Atlas (scAtlas), the EMBL-EBI Ontology Tooling Suite and a number of ELIXIR Recommended Interoperability Resources (RIRs) such as FAIRsharing (slides), InterMine and BridgeDb. The workshop concluded with an open-floor discussion on the needs of the biocuration community with respect to FAIR resources and ontologies, and ontology and FAIRification best practice.

Curating Evidence for Gene:Disease Validity for Clinical Omics

Three Gene Curation Coalition (GenCC)  member groups (Genomics England PanelApp, ClinGen and Orphanet) presented an overview of their gene curation strategies and focus, leading to dialogue about the merits and challenges of each approach. The conversations reinforced some of the challenges we face in performing manual curation of gene:disease associations, and the rules we have in place to ensure consistent annotation. We reviewed where we could most benefit from incorporating additional ontologies and mappings into our resources, and areas that required further clarity; it quickly became apparent that even the term ‘panel’ can be ambiguous given its different use between resources- are we talking about a panel of genes, or a panel of people? Curators are already familiar with the need for consistent curation terminology, and the workshop provided the perfect opportunity to poll attendees for their views on clinical evidence descriptions. We were then able to demonstrate how the recent efforts of the GenCC to establish consensus terms for validating gene:disease associations will allow us to work together and allow efficient data sharing. Overall we hope that the workshop provided an insight in to the roles and diversity of data curation in the clinical setting.

Equality, Diversity, and Inclusion (EDI)

The introductory slides explained what these terms mean and how they are being embraced by scientific institutes in different countries. This was followed by a more in-depth presentation by the invited speaker Dr Saher Ahmed, head of EDI at the Wellcome Sanger Institute, Cambridge, UK, who discussed gender discrepancies in the workplace, and highlighted some efforts at Sanger to address these issues, such as pay transparency, changes to their leave policies, and creating a family-friendly workplace. The remaining time exchanging views on the gender pay gap, maternity, paternity & carers leave, cultural differences in working practices and accessibility. As an outcome of this workshop, attendees agreed there is a need for the ISB to create an EDI subcommittee and that this workshop should be held at subsequent Biocuration meetings. The EDI subcommittee is currently being formed, and the exact roles are to be defined, but they will address issues including a code of conduct amongst the Society as a whole and at conferences, and accessibility at conferences and for ISB activities.

“Not Everything That Counts Can Be Counted” – how biological resources should be evaluated

As scientific data output continues to grow, biological resources are increasingly critical for data discoverability and re-use. However, many highly-used biological resources find it increasingly difficult to secure and maintain funding. This discordance implies that the value of curated resources to the research infrastructure is still not fully appreciated in the wider scientific community, or that the money saved by curated resources is not fully factored into funding models. In this workshop, we hoped to address questions surrounding this disparity.  A short introduction to the issues was provided by Marc Gillespie. A funder’s perspective was provided by David Carr (Wellcome Open Research). Jo McEntyre (EBI, Literature Services, UK)  provided an overview of the Elixir indicators designed to evaluate resource quality in a standardized way.  Two major priorities emerged from the discussion. Firstly, knowledge bases not only capture data but also synthesize new knowledge. The differences in requirements for evaluating archive and knowledge-base database should be made clearer. Secondly, the need to educate the scientific communit and funding bodies about the hidden work and benefits of data curation is urgently required. Suggestions ideas and recommendations gathered during the conference and post-meeting are documented here, and we encourage curators to add further ideas, with an aim to develop into an ISB position paper during 2019-2020.

Biocuration in Industry

The Biocuration in Industry workshop was organised by Jane Lomax (SciBite) and Yasmin Alam-Faruque (Eagle Genomics) with an aim to discuss the experiences of, and challenges faced by, non-academic biocurators. The workshop attracted ~100 participants, with most coming from academia. The workshop began with short talks from commercial companies, including Nebion, Hoffman-La Roche, Healx and Eagle Genomics, who described their curation pipelines, standards and scientific interests, which included cancer immunotherapy, microbiomes and Parkinson’s disease. A common theme was the use of public standards and ontologies, emphasising the importance of key resources such as MONDO, GO, HPO and MeSH to aid drug discovery and knowledge management. This also came through in the subsequent panel discussion where the panel agreed that, in order to maintain the high-quality of these resources, there is an onus on the commercial sector to contribute back improvements to these open-source efforts. The main challenge for the panel, as in the academic sector, is data cleansing to create high-quality and reproducible datasets for downstream processes. However, this was seen as a valuable, and transferable, skill for biocurators as the biomedical industry increasingly recognises the need for clean data.

The Phenotypes Traversing All the Organisms (POTATO)

The POTATO workshop is part of an ongoing effort to reconcile phenotype ontologies across species. This, the second workshop in the series, brought together 24 curators and ontology developers from a variety of backgrounds including representatives of many important groups in the phenotype curation space: Monarch Initiative, the Alliance of Genome Resources, ZFIN, PomBase, dictyBase, PHIBase, GO, SGD, HPO, FlyBase, MGI, Phenoscape and more. The Phenotype Ontology Reconciliation Effort aims to align phenotype ontologies using a common set of design patterns. These design patterns depend on a variety of external ontologies including the Phenotype and Trait Ontology (PATO) and the multi-species anatomy ontology, Uberon. The workshop included training in editing these two ontologies. It also featured an extended session to develop a strategy to deal with shortcomings and current limitations of PATO and its usage, as identified by the Phenotype Ontology Reconciliation group. During this session, focus groups discussed a number of PATO related issues, including how to improve PATO definitions in general and how to improve PATO representation of increased and decreased amounts (including absence), frequencies and rates. A number of edits to PATO have already been implemented as a result of this work. The results of the discussion are currently being written up as a meeting report, which will guide future improvements to PATO.

Data Licensing Workshop

The data licensing workshop at Biocuration 2019 was focused on helping scientists to understand important factors in the selection of a data license, as well as the implications of that selection on downstream use and reuse.  We had a diverse line up of speakers who each shared their unique perspective — data owners, data miners, and a legal expert — followed by a robust discussion among all participants. The goal of the workshop was not to achieve consensus on the “best” license, but rather to share experiences, perspectives, and questions.

Mapping the Landscape of Biocuration

This well attended pre-conference workshop asked questions such as: What is the state of biocuration in 2019? Where are biocurators based? What are their skills and levels of expertise? What training do they need? What are the tools they use? As part of an ELIXIR Implementation Study, members of EMBL-EBI, FAIRsharing.org/Oxford and SIB ran a survey to capture information on biocurators and the resources they run, the life science/health domains they operate in, and their expertise and training requirements. In the workshop we described the current biocuration landscape, and ran an interactive session to compile feedback on career progression and training roadblocks. Slides from the workshop can be found here: Survey: 10.7490/f1000research.1116798.1; FAIRsharing: 10.7490/f1000research.1116785.1; TeSS: 10.7490/f1000research.1116784.1). More information on the Implementation Study and follow-up work can be found here: https://elixir-europe.org/about-us/implementation-studies/mapping-biocuration