Minutes summerf2f 2009

Minutes from the face-to-face meeting, summer 2009 @ EBI

Tuesday AM 9th June

Caveat: Minutes may occasionally be inaccurate during capture

Attending: JM, BP, NM, MC, FG, DD, PL, LS


Collaboration with Envo

JM: We have some sample information in AE which relates to Envo scope

NM: Current thinking is that environments are holes or sites with qualities that enable the area or site to support life. Environmental sampling would be taking some quantity from this area or site. I assume you would want to represent an environmental sample from the point of view of a sample going through an assay.

MC: If you have bacteria in an organism then the organism is an environment, so is it a role?

BP: So we don't have environment at the moment; we will have sample collected from site in OBI. There is a class site in BFO, so we should define things in OBI under this. We don't worry about whether it is an environment; this becomes clear when it is involved in a sample collection process.

LS: It would be good from our point of view to have a higher level class for environment

FG: This shouldn't matter if we use BP proposed approach. So any env should be under site and then in future we can adjust this when envo has definition.

MC: How do we deal with having things under material that we want to say are environments?

BP: If we say mucous membrane then we talk about this class defining a site.

FG: So they are completely different

BP: So inside of my body is a site and this is

FG: material has_part site

BP: Barry has said this is completely clear under BFO so we should follow this

MC: What do we (OBI) need to do for Envo

FG: We submit anything we consider site to the Envo tracker (see OBO foundry sourceforge site)

JM: I can generate list of sites from AE and pass that on to NM if useful

NM: Yes that would be useful. We are doing rocks and sludge, land cover such as agriculture and urban areas from satellite views of world, anatomical regions, holes in which you would find bacteria growing

NM gives demo of wonton ontology browser (URL is not shareable for this as currently private)

NM: Env and habitats and how you instantiate relations in OBI. Envo is trying to remove all reference to habitats because we want to only reference site.

FG: Defining environments is outside the scope of OBI (agreement from room)


Assessing use cases in OBI paper

BP: Discussion about replacing one of use cases from OBI Journal paper with the evidence code use case, discussed during the OBO Foundry workshop

FG: I would suggest adding a further use case.

LS: Should we just add this to existing use cases rather than create a new one?

BP: So the EC ontology exists and the idea is to replace all of the EC IDs with an OBI identifier. This is a use case for OBI.

FG: Competency question is can we replace EC IDs with OBI IDs, and second can we serve same queries. I would like to go through existing use cases first and then do other use cases.
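The two competency questions FG states can be sketched roughly as below. This is only an illustration: the annotation records, the `EC:example_*` and `OBI:example_*` identifiers, and the function names are all hypothetical placeholders, not real EC or OBI IDs.

```python
# Hypothetical annotation records: (annotated entity, evidence ID).
annotations = [
    ("gene_1", "EC:example_a"),
    ("gene_2", "EC:example_b"),
    ("gene_3", "EC:example_a"),
]

# Hypothetical mapping from EC IDs to replacement OBI IDs.
EC_TO_OBI = {
    "EC:example_a": "OBI:example_1",
    "EC:example_b": "OBI:example_2",
}


def replace_ids(records, mapping):
    """Competency question 1: can every EC ID be replaced by an OBI ID?"""
    missing = {ev for _, ev in records if ev not in mapping}
    if missing:
        raise KeyError(f"no OBI ID for: {sorted(missing)}")
    return [(entity, mapping[ev]) for entity, ev in records]


def entities_with_evidence(records, evidence_id):
    """A stand-in for 'the same query': entities annotated with one evidence ID."""
    return sorted(entity for entity, ev in records if ev == evidence_id)


migrated = replace_ids(annotations, EC_TO_OBI)

# Competency question 2: the query gives the same answer before and after.
for ec_id, obi_id in EC_TO_OBI.items():
    assert entities_with_evidence(annotations, ec_id) == entities_with_evidence(migrated, obi_id)
```

Raising on an unmapped ID rather than passing it through keeps a partial migration from looking like a successful one.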

MC: I would like to address EC one

FG: Only if we address other use cases first, this is priority (the journal use cases).

JM: How are we going to do this? I would like a process for this session

FG: Can we take each use case and represent it in actual OWL.

GenePattern use case

BP: Visualization is a process so it should be under planned process, presently it is obsoleted

JM: It should be put back in if useful

FG: We should add OBI IDs next to each name in the manuscript

BP: I think instead we should not do this as it is annoying to read, instead we add this in suppl material

JM: I agree, I think we need to make it easy to read first

FG: clustering process needs to be renamed in manuscript to class discovery data transformation (OBI_0200175), normalization process needs to be renamed in manuscript to normalization data transformation (OBI_0200169)

AI: Rename the above in the figure in the manuscript

Discussion about how we link the data item that is output from DT to the report graph that visualizes it.

JM: The part that is missing is the link between the data and report graph

Summary of discussion: We will make the connection by saying a report graph is_about some data item and then add the specific details for each class. So for clustering data the link will be, e.g. dendrogram is_about some tree-like data set (of which clustered data set will be a subclass) or .... etc. This will make the connection between our data sets, the DTs and the report graphs that render the data. This will enable us to represent the GenePattern use case, e.g. tell me all report graphs that can visualise clustered data sets (i.e. output of clustering data transformations).
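A minimal sketch of the query this summary describes, assuming a toy subclass hierarchy in plain Python rather than the actual OWL files. The "histogram" and "normalized data set" entries are invented here purely for contrast; only the dendrogram / tree-like data set / clustered data set chain comes from the discussion.

```python
# Simplified, hypothetical hierarchy: child class -> parent class.
SUBCLASS_OF = {
    "clustered data set": "tree-like data set",
    "tree-like data set": "data item",
    "normalized data set": "data item",  # invented for contrast
}

# report graph class -> data item class it is_about.
IS_ABOUT = {
    "dendrogram": "tree-like data set",
    "histogram": "normalized data set",  # invented for contrast
}


def ancestors(cls):
    """Return cls followed by all of its superclasses, nearest first."""
    chain = [cls]
    while chain[-1] in SUBCLASS_OF:
        chain.append(SUBCLASS_OF[chain[-1]])
    return chain


def graphs_that_can_visualise(data_class):
    """Report graphs whose is_about target is data_class or one of its superclasses."""
    targets = set(ancestors(data_class))
    return sorted(graph for graph, about in IS_ABOUT.items() if about in targets)


print(graphs_that_can_visualise("clustered data set"))
```

The dendrogram is found for clustered data sets only because clustered data set is a subclass of tree-like data set, which is the role the subclass link plays in the proposal.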

AI: JM + MC: to add the is_about relations between data sets and report graphs, priority is to get clustering example from GenePattern completed. MC will help from IAO side as most of these axioms will be on IAO classes.

Immune Epitope DB Use Case

BP: In this use case there is an issue, which is that I created an intermediate ontology for our purposes because we needed a lot of assays in quickly, and OBI is not stable and also takes time to add classes to. So in many ways this is not the best demonstration of OBI because it advocates a bespoke ontology.

FG: Can't you import this bespoke ontology back into OBI

BP: we have done some work on this

FG: Is epitope within scope of OBI, i would be inclined to say yes

BP: It is not clear to me whether it is or not. What we have done with the ontology is build it around OBI but give the classes ONTIE IDs and keep them apart for now. (BP shows ontology) There are definitely some things that belong in OBI, the ELISA assay for example, but we have very specific assays which are granular because we use them in our work; not sure if OBI needs these

JM: Questions are, is in scope, who would be keeper for this if it wasn't and would it become sep ontology or sep file imported in?

BP: I would be happy to give these OBI IDs but equally fine with them being in sep file.

FG: This is a good use of how to build an application focused ontology under OBI

JM: I agree, similar to way we are trying to build for AE

AI: BP decide whether these should be added to OBI and come up with list of requirements before this can happen (i.e. adding things to OBI)

AI: MC + AR to check if this can be accommodated once above is done

FG: The slant needs to be changed in journal so that it shows that OBI can be extended for application ontology successfully

JM: Agreed this will become AI for BP (see below)

AI: BP The slant needs to be changed in journal so that it shows that OBI can be extended for application ontology successfully

FG: Presently, as OBI stands, we can't do all of this use case

BP: No, it still requires a few things, but all doable before journal is out

BP: I can add more specific info as well, like these are the classes you get back from this query too, but I'm not sure it is interesting and we already have the GenePattern querying use case

FG: I think this is still useful as it's from a diff perspective, using application specific external ontology

MC: Do we explain why this useful, what advantages of using this approach are

BP: Yes, we explain in the journal paper where we demonstrate how this could tie in different resources using a single query

MC: But you are going to query using SPARQL oui? Something about ducks and genes querying GO and pubmed at same time, so why is this good to have in OBI instead of 3 different resources?

BP: This is fine as long as it is all based on OBI structure

MC: Paté

BP: Second part for IEDB was to use ontology to validate using consistency checks

FG: Add this to discussion perhaps to save space, it is useful to say though

JM: Agreed with above

BP: OK will move it

MC: Shall we put in examples, for example we can check for x, y or seagull?

BP: Yes so we do this


Use case 3: JF representing a protocol

BP: So the use case discusses a protocol and how we can represent this. We don't have in OBI: ethics committee, enroll balanced cohorts; placebo is still problematic. None of this can be done presently in OBI.

FG: Maybe we should stop at first part or not include it at all given we can't do it

JM: So pros and cons as I see it: pro is it does talk about a study and the parts therein, which is different and is critical to OBI's scope. Con: I don't see that this is an actual use case. What is the USAGE here, is it annotation, template query, etc.? Also, does this overlap with the early methods section describing how we represent bits of studies?

FG: I think we don't cover most of this in ontology and it doesn't show use as in use case so we should consider dropping it

JM: Can we make modifications rather than drop?

MC: In what way is it not use case?

JM: Use cases have an actor that show how the ontology will be used in a given scenario, this is largely a coverage example at present, but this can be modified to become a use case.

FG: The issue still remains that we don't cover it

PL: This is a use case which is describing utility

FG: yes but it needs updating as it's not correct at present. I'm not saying delete it but we should reformulate it.

JM: I think we need a third use case, and even if we thought about replacing it we would still be faced with the issue of requiring some work for a new use case, so that doesn't remove the problem. I would prefer to reformulate this and fix OBI

FG: so we need to go back to JF with the following four questions: What is the experiment you are trying to do? How do you model this? How are you using OBI? What are the queries?

AI: BP: to ask JF the following four questions: What is the experiment you are trying to do? How do you model this? How are you using OBI? What are the queries? This will enable the use case to be completed

Tuesday PM 9th June

Looking at first section of manuscript

BP: We need a class that is high level, such as our anatomical entity, that we can import from external ontology. We need to be able to import a species neutral

AI: Biomaterial to contact Chris about species neutral anatomy (UBERON) as we need to import this into OBI

FG: chemical entities in solution why is this solution and not liquid?

MC: shall this be a defined class?

BP: No. more generally we are trying to model mixtures but we have limited to this for now.

BP: In an ideal world we would have a term 'mixture'. This is where we need to demonstrate the has_grain relation

MC: If we add this into the journal now we will take up a lot of space

FG: so chemical mixture is defined as mixture of one or more homogenous chemicals. So I think what chemical entities in solution describe is chemicals in a solution which is not a mixture. So are you saying you want mixture class?

BP: Yes I feel we should have this, since we have mixed materials generally not accounted for in OBI. At same time we have some issues saying, for example, why a cell is not one.

ET: yes

BP: Process material we have in, specimen and blood sample etc needs to be removed from manuscript and realigned with what we have in OBI. Postpone for 30mins for Phil to get back as we would like an outside review. We had discussion in previous workshop to skip specimen and sample but we can't avoid this any more.

JM: Agree we need to address these terms, they belong in OBI if anywhere.

MC: grml

FG: Protein is not a type of peptide

BP: This is not an OBI issue, it is defined as such in external ontologies, ChEBI and PRO. We don't have recombinant expression system in OBI, can we add it?

FG: We need to say proteins can be synthesised or whatever

BP: This is not the same, i think natural language works here.

FG: We don't have agent in OBI

BP: No but this is in preparation for objective specification-planned process part, but we perhaps don't have space to add all the details in manuscript

FG: Agreed

MC: I agree with Frank, as always

FG: I disagree

BP: has_specified_output and has_specified_output_data - the issue with these two relations is that they are correct if we wish to assign 'output' role to material entity and data because we don't know if we can assign the role of input and output to data currently. If we can have input and output role for data then we don't need these two relations.

FG: We should just have input and output relations nothing more specific than that.

MC: I think everyone agree we need roles for data so we should push to barry/bfo, the problem is with the specification of the relation they need to have role output or input

LS: That needs to be fixed

JM: I don't see issue, can someone give a concrete example

MC: Issue is when we have role input/output it means that your role is realized within a process so it is a 'special' status

BP: When you have an experiment and have instrument, cell etc, then they are all participants in the process. We should remove the roles part of the definitions for the input and output relations and we can remove the information related input and output relations: has_specified_output_information, has_specified_input_information and instead has_specified_output and has_specified_input should be used.

AI: LS and relations branch: remove the roles part of the definitions for the input and output relations and we can remove the information related input and output relations: has_specified_output_information, has_specified_input_information and instead has_specified_output and has_specified_input should be used.
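As a rough sketch of what this action item implies for existing assertions, the information-specific relations could be rewritten to the generic ones as below. The triples and the function name are hypothetical examples, not content from the OBI files.

```python
# Relations to retire -> generic relation that should be used instead.
RELATION_MERGES = {
    "has_specified_output_information": "has_specified_output",
    "has_specified_input_information": "has_specified_input",
}


def merge_relations(triples):
    """Rewrite (subject, relation, object) triples to use only the generic relations."""
    return [(s, RELATION_MERGES.get(p, p), o) for s, p, o in triples]


# Hypothetical assertions, for illustration only.
triples = [
    ("normalization data transformation", "has_specified_input_information", "raw data set"),
    ("normalization data transformation", "has_specified_output_information", "normalized data set"),
    ("material separation", "has_specified_output", "separated material"),
]

for triple in merge_relations(triples):
    print(triple)
```

Triples that already use the generic relations pass through unchanged, so the rewrite can be run repeatedly without side effects.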

Phil reenters room so we go back to sample

PL: Agreed with the changes that output range should become a GDC or material entity. (some rant about GDCs and why boundaries are Indep Cont)

Sample/Specimen

BP: One decision we made in biomaterial call is to try and model something simple first rather than try and do all possible uses of the words. This example was blood sample. This became hard because we ended up referencing other terms that are also contentious

frank takes the floor to demonstrate his thinking

PL: There are two uses we are debating here. First is: I took some blood from someone. Second: this is a randomly selected subset of a population which is representative of the larger population

BP: I can have a population of 1, if i look at a whole body for example.

FG: Yes, my definition (Referring to my google doc) is that a sample is a role that is borne by a material entity that is, the part (subset) of a population (whole or class) which is assumed to be representative of the population.

PL: Data sets would not work here since it involves both roles and also Information which is not a material entity

JM: So we need two sample, one for material and one for statistics

BP: Yes

Agreed definition: a material sample is a role that is borne by a specimen that is, the part (subset) of a population (whole or class) which is assumed to be representative of the population.

FG: We can also add the defined class for material sample which is material bearing role of sample

BP: This is fine but it is not the main controversial part. If I do sample extraction, is it material separation or sample?

PL: If you think of material sample as a role, I might want to collect butterflies but that does not mean they are samples they are just a collection or specimen.

BP: Question is when does it become a sample and not just a specimen?

A specimen role is a role borne by a material entity that is collected for potential use during an investigation

Furthermore; material sample is_a specimen (but not other way around)

specimen examples: Biobanking, e.g. blood taken and stored in a freezer for potential further investigations

material samples: blood drawn from a patient to monitor glucose level,

JM: There is also issue that role is only borne during a process so once process ends it's not a sample

FG: Well problem is in BFO these may not work since the role is dropped at end of the process

PL: Stored is key here. If we say it is stored until it is thrown away, it still has the role during this storage process

Specimen will lose its role when it is no longer stored; sample loses its role when it is no longer representative of the whole

BP: Role is a social construct so as long as someone has the intent to derive something from it then it still has the role

PL: it will fall down for measurements of height since heights are quality

specimen gathering process is_a planned process with the objective to obtain and store a material entity for potential use as an input during an investigation

material sampling process is_a planned process with the objective to obtain and store a specimen that is representative of an input material entity (we will have to be able to answer for a given material sample what material it is representative of; that may require adding a role for the 'sampled material'. Right now we will use the input relation)
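The specimen and material sample definitions above can be illustrated with a deliberately simplified sketch. Treating roles as boolean checks on a record is an assumption made here for readability; it is not how BFO roles are modelled, and the field names are invented.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class MaterialEntity:
    label: str
    collected_for_investigation: bool = False  # collected for potential use in an investigation
    stored: bool = False                       # still being stored
    representative_of: Optional[str] = None    # the material it is assumed to represent, if any


def has_specimen_role(m: MaterialEntity) -> bool:
    """Specimen: collected for potential use in an investigation and still stored."""
    return m.collected_for_investigation and m.stored


def has_material_sample_role(m: MaterialEntity) -> bool:
    """Material sample is_a specimen that is also representative of some material."""
    return has_specimen_role(m) and m.representative_of is not None


biobank_blood = MaterialEntity("blood stored in a freezer", True, True)
glucose_blood = MaterialEntity("blood drawn to monitor glucose", True, True, "patient's blood")
discarded = MaterialEntity("discarded blood", True, False, "patient's blood")

for m in (biobank_blood, glucose_blood, discarded):
    print(m.label, has_specimen_role(m), has_material_sample_role(m))
```

Because the sample check calls the specimen check, every material sample is a specimen but not the other way around, and both roles are lost once the material is no longer stored.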

JM: We need to say that when it is no longer stored it is not a specimen not just when it is destroyed since this is our definition

FG: I don't really care about this all that much, even though you make a very good point James and are correct.

JM: Why thank you Frank.

BP: bronchial alveolar lavage is this part of lung or is it extracellular

PL: it is extracellular it is secreted fluid, the lavage step suggests that something extra has been added to the process though

PL: I do not understand the derives_from relation; this is not a definition by BFO standards. Derivation by this definition is only from biological materials, as it mentions biologically significant.

FG: Can we use has_grain?

BP: I don't see how we would use has_grain, how would we?

FG: Beer

PL: derives_from suggests that original does not exist from the new part so it would not work for lavage. What you are trying to say is has_part some stuff that derives from some anatomical entity

BP: we need to be able to say that there was a link, that it_was_part_of but the original still exists, similar to a sample process, but essentially it is what did you take the specimen from.

PL: If you have the notion of lavage, the process to lavage, then you already have the notion of what it was

PL: we need to distinguish between process and result of the process, rest seems reasonable

BP: So we need to have a lavage process, we can add this now

MC and FG shake head, not at work but reading Twitter

LS: What language is word lavage from?

MC: français

LS: Do you know OBI in Japanese means sash?

BP: yes logo is meant to reflect that

BP: Can we call things urine specimen and blood specimen

JM: Yes

Looking at agenda for tomorrow

BP: We could consider, tomorrow, having time where 2 people leave room and edit the owl files while rest continue on with other things to speed things up, e.g. release process is not of interest to all, so that should split off with MC & someone else

BP: Also add reasoning to the agenda for Alan as it has slowed again

JM: Would be useful if Alan could give us some tips tomorrow on helping speed up reasoning as we edit

Tuesday Action Items

AI: JM to generate list of sample envs from AE and submit to Envo

AI: JM to edit the figure for the GenePattern manuscript to reflect the wording in OBI, e.g. normalization process in figure should be normalization data transformation

AI: JM + MC: to add the is_about relations between data sets and report graphs, priority is to get clustering example from GenePattern completed. MC will help from IAO side as most of these axioms will be on IAO classes.

AI: BP decide whether these should be added to OBI and come up with list of requirements before this can happen (i.e. adding things to OBI)

AI: MC + AR to check if this can be accommodated once above is done

AI: BP The slant needs to be changed in journal so that it shows that OBI can be extended for application ontology successfully

AI: BP: to ask JF the following four questions: What is the experiment you are trying to do? How do you model this? How are you using OBI? What are the queries? This will enable the use case to be completed

AI: Biomaterial branch: to contact Chris about species neutral anatomy (UBERON) as we need to import this into OBI

AI: LS and relations branch: remove the roles part of the definitions for the input and output relations and we can remove the information related input and output relations: has_specified_output_information, has_specified_input_information and instead has_specified_output and has_specified_input should be used.

Wednesday AM 10th June

Attending: JM, BP, MC, FG, DD, PL, LS, AR, JZ, SAS, PRS, RS

BP gives overview from Tuesday OBI meeting

AR gives overview from industry meeting

Discussion on sample/specimen

AR: we need to add realises to the definition as this is a role

Bjoern edits to add realises

BS: We don't need storing process in the definition

BP: It is needed as otherwise the role is lost once it is collected and this is not what we desire; it needs to be a specimen long after it is collected

RS: What is main distinction here between sample and specimen

FG: The distinction is that sample is representative of a whole whereas specimen is not, e.g. blood sample is representative of the whole blood, whereas if you cut an arm off and use it then it is a specimen

AI: AR to think about the definition of sample since he had a couple of minor issues

BS: Earlier you said gathering and storing - do you mean gathering is one process and storing is another, or do you mean gathering and storing as one single process? This needs to be

AI: BS to tinker with wording a little of specimen and sample

Continuing manuscript work

BP: Are we saying we have a relation with IAO which is that we are also building IAO within OBI, or that IAO is a separate effort?

AR: I think we should say that IAO was spun off from OBI as a result of the process of OBI

BP: rename information content entity in IAO to Information artifact

AI: MC rename information content entity in IAO to Information artifact

Discussion on parameter

AR: So there is a question whether an indep and dep variable should be in OBI

BS: They should be in OBI they don't exist outside biological investigations

AR: Indep and dep variables are clear, parameters are not clear. This requires discussion. Best way would be for JM and I to recapitulate our views

BS wants to redefine independent variable (focusing on definition on board)

Lots of continuous discussion by everyone at once so not able to capture

Wednesday PM 10th June

example for parameter

for planned process k-means has_objective (for example) class prediction objective:

and this objective specification has part_of some input parameter and some input dataset and some output
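One way to picture the parameter example is as a small data structure, sketched below. The class and field names are hypothetical, and the concrete part labels ("number of clusters k", "expression data set", "cluster assignments") are illustrative assumptions; the minutes only name the three kinds of part.

```python
from dataclasses import dataclass, field


@dataclass
class ObjectiveSpecification:
    label: str
    # Each part is (kind, label); kinds follow the example in the minutes.
    parts: list = field(default_factory=list)

    def parts_of_kind(self, kind):
        """Return the labels of all parts of the given kind."""
        return [label for k, label in self.parts if k == kind]


@dataclass
class PlannedProcess:
    label: str
    has_objective: ObjectiveSpecification


kmeans = PlannedProcess(
    label="k-means",
    has_objective=ObjectiveSpecification(
        label="class prediction objective",
        parts=[
            ("input parameter", "number of clusters k"),  # illustrative label
            ("input dataset", "expression data set"),     # illustrative label
            ("output", "cluster assignments"),            # illustrative label
        ],
    ),
)

print(kmeans.has_objective.parts_of_kind("input parameter"))
```

Hanging the parameter off the objective specification, rather than off the process itself, mirrors the structure written on the board.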

AI: JM to do parameter mark up from above


Data types

BP and JM: Discussion on data types, we would need ranges represented by things such as 'string', 'floating point' and so on

JM: I already submitted some of these to IAO list

BS: integer and string are not data types, they are numbers or letters. Data types are social artifacts

AI: AR, following completion of JM parameter, AR will take this to IAO

BFO Discussion

BS: The common upper ontology which was to go into ISO is on hold for now. BFO 2.0 will be released soon and will incorporate RO 2.0 into the release. There will be several version of BFO 2.0, e.g. common logic version, owl version.

PL: Which one is the 'correct version'?

BS: Common Logic version is the canonical version of BFO

AR: We have to live with the differences as a consequence of language differences.

AR: Alan does not want object, fiat object part and object aggregate in BFO

BS: Many people in the community use these so they are staying. Generically dependent continuant is being added to BFO 2.0 as previously agreed.

Chris Mungall: What is currently under disposition

BS: Nothing presently

BS: Function and capability will be added under disposition

AR: Are there any objections to the realizable entity changes?

BP: Given the approach we are going to take by putting everything just under realizable entity and then infer the different subtypes, this is fine for OBI I believe.

BS: We wish to remove processual context, this will become deprecated.

PL: What is relation between spatial region and spatiotemporal region

BS: That is a projection relation, so that you can project a spatial region onto a temporal region. I have a paper on this very topic; unfortunately there are too many relations in that paper and that will potentially introduce problems. Do we have a relation of process to temporal region?

CM: duration of is in

BS: This is somewhat ambiguous, we need to address this

PL: What is an instance of a zero-dimensional region?

BS: Take a map of the heavens and pick a point in that map that is vacuumous and that is a zero-dimensional region

PRS: Surely there is a relation between two-dimensional region and site for example

BS: Yes and that is something that I need to address

BP: do you envisage any OBO foundry ontology that will use spatial region or temporal region

BS: I have thought of removing that.

AR: are subclasses under material entity needed?

BP: What is the need for spatial / temporal regions? If they are included, they seem to be missing combinations (scattered ...)

BS: length is only a property of spatial regions. There is an is_projected relation, which links objects etc. and the regions,

PL: Example: length of longest entity in the universe + 1 meter

PL: how do we express time duration for a process

BS: duration of time interval into which the process is projected

PL: How do you then describe velocity, acceleration etc?

BS: Will follow up.

Discussion on IAO

There are some apparent issues with slow progress of IAO. Alan has declared that he is the branch leader in effect and that there will now be calls at which term submission will be resolved to enable faster resolution of classes

AI: AR to organise calls for IAO

Change of URL in OBO Foundry

AI: AR + MC: MC will send email out to OBO-discuss to let everyone know of URL change and AR will do change

AI: AR measured data item - alan will organise a call to discuss the conflicts between this class, data and so on also add to tracker to IAO

back to journal paper

Frank closes "Rugby Union Heroes" website and back to Protege

Looking at the figure in the journal paper of the analyte mouse assay and checking for coverage in OBI

AI: MC will MIREOT mouse.

AI: MC or AR Author role is to be moved to IAO

AI: JF and Role branch to coordinate and improve/remove study specimen, study participant and anything else related in light of the new definitions for specimen and sample.

AI: Biomaterial branch to add class 'test tube'

Wednesday Action Items

AI: AR to think about the definition of sample since he had a couple of minor issues

AI: BS to tinker with wording a little of specimen and sample

AI: JM to do parameter mark up from above

AI: AR, following completion of JM parameter, AR will take this to IAO

AI: AR to organise calls for IAO

AI: AR + MC: MC will send email out to OBO-discuss to let everyone know of URL change and AR will do change

AI: AR measured data item - alan will organise a call to discuss the conflicts between this class, data and so on also add to tracker to IAO

AI: MC will MIREOT mouse

AI: MC or AR Author role is to be moved to IAO

AI: JF and Role branch to coordinate and improve/remove study specimen, study participant and anything else related in light of the new definitions for specimen and sample.

AI: Biomaterial branch to add class 'test tube'


Thursday 11th June

Attending: BP, MC, FG, DD, PL, LS, AR, JZ, SAS, PRS, RS

OBI review: Chris Mungall's review points to a huge discrepancy between the logical and textual definitions.

We need to address style inconsistency in the definitions. The issue is how to make OWL definitions that can be easily stated in natural language.

Logical vs textual definition: the goal of the English definition is to convey to a reader what the term denotes, and to reflect the logical definition as much as is sensible. The goal of the logical definition is to enable data integration. The two will not necessarily match but should never contradict each other.

Top-level definitions have been defined; we only managed to do 'chemical solution', 'organization', 'processed material', 'investigation', 'documenting', 'assay', 'material separation'.

MO and OBI integration: relevant/suitable terms will be moved to OBI; then when a view of OBI is created for the MGED community, then this file will also include additional modules (MO specific).


Actions

Alan: will edit the OBI definition in english

Richard (Foundry Coord): discuss new rules for all ontologies, where definitions are preceded with '=def' for any necessary + sufficient definition

Alan: bring chemical solution discussion to ChEBI

Dirk: to reorganise the wiki pages

Jie: will work with MGED reps in OBI to sort MO terms for inclusion in OBI