Workshop OBI Vancouver 2008 Jan 30 notes
Wednesday January 30 Notes
- AI:move hypothesis, dt role, stat factor from role, add under new class specification suggest a new role - specifies? Plan/PA branch
- AI:processes can't bear roles, objective is used instead for e.g. negative control - this has been added to the action item wiki
- AI:BS will send out some notes on logical definitions for instance level relations to relations branch to prevent relation explosion.
- AI:communities to send terms to instrument branch to improve coverage. BB will send an OWL file for BIRN microscopy and imaging
- AI:Edit:suggested edit new class role assignment with group assignment underneath it if PA can be distinct from a Planned Process
- AI:Edit:Defintion of PA defined as being carried out to modify or generate data or materials in an investigation
- AI:discuss within Instrument branch consequences for making artefact object a defined class only to avoid problems where there are biomaterials that are also human generated.
OR can we could keep artefact object and add classes underneath it use this strategy only for biomaterials, as we think all instruments will be artefact objects. Would be weird to have instrument next to protein. MC will communicate to instrument branch. Biomaterial branch are going with the defined class and will use reasoner to place terms in h'archy e.g. under artifact object and some other parent
- AI:use roles for e.g. buffer in (bio)material rather than hierarchy - BioMaterial Branch
- AI: proposal how to define material: material is union of BFO object, BFP object aggregate, BFO fiat object part. Is now the root for biomaterial and artifact which are defined classes and will help OBI be multi granular
- AI:population now has material as a parent and not object aggregate as material is now a union of BFO things
- AI:biomaterial will attempt define these in vivo etc as a defined class and see if this works, these may be at assay level not material level, and these will likely be processes
- AI:DT can do the gating examples as a defined class into the ontology - branch action item
AI:Alan to write up an OBI import/mapping strategy doc as described at the workshop
AI:AR/JM/MC will work through a use case with the cell type ontology import needed for biomaterial and test this strategy and report on the results. This will be an OBI application ontology as soon as we start bringing into OBI
AI:develop a strategy for dealing with the OBO foundry ontologies and OBI application ontology if import works
AI:Add imports and all related issues as an agenda item for the next workshop SS
Barry Smith Role, Relation
- Time - OBI should deal with only biomedical investigations. Numbers, Time are outside OBI
- Information Ontology will be worked on by Alan and Barry
- Roles
* if there is a red curtain in the theatre, the redness is not a role in this context * if a grid is split into 1km squares, km is not a role * need a word for reln between parameter, variables, and the investigation that uses these in PA - BS suggests SETTING as these are not roles like patient role
MC:you say that parameter, hypothesis, objective will live in Protocol. If you modify param during PA, is that a new PA?
BS:No, PA are changeable. If we set voltage outside param we need to ammend the protocol, maybe create a new one
BP:We always said that the we are looking at the end of the PA, and modify Protocol.
BS:In a protocol have a spec of a parameter, realize info entity, gets realized in a 2 stage process, when experimenter adjusts machine, and then when the machine runs with that setting.
BS:also state what will be measuring in the protocol
JF:what about the drug dosage
BS:same as a protocol setting
HP:what about the variables relating to the materials used in the protocol
PRS:this is still part of the protocol, similar to the inclusion criteria for a study. I think is part of the protocols
BS:may be part of protocols that are applied, not nec. specified in the chain.
HP:was thinking more about relation to experimental design, and how that is represented
JW:for me it's characterizing state of the input and the output
BS:agree, but inputs have choice
JW:where there are variable where these are not factors need to record these as well
RS:Think this deals with a lot of issues, analyte, factor, you are talking about kinds of factor now - settings, variables, when you realize these they become qualities of the instrument, the evaulant etc
AR:can be a process, can be a disposition
BB:includes serial no of the device, lot no of the reagent
BP:how implement? we have protocol, we need specification, and a relationship
BS:we need a partonomic analysis of protocols, some will be optional some will be necessary
AR:was hoping that we could use relations rather than rebuilt the h.archy
BS:if we make is specification, makes it easier as is realizable info entity
BP:this would be parallel to how we use role, tree for specifications
RS:now we need to deal with PA and investiagtion. Variables are spec at the level of investigation, as at the PA level dealing with analytes.
BS:this is an old issue. I think simple soln is that protocol doesn't specify all in planned processes, if you add all together gets specified.
BP:plan has an objective and specifies instructions.
BS:someone needs to create an anatomy of a protocol so we know which bits are optional
JW:where does this deal with the characterization of input and output
BP:new relationship that allows us to talk about specifications that do not come to reality
PRS:do we just need a relation specifies
AI:move hypothesis, dt role, stat factor from role, add under new class specification suggest a new role - specifies? Plan/PA branch
RS:we specify that we need to use organ liver - so we'll have an organ specification and will need to point to other ontologies
BS:yes, and for data format that would be also specified here.
RS:and that's where you point to other ontologies
BS:dealt with all cases where processes are bearing roles now, and resolved that
RS/BP:Sham surgery has a role
AR:bees dancing
BS:continuants have optional qualities, can lose parts, occurents have all their qualities and parts, can't lose or gain. If a process goes faster it's not acquired a new feature.
BP:bees dancing, communication is a planned process, objective is to send a message. Mock surgery is a planned process where the objective is to have a control
BP:This is in objective, we tie a process and the why in objective
BS:the process is not taking a role
AI:processes can't bear roles, objective is used instead for e.g. negative control - this has been added to the action item wiki
PRS:if objective is realizable entity then is a cardinal part of a protocol,
BS:as a specification
BP:all protocols have an objective.
BS:plans have constant and variable parts. Some things are left open. True of all parts and the whole plan. Objective will have some parts that are specific and some that are left open. The fact that plans are structured with constant and variable, we might specify coarse and fine objective e.g. cure for cancer, vs test a response to a drug in mouse liver. When we say objective, this is an info entity, the mult descriptions are possible.
PRS:want to be 100 sure. Objective is a planned process?
BS:objectives exist in plan and the realized objective. Need label these.
AR:objective, and achievement, or outcome?
BS:planned objective and realized objective?
PRS:sailing a boat, goal is to sail to India, I find USA. I don't realize the objective - what have I done
BP:you revise the planned objective
AR:Plan and Protocol now merged in to a single file
Roles:
* RO has class and instance level relationships. * OBI needs instance and instance_type relationships * uniquely_identified is needed for the serial number instrument relationship * issue with this relationship and URI as used by the semantic web needs to be resolved * some instance level relations are primitive, some can be defined, not addressed in RO paper * OBI needs to define it's instance level relns in terms of simpler instance level relns and this has not been done in logical statements. * don't use names of related things in the relation name AR:thinks this is a consistency check and useful
AI:BS will send out some notes on logical definitions for instance level relations to relations branch to prevent relation explosion.
Melanie Courtot (for Daniel Schober), Instrument Branch
- OWL file up to date
- terms submitted are included
- finishing meta data clean up
- minor issues
- poor coverage, mostly flow cytometry
- issue with manufacturers and how relate to branch content
- expecting NMR, chromatography, NMR not in first draft
- current content=first working release
- need to discuss how import OWL files and what are the constraints
- consensus will be granular only to support specific use cases
- role and function confusion resolved y'day will be summarised
- labware -
- issues with how to represent by function
- how restrict use to instance level relations
- new relations requested
- commercial terms - where store?
- settings - now part of specification/PA
AI:communities to send terms to instrument branch to improve coverage. BB will send an OWL file for BIRN microscopy and imaging
BB:for BIRN we have microscopy and some imaging, we have all the meta data. If I submit as OWL would that be useful?
RB:We may not need to model in obi unless there is a query use case
RS:how would deal with a ruler? Or a pipette - a container
PRS:re consumables, another distinction, how deal with microarrays - used 1x. Is that an instrument or a reagent, where reagent is a role
MC:was thinking of using aggregate object so that composite material/instrument plays a role. Have role instrument, reagent etc
PRS:Instrument is a material with a role of instrument -
MC:under discussion also for biomaterial
AI:need a relation to connect PA to instrument and function and to work through how to represent that. LF, MC, BP
BB: how connect instrument to parts?
MC:using defined classes
BB:issues with part vs whole functions and inference
RS:parts of instruments can be parts, where is the line between instrument of part and instrument
MC:solved with functional parts, likely using these. Laser can be used as laser or as a part of.
RS:what part of an instrument has no function -
RB:the parts that no function in the experiment will not be modeled
Bjoern Peters, Protocol Application Branch
- specification etc needs adding from today's discussion
- worked on material transformation a lot (was biomaterial transformation)
- focus on text definitions, not much on OWL definitions as we need role and plan and they weren't ready
- meta data is good
- problem, induction of natural processes e.g. methylation, how acheived. objective is clear
- need a link between natural process to some other obo process
- have a conflict of labels for some classes
- conflict of labelling for e.g. cloning, overlapping terms in different ways - simply container classes now for non orthogonal terms
- Invitrogen schema, most is in PA, and needed to OBI-ize it, was a great schema took some resolution for PA
- material transformation complete
- assay incomplete
- substance detection - uses analyte - now problematic, here we'll have lots of things where can detect any quality, not exhaustive now. Need to add 'as the goal of'
- propose high level grouping of assays e.g. measurement of material quality, detection of label, etc. Issues with how separate DT from these assays, there is a dt but reporting on the process is natural, reporting as a dt is not natural. Proposal to group by what is the ultimate data output. Not clear yet if will work for all assays.
- planned processes need addressing : enrollment is needed, is not a data transformation, or a material transformation, we need the term, but would like to use planned process. We have things that are 'branchless' - should all go under planned processes. We could add more things here. Objective and specification will help
- group assignment and enrollment don't belong under the PA header. BS suggests remove PA as a subclass and move up h'archy. these are planned processes in which people are assigned roles.
- material transformation in PA or biomaterial transformation ? BP:sample or reagent transformations looks good
RS:when was biomaterial->material transformation.
BP:was a branch decision, Frank insisted
RS:do we need to link to a natural process? I can see why want to say methylation, but the natural process
BP:here because the labels conflict with processes outside obi like reverse transcription
RS:would call it generating making a molecule in a specific way. Suggest use as a chemical process
BB:and point to reverse transcribe
RS:you don't do DNA repair in a PA, you modify it. Not happy
BP:Ok these are as submitted
HP:labels are just labels, useful part in the definition
RS:induction of a natural process bad
BP:is a placeholder class, and we need to fix things here
BS:what would we substitute?
BP:back into higher level class, just a problem label
BS:deliberate methylation perhaps? or planned methylation
RS:lots of things mimicking natural process that are not under induction of natural process
BP:no GO term cell co culturing - maybe I need to change the label
RS:concern re h'archy
BP:there is no h'archy, we couldn't get to one, we focussed on defining things and later work on a h'archy, there is a bit of h'archy.
RS:if use an enzyme as a reagent you can talk about function, resist linking to processes
HP:suggest point at EC numbers rather than point at a GO process to avoid confusion
BS:worried about cloning, we have transformation_of in RO. Where have a thing that generates another thing, this is not transformation, is merging, or fusing
BP:we need to define were start with one or more, and end up with an output where multiple things might have happened
BS:cloning is not just material transformation as give rise to an organism
RS:merging and splitting are in material transformation
BS:then is simply a label issues between the class name and the RO name. That's OK but need to be aware of it and make the definition explicit
RS:many processes are composite processes and that's why things are hard
HP:but data is acquired that way. maybe need to use defined classes for common composite processes maybe even in a given domain
RS:care when say label - you don't detect the label, detect the photons, LIMS may care
BP:they can report specifically if needed. can use nesting for intermediate values e.g. measurements that contribute to the outcome
RB:we want OBI released, how far are you from functional PA
BP:in a better stage, is useable all the way for 2 use cases
RB:in next step for h'archy how long
BP:we have added general terms, and very specific terms for 2 assay. In next 4 weeks proof of prinicple for other use cases
RS:we have terms can use today. When you add to a h'archy will that affect us, we will still be OK. Things will just move
BP:h'archy is defined by what say, define an assay for e.g.
RS:I want PATO to define high level qualities does it do that?
BP:is unintuitive for me
RS:rate could be a general quality of a process
BP:h'achy could change if assay doesn't work in some cases
HP:seems to me that this will work, I can use assay as it stands I think
BP:text def says that there are mult steps in the assay
PRS:should that be defined here, or in the protocol?
BP:this is in the spirit of realizing a protocol, if you want to talk about steps, you can say all the parts, e.g. labelling is a material transformation
PRS:we have not finished PA and Protocol relation, BS told us that we need to determine the parts. I think we need to do that here
BP:The definition here is more generic, in terms of analyte, would change to another representation.
RS:where we talk about composite PA, even if there are sub parts material transformation and data transformation
BP:yes.All have some sort of dt, summing etc is a dt
PRS:perform a transcription assay, gene expression, I declare a protocol purpose is to detect mRNA<, material is a microarray, role consumable, utilizes some instruments.
BP:you just care about the assay? We don't have transcription assay yet. We need to add that Could define some assays.
RS:quality of rna in an evaluant, then that's a general class in OBI. When you do your application you are more precise about consumables and instruments when you instantiate
LF:NMR assay definition You are grouing assays on instrument
BP:not yet done, yes. We can classify as alt. h'archy. We need one assay h'archy and we went this way
RS:h'archy is based on types of measurements are made, ie the evaluant
BP:I would propose that, but you can define in other ways e.g. instrument manufacturers might want that
RS:I want to group all assays that measure virulence or function of a T cell
BP:also for our use case.
JF: do we need specific protocols to assign to a role, or a generic role assigment one?
AR:need both.
HP:could make a generic parent and then assign existing children
AR:relevant also to data processing, e.g. randomization of the data
AI:Edit:suggested edit new class role assignment with group assignment underneath it if PA can be distinct from a Planned Process
AI:Edit:Defintion of PA defined as being carried out to modify or generate data or materials in an investigation
AR:moves it up to a higher level, and that doesn't help
BP:I would like to move acquisition, up to planned process - things that come from outside a protocol application
HP:these are things that are not created by your protocol applications in your investigation, This is clear and I agree on planned process placement. Data collection is a bad name, confusing with the label with other parts of dt
Biomaterial Branch - Melanie Courtot
- OWL file is up to date
- term clean up in progress
- issue with def of biomaterial - new def added, material is BFO object
- BS suggest have material - as need biomaterial which has non bio parts
- need to allow to disambiguate biomaterials, artefact objects and instruments - use role and have biomaterial as a defined class
- action item for MC to take to instrument branch
- genetic info content is a problem - suggestion to send to sequence ontology
- issues with importing/mapping from outside ontologies
- what is a buffer, AR :a material with a role
- what is a mixture. we think population is aggregate
- artifact and natural AR:could use roles BP: also need for organism derived, or is derived from a planned process - mating can be a planned process for e.g. We will use artefact object instead. Need artfact object as a defined class for this to work, but we like the solution
- in vivo, in vitro, ex vivo etc BP:only true for cells, tissues in general things inside or outside the body hard for proteins. These are qualities of some biomaterials and need to restrict how is applied.
HP:do we lose artefact object?
AI:discuss within Instrument branch consequences for making artefact object a defined class only to avoid problems where there are biomaterials that are also human generated.
OR can we could keep artefact object and add classes underneath it use this strategy only for biomaterials, as we think all instruments will be artefact objects. Would be weird to have instrument next to protein. MC will communicate to instrument branch. Biomaterial branch are going with the defined class and will use reasoner to place terms in h'archy e.g. under artifact object and some other parent
AR:workaround, have an owl file, user interface promises that has the subclass axiom, so that these can be seen during development, but not for distribution.
AR:suggest that we only add mixtures that we care about, and not add general concepts
AI:use roles for e.g. buffer in (bio)material rather than hierarchy - BioMaterial Branch
AI: proposal how to define material: material is union of BFO object, BFP object aggregate, BFO fiat object part. Is now the root for biomaterial and artifact which are defined classes and will help OBI be multi granular
AI:population now has material as a parent and not object aggregate as material is now a union of BFO things
BP:e.g. manufactured protein, have proteins, these are materials determined by content, manufactured ones use the reasoner - classified under material and artifact object
BB:seems obvious that in vivo/in vitro etc need to be defined classes based on some properties
AI:biomaterial will attempt define these in vivo etc as a defined class and see if this works, these may be at assay level not material level, and these will likely be processes
BP:maybe attached to objects or processes as well as biomaterials.
RB:seems like we can test this
BS:these are locations of processes
BP:you'd define these as the environment of the cell or tissue then
AR:in vivo is to do with the participant
BP:cell in a liver measuring some metabolite, if it's ex vivo this experiment was ex vivo
HP:does this belong on assay, in vivo, ex vivo, etc.
BP:nothing else has used environment
James Malone - Data Transformation
- Cambridge Workshop summary, see wiki
- Done a branch review to look at complete terms corrected curation status
- 139, 51 complete - and h'archy OK
- we have some cases where we took a sensible view on suffixes
- chris S will do our external branch review
- use case docs next
- had to use roles to avoid multiple inheritance, will now use objective
- we used has_input and has_output - there are no logical definitions yet. these are future work
- people think we need to connection to algorithm and logical definitions of input and output
BB: is what's here partly the outcome of a microarray meeting
AR:dye swap merge is not mathematical, could logically define in terms of the data that is information entity
RS:this is merging replicates and is placed
BS:moving average -> moving average transformation or calculation
RS:still want to do gating
BP:there is a specific dt in e.g chromium release assay. for me natural definition would be in terms of the assays.
RS:we will run into the same issue with data transformations, and we can say what the inputs and outputs are without assay
AR:we can write owl statements for e.g. some of the terms. Some of these are composite, we can catch the commonalities.
BP:background correction is a process - and there is also an objective -
RS:issues with roles so we didn't do anything with some terms, and now the h'archy is flatter - we can go back to more h'archy
PRS:there are still some that are compound terms that include method and purpose e.g. false discovery rate. e.g. Box Cox is refers to the method from Box and Cox, most point to a function and get a base and need a parameter to get a function. If we go down the line of fixing qualification and a method.
RS:what is the difference? SOme things are well defined by the function and others are more complex. Things that were small and well described are based on function, in other cases we refer to a function
PRS:then we need to resolved the case where I need to an FFF and Tukey Bi weight correction, I need to find a property that I can set.
HP:there is no connection yet to algorithms anyway
BP:if you describe all steps along the way - people care about the outcome.
PRS:we need to provide the means to capture which algorithm, you just need a way to specify things.
BP:we just need to deal with that somehow
AR:inputs and outputs needs to be dealt with, or the objectives
RS:we can describe the output but not the objective
RS:runs through the gating example and dimensionality reduction
BP:this is not about matrices, this is about vectors, and projection of vectors
RS:OK
MC:move gate as a child of partitioning method
AR:no is a defined class in context
RS:we need a generic thing though like a vector partition
AR:you'll try to factor the neutral stuff, and make composite logical definitions
HP:if we have convincing use cases
BP:should avoid matrix terms.
RB:data were cells and vectors, and gating is a complex thing that uses terms everywhere
BB:if this terms to be practical send the maths things to Ivo and they might get on with it
AR:not the next thing that you do
RB:you can test gating context if you can define in terms of what is the input data
BP:some maths things will not have an objective, some that are tied to assays can be tied to an objective. for sine etc there should not be an objective, should be handled elsewhere. for an expt purpose then we should consider that
AI:DT can do the gating examples as a defined class into the ontology - branch action item
Environment Ontology Discussion - Susanna Sansone, Barry Smith
- GAZ - open source gazeteer, paris etc
- ontology - mountain, cave, bedding, all place types where organisms can live, should include the gut
- gaz - country names all place names that were ever used e.g. parts of countries that no longer exist, envo - reef
- Envo doesn't do natural qualities - PATO does that
- 1110 terms, in use by several organizations already
Ontology Imports - Alan Ruttenberg
- Alan outlines an import strategy
AI:Alan to write up an OBI import strategy doc as described at the workshop
JM:How does this look? Sep file
AR:yes, one external, one external derived, run a script to get parts of the info
JM:can see how could work, how it scales. Concern if we refer to a lot of other ontologies?
AR:could create externals by script if scaling issues
CK:could do with Alan Rector's methodology
AR:for some additional info e.g. pref term, may need to translate and will need to use the derived file fr that
HP:what's the next step - create an external file.
AI:AR/MC/JM will do a test with CTO into BioMaterial
CK:why do we need people curating it? Could do on fly
AR:not all URIs tell where a term is due to issues. Need to say is an instance of owl thing or a class. or may need to define domains and ranges so we might want to make assertions in OBI.
SS:we will create an owl file in all cases? We need to create cross product ontology and we can use OLS to do that. We would like OLS to serve this.
AR:this is a cross product ontology
SS:this we do when we need it. OBI will be pure, when you want for your application you import?
AR:no we will get things and they will live in OBI, and we will release that
BS:another use for importing, OBI to some community - you know they need some chem terms, you can do that
AR:yes you can package any way you want
SS:how will you have this packaged up for all
AR:there will be one file called externals and people can edit themselves
HP:I think we can build a default
RS:I thought a community would build their own, but that's not part of OBI
AR:what need, will be more critical what they get from an ontology. This is just a way to get external terms
RS:when we need an external term in OBI wasn't designed to build cross products
HP:concern about branding this into an OBI when we get all the other ontologies and saying this is OBI
AR:we could get a slim
RS:we should be getting a slim for an application ontology
SS:this is an obo foundry process
HP:this is an application ontology and we use it as a test, this is not OBI,
BB:like this, politic way as we can decide what we support. If we have obi and obi application, what would OBI be without these.

