Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a GO-CAM -> TSV file for end user consumption #2008

Open
1 task
kltm opened this issue May 18, 2023 · 10 comments
Open
1 task

Create a GO-CAM -> TSV file for end user consumption #2008

kltm opened this issue May 18, 2023 · 10 comments
Assignees

Comments

@kltm
Copy link
Member

kltm commented May 18, 2023

Create a GO-CAM -> TSV file for end user consumption.
This could fill the ecological niche of our previous SIF effort.

TBD:

  • is this a one-off, or is this a future pipeline product?

Tagging @dustine32 @balhoff

@kltm
Copy link
Member Author

kltm commented May 18, 2023

Also tagging @pgaudet

@kltm
Copy link
Member Author

kltm commented May 18, 2023

Noting for @balhoff, technically, not having the SPARQL endpoint or store involved (i.e. a file-to-file transformation) would likely be easier to reuse and recreate. (Dealing with the endpoint complicates things and the blazegraph is--or should be--a moving target right now.)

@balhoff
Copy link
Member

balhoff commented May 19, 2023

@cmungall what are the desired columns? Something like this (using noted property path)?

  • subject
  • subject_activity (?mf enabled_by s)
  • subject_location (?cc ^occurs_in/enabled_by s)
  • subject_process (?bp ^part_of/enabled_by s)
  • relation (?s ^enabled_by/?rel/enabled_by ?o)
  • object
  • object_activity
  • object_location
  • object_process
  • model_iri

Taking this model as an example: http:https://noctua.geneontology.org/editor/graph/gomodel:645d887900000758?

For hormone activity enabled by BGLAP, would we only include the brain development process, and not the other two which it's part of?

@kltm kltm added this to In progress in GO-CAM Enrichment May 22, 2023
@kltm
Copy link
Member Author

kltm commented May 22, 2023

@balhoff Curious: would end users be wanting URIs or CURIEs? Guess it depends on the audience?

@balhoff
Copy link
Member

balhoff commented May 22, 2023

@kltm just an information example for now. We can do curies in the end. But I want to check that I'm pulling out the right stuff.

@cmungall
Copy link
Member

For hormone activity enabled by BGLAP, would we only include the brain development process, and not the other two which it's part of?

I think having a pipe separate list for this should be fine (it should always be a tree structure and hence the list can always be interpreted as a chain)

It may turn out that this is overkill and there is not so much information in nested part-ofs. Looking at that model, one of the paths is clearly wrong (unless neurotransmitters can think). But as a first pass, having this be transparent is a great way for us to easily spot check some of these

@balhoff
Copy link
Member

balhoff commented May 23, 2023

@cmungall specifying the order of transitive part-ofs might require a different approach; I'm using SPARQL right now, nice and simple but property paths don't capture that. If we're okay with interpreting as a bag of relevant terms then we can use the property paths.

@balhoff
Copy link
Member

balhoff commented May 24, 2023

With some BP and CC aggregation (unordered):

?gp1 ?mf1 ?cc1s ?bp1s ?relation ?gp2 ?mf2 ?cc2s ?bp2s ?model
UniProtKB:P02818 GO:0005179 GO:0005576 GO:0007420,GO:0050890,GO:0001956 RO:0002413 UniProtKB:Q5T848 GO:0004888 GO:0005886 GO:0007420,GO:0050890,GO:0001956 gomodel:645d887900000758
UniProtKB:P38435 GO:0008488 GO:0017187 RO:0002630 UniProtKB:P02818 GO:0005179 GO:0005576 GO:0007420,GO:0050890,GO:0001956 gomodel:645d887900000758

@balhoff balhoff self-assigned this May 26, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
GO-CAM Enrichment
  
In progress
Development

No branches or pull requests

3 participants