Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GORULE_0000001 : extension errors cause lines to be filtered - we'd like to change that behavior #2334

Closed
hattrill opened this issue May 23, 2024 · 11 comments
Assignees

Comments

@hattrill
Copy link
Contributor

hattrill commented May 23, 2024

When there is an issue with an extension e.g. like I have caused by using the ID instead of the relation name in this annotations:

ERROR - Syntax error in annotation extension field:extensions should be relation(curie) and relation should have corresponding URI: FB FBgn0034841 atlas is_active_in GO:0005634 FB:FBrf0251275|PMID:34478447 IDA C atlas CG13541|atl protein taxon:7227 20211012 FlyBase BFO:0000050(CL:0000019)

Could you "fix it" by stripping out the extension rather than dropping the whole annotation, as that bit of info is still valid (whilst still flagging it up)

(ps I will fix this particular issue for future releases)

@pgaudet
Copy link
Contributor

pgaudet commented Jun 11, 2024

My inclination would be to make this a WARNING rather than a FILTER, and leave the extensions as-is.
Other option would be as @hattrill suggests, to filter out the information in the extensions (but then we may end up with redundant annotations. Not sure that's a problematic issue, but it would be confusing.)

@pgaudet pgaudet changed the title GO_rule: fix when issue in extension by dropping extension GORULE_0000001 : extension errors cause lines to be filtered - we'd like to change that behavior Jun 11, 2024
@mugitty
Copy link
Contributor

mugitty commented Jun 11, 2024

@pgaudet and @kltm,
I looked at the code and we cannot leave the extension as-is. I could remove the extension, output a warning and not filter the annotation. However, this may lead to redundant annotations as @pgaudet suggested.

@kltm
Copy link
Member

kltm commented Jun 11, 2024

@mugitty Just to clarify, what is the destructive part of the code that doesn't allow the original to pass through?

@mugitty
Copy link
Contributor

mugitty commented Jun 11, 2024

@kltm, the extension string is parsed into a conjunctive set

@mugitty
Copy link
Contributor

mugitty commented Jun 24, 2024

@kltm, do you want me to implement a temporary fix to remove the extension if it is invalid? We can create a ticket to add the check once @hattrill updates the extensions

@kltm
Copy link
Member

kltm commented Jun 24, 2024

@mugitty I think this may be a @pgaudet question. Above (#2334 (comment)), there is a desire to leave this as a WARNING and pass it through. I recall, however, that that was not possible due to reconstructing the annotation from the internal model?

@mugitty
Copy link
Contributor

mugitty commented Jun 24, 2024

@kltm, you are right. We cannot keep the 'problem extension' and flag as a warning. We will have to remove the problem extension and flag as a warning. This may result in redundant annotations as @pgaudet suggested.

@kltm
Copy link
Member

kltm commented Jun 24, 2024

@mugitty I think we need @pgaudet here: this is policy rather than mechanism. I would point out that the question of whether or not there would actually be redundant annotations created by striping is (as far as I know) a possibility, rather than a certainty.
If we were wanting to make more changes, a literal copy of the initial string in the column could be kept and used if this condition was hit. That said, I'm not sure that this is a priority, as there is a fix coming from upstream anyways and this was purely meant to bridge the upstream data error.

@hattrill
Copy link
Contributor Author

Would be nice to have a fix here - I think that the number of redundancies from stripping out bad extensions would probably be very small and won't be a problem for enrichment, etc. Where as the absence of annotations is potentially problematic for us e.g. we can't use them in Noctua.

We should have a new release out today so the GAF will be updated on our ftpsite. However, I fear this will take a long time to filter through as there has just been a GO release - unless there is a way to 'inject' the new set indep. of release.

@kltm
Copy link
Member

kltm commented Jun 25, 2024

@hattrill Do you pick up the GAFs for further use at your end? Otherwise, as we generally direct people to the "releases", whether it's fixed in your data or patched over in software, we'd still be looking at the next release for things to get to downloads and AmiGO.

@hattrill
Copy link
Contributor Author

Hi @kltm we don't pick up our GAFs from GO. It would have just been good for the fix to have happened automatically rather than having to wait for the next GO release for GO to pick up our fixed GAF.

Other fixes happen automatically and there is no check for potential redundancy issues there.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Development

No branches or pull requests

4 participants