-
Notifications
You must be signed in to change notification settings - Fork 89
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement GPAD/GPI cross check so that bad identifiers are eliminated early and do not make it to output #2066
Comments
This is a preventative measure; cleaning bad incoming data is here: #2061 |
Note that we will initially be trying to have this after the 3.10 requirement in ontobio, which we are aiming for the end of the week. If we cannot make that, we'll try some kind of "backport" or other hack to get this done sooner rather than later. |
@mugitty Here are two example
We can work backwards to find the Noctua GPAD lines for these and debug how the annotations are coming through the |
Now testing on |
@mugitty / @dustine32 This looks to be complete now?
Can you confirm that this is reporting as desired? If so, let's go ahead and close this. |
@dustine32 @mugitty Sorry, to bring this up again, but are these being "silently" dropped when emitted, or are these being reported as a GORULE violation somewhere (I was unable to see anything in noctua_zfin.gaf.gz). I tried to find the ontobio code, but couldn't track it back quickly. |
@kltm You know, looking back through the commits in ontobio since we made this ticket, I don't think we ever implemented an explicit "if not in GPI However, I don't see those |
Should we move this to the go-rules project ? |
@pgaudet If is a current ongoing data issue that needs to be solved before release, it would seem to fit here better than the rolling GORULES, from my POV. If it's a non-blocking issue, this can switch over to GORULES. |
sigh I think I now know why we don't see those
Some suggestions to fix:
@kltm More details can be figured out later but for now my brain is broke. |
@dustine32 Let's make sure your brain is cared for and regroup after to figure out the best course. |
When a GPAD and its associated GPI are parsed, a check should occur (GORULE:0000001) and if a matching identifier is not found the annotation is eliminated.
Currently, when a GAF is emitted and there is not a matching identifier from the GPI file in the GPAD, it is passed through but with
taxon:0
.Under no circumstances should taxon:0 be emitted by the ontobio code.
The text was updated successfully, but these errors were encountered: