Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Testing DQ checks: miscellany - Dates #104

Open
Mesibov opened this issue Dec 13, 2015 · 0 comments
Open

Testing DQ checks: miscellany - Dates #104

Mesibov opened this issue Dec 13, 2015 · 0 comments

Comments

@Mesibov
Copy link

Mesibov commented Dec 13, 2015

Tested on 457075 beetle records downloaded 2 December 2015. It's difficult to test these with the download, which is missing a field for date supplied. Even so, some problems are evident.

Invalid collection date ('Unparseable date supplied from the original database')

444822 false
12253 true

Incomplete collection date ('The date supplied was incomplete, missing the day and/or month component')

413203 false
43872 true

There are 4 possible cases:

(1) invalid FALSE, incomplete FALSE
(2) invalid FALSE, incomplete TRUE
(3) invalid TRUE, incomplete FALSE
(4) invalid TRUE, incomplete TRUE

(1) These records should have an 'Event Date - parsed', unless no date details at all were supplied. There are, however, 88 false/false records with 'Year - parsed' and 'Month - parsed' fields occupied, but no 'Event Date - parsed'. Examining just one of them:

http:https://biocache.ala.org.au/occurrences/823f1736-73d9-4122-97bf-768b53331bb5

shows that the original record has the Event Date 1976-12-13, but the record has been processed as

Collecting Date
[date not supplied] Year: 1976, Month: 12, Day:
Supplied date "1976-12-13"

(2), (3), (4) None of these records should have an 'Event Date - parsed', but 3 do. Each has a mistaken 'Year - parsed' entry:

RecordID Year - parsed Month - parsed Event Date - parsed Incomplete Invalid
bbf7b7c0-b65f-4386-b1b6-530187359667 1279 01 1279-01-01 false true
a480ce45-ca53-4657-8c5d-1b44280ca586 0196 01 0196-01-01 false true
99b7100a-bcca-421f-9e7b-a3a2f6b7c5ee 0905 10 0905-10-01 false true

And this Queensland Museum record has an 'Event Date - parsed' of 1668-02-07:

http:https://biocache.ala.org.au/occurrences/97cc70f0-86c1-4833-a992-79d6250159a4

It was flagged 'false' under 'Invalid collection date'. QM says it's a mistake for '1968'.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant