-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Inconsistencies with POSTEAM #33
Comments
This is an example of duplicated g <- fast_scraper('2000_11_OAK_DEN') %>%
clean_pbp()
g %>%
filter(play_id == 2323) %>%
select(posteam, desc, name)
# A tibble: 4 x 3
posteam desc name
<chr> <chr> <chr>
1 LV (8:49) B.Griese pass to E.McCaffrey to OAK 4 for 2 yards (G.Biekert). B.Griese
2 LV (8:06) T.Davis right guard to OAK 5 for -1 yards (R.Coleman, T.Bryant). T.Davis
3 LV (7:29) J.Elam 23 yard field goal is GOOD, Center-M.Lepsis, Holder-T.Rouen. T.Davis
4 LV J.Elam kicks 53 yards from DEN 30 to OAK 17. D.Dunn to OAK 29 for 12 yards (G.Coghill). T.Davis |
I see. Just ran a quick check and it looks like this issue affects 9 plays across 8 games:
Personally I'd typically be inclined to try and keep as much data as possible with exceptions/workarounds but I don't really know how convoluted that becomes for this case. |
Just looking through the data and after running
update_db
and inspecting the whole dataset I discovered that there are three plays withplay_id = 2323
game_id = 2000_11_OAK_DEN
where thename
is listed asT.Davis
(Terrell Davis) but theposteam
isLV
.I'm not sure what the cause of this is, or if it affects other plays/games.
The text was updated successfully, but these errors were encountered: