Some Footywire matches duplicated using update_footywire_stats() #115

Closed
insightlane opened this issue Mar 28, 2020 · 0 comments

Some records returned by update_footywire_stats() are duplicated. The problem seems to be limited to 2019 Round 4 and 2019 Round 6:

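library(fitzRoy)
library(dplyr)

footywire_data <- update_footywire_stats()

# Count rows per player per match and keep only the duplicated ones
footywire_data %>%
  ungroup() %>%
  group_by(Date, Season, Round, Team, Player) %>%
  summarise(count_rows = n()) %>%
  filter(count_rows > 1) %>%
  arrange(-count_rows)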

# A tibble: 792 x 6
# Groups:   Date, Season, Round, Team [36]
   Date       Season Round   Team      Player             count_rows
   <date>      <dbl> <chr>   <chr>     <chr>                   <int>
 1 2019-04-11   2019 Round 4 Melbourne Angus Brayshaw              2
 2 2019-04-11   2019 Round 4 Melbourne Bayley Fritsch              2
 3 2019-04-11   2019 Round 4 Melbourne Billy Stretch               2
 4 2019-04-11   2019 Round 4 Melbourne Braydon Preuss              2
 5 2019-04-11   2019 Round 4 Melbourne Charlie Spargo              2
 6 2019-04-11   2019 Round 4 Melbourne Christian Petracca          2
 7 2019-04-11   2019 Round 4 Melbourne Christian Salem             2
 8 2019-04-11   2019 Round 4 Melbourne Clayton Oliver              2
 9 2019-04-11   2019 Round 4 Melbourne Corey Wagner                2
10 2019-04-11   2019 Round 4 Melbourne Jack Viney                  2
# ... with 782 more rows
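
It may also be worth checking whether the repeated rows are exact copies or only share the identifying columns. The following is a minimal sketch on a made-up data frame, not on the real Footywire data: the column names mirror the output above, while the K column and all values are assumptions for illustration.

```r
# Toy stand-in for footywire_data (values are invented for illustration)
toy <- data.frame(
  Season = c(2019, 2019, 2019),
  Round  = c("Round 4", "Round 4", "Round 4"),
  Team   = c("Melbourne", "Melbourne", "Melbourne"),
  Player = c("Jack Viney", "Jack Viney", "Clayton Oliver"),
  K      = c(10, 10, 12)  # hypothetical stat column
)

# Rows that repeat an earlier row in every column
n_exact <- sum(duplicated(toy))

# Rows that repeat an earlier row on the identifying columns only
key_cols <- c("Season", "Round", "Team", "Player")
n_key <- sum(duplicated(toy[, key_cols]))
```

If the two counts match on the real data, the duplicates are full copies; if n_key is larger, some repeated rows differ in their stats and would need a closer look.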

# Count rows per round to see which rounds are affected
footywire_data %>%
  ungroup() %>%
  group_by(Season, Round) %>%
  summarise(count_rows = n()) %>%
  arrange(-count_rows)

# A tibble: 281 x 3
# Groups:   Season [11]
   Season Round    count_rows
    <dbl> <chr>         <int>
 1   2019 Round 4         792
 2   2019 Round 6         792
 3   2012 Round 1         396
 4   2012 Round 10        396
 5   2012 Round 14        396
 6   2012 Round 15        396
 7   2012 Round 16        396
 8   2012 Round 17        396
 9   2012 Round 18        396
10   2012 Round 19        396
# ... with 271 more rows
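
Until the underlying data is fixed, one possible workaround is to drop the repeated rows after loading. This is only a sketch under the assumption that the duplicates are exact copies; it uses a made-up tibble rather than the real data, and the K column and all values are hypothetical.

```r
library(dplyr)

# Toy stand-in for footywire_data (values are invented for illustration)
toy <- tibble(
  Season = c(2019, 2019, 2019),
  Round  = c("Round 4", "Round 4", "Round 4"),
  Team   = c("Melbourne", "Melbourne", "Melbourne"),
  Player = c("Jack Viney", "Jack Viney", "Clayton Oliver"),
  K      = c(10, 10, 12)  # hypothetical stat column
)

# Keep one copy of each fully identical row
deduped <- toy %>%
  distinct()

# Alternative: keep the first row per player per round,
# even if the other columns differ between the copies
deduped_by_key <- toy %>%
  distinct(Season, Round, Team, Player, .keep_all = TRUE)
```

The first form is the safer choice when the copies are identical; the second silently discards rows whose stats differ, so it is best used only after confirming that they do not.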