EDIT: As of 3:25pm Pacific Time, the issues still have not been fixed. It’s not just affecting yesterday’s box scores, but every box score for this season. It looks like we aren’t going to get anything today, but I will keep checking. What worries me is that these issues will persist into the weekend or longer. I won’t have the time to write a backup scraper until Wednesday at the earliest, and it’s a project that could take a day or two. I will be out of town from March 8-12, so if this isn’t fixed, there is a good chance my NCAAB model will go offline until the NCAA Tournament.
The website I scrape my NCAA Basketball box scores from is glitching out like crazy this morning. I am not sure when or if it will be fixed. The problem is that I cannot obtain any box scores from yesterday. Some of you may think that’s not a big deal, but I do not like to run my model against stale data, and missing even a single day of box scores leaves my data stale relative to what the lines say. I can go without a box score or two, but when an entire day’s worth is missing, it is better not to run anything at all than to run against stale data, which can mess up the keys, machine learning algorithms and other objects I would want to trend against.
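For anyone curious what that rule looks like in practice, here is a minimal sketch of a staleness check. It is not my actual pipeline: the function names, the game IDs and the `max_missing=2` cutoff (my "box score or two") are all made up for illustration.

```python
def missing_box_scores(scheduled_game_ids, scraped_game_ids):
    """Return the scheduled game IDs that have no scraped box score, sorted."""
    return sorted(set(scheduled_game_ids) - set(scraped_game_ids))


def should_run_model(scheduled_game_ids, scraped_game_ids, max_missing=2):
    """Return True when few enough box scores are missing to trust the data.

    max_missing is a hypothetical tolerance, not a tuned value.
    """
    missing = missing_box_scores(scheduled_game_ids, scraped_game_ids)
    return len(missing) <= max_missing


if __name__ == "__main__":
    # Hypothetical slate of five games from yesterday.
    scheduled = ["g1", "g2", "g3", "g4", "g5"]

    # One box score missing: still within tolerance, so the model can run.
    print(should_run_model(scheduled, ["g1", "g2", "g3", "g4"]))

    # The whole day is missing: skip the run rather than use stale data.
    print(should_run_model(scheduled, []))
```

The point of checking against the scheduled slate, rather than just counting scraped rows, is that a light day with every game present should still pass while a full day with nothing scraped should not.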
I am definitely looking into paid data for next season to ensure this doesn’t happen again. This is the 2nd time this season it has happened. The first time, it was corrected by 9:30am Pacific Time, but we are nearing that mark as I write this and there is no sign of a fix yet.