Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to get several datasets #3229

Closed
wwdda opened this issue Oct 24, 2024 · 1 comment · Fixed by #3232
Closed

Unable to get several datasets #3229

wwdda opened this issue Oct 24, 2024 · 1 comment · Fixed by #3232
Labels
bug Something isn't working

Comments

@wwdda
Copy link

wwdda commented Oct 24, 2024

Description

Error:"ValueError: Cannot obtain default prediction length from frequency ME." occured when getting 'cif_2016' dataset.
Error:"ValueError: Cannot obtain default prediction length from frequency min." occured when getting 'london_smart_meters_without_missing' dataset and 'wind_farms_minutely_dataset_without_missing_values' dataset.

To Reproduce

from gluonts.dataset.repository import get_dataset, dataset_names
print(f"Available datasets: {dataset_names}")
dataset = get_dataset("cif_2016") # or 'london_smart_meters_without_missing'```

Error message or code output

Screenshot 2024-10-23 at 10 08 17 PM

Environment

  • Python version: 3.10
  • GluonTS version: 0.15.1

Could you please help update the prediction_length_map for 'ME' and 'min' in _tsf_datasets.py?

Thanks!

@wwdda wwdda added the bug Something isn't working label Oct 24, 2024
@wwdda wwdda changed the title Unable to get cif_2016 dataset Unable to get several datasets Oct 24, 2024
@lostella
Copy link
Contributor

lostella commented Nov 5, 2024

@wwdda being fixed in #3232, thanks for submitting the issue!

lostella added a commit to lostella/gluonts that referenced this issue Nov 7, 2024
*Issue #, if available:* fixes awslabs#3229, pandas changes in frequency
strings broke some of our logic.

*Description of changes:* Add missing frequency strings in
_tsf_datasets.py, and get rid of other frequency-related warnings with
other datasets. I tested the change by running the following script:

```python
from gluonts.dataset.repository import get_dataset, dataset_names

skip = [
    "m3_monthly",
    "m3_yearly",
    "m3_quarterly",
    "m3_other",
    "m5",
]

for dataset_name in dataset_names:
    if dataset_name in skip:
        continue
    print(dataset_name)
    dataset = get_dataset(dataset_name, regenerate=True)
```


By submitting this pull request, I confirm that you can use, modify,
copy, and redistribute this contribution, under the terms of your
choice.


**Please tag this pr with at least one of these labels to make our
release process faster:** BREAKING, new feature, bug fix, other change,
dev setup
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants