Bug when preparing data for finetuning #122

marcopeix · 2024-09-11T17:57:06Z

I've run into a bug that I can't fix when trying to prepare a dataset for finetuning.

Here's the code:

def data_generator() -> Generator[dict[str, Any]]:
    yield {
        "target": df['Weekly_Sales'].to_numpy(),
        "start": df.index[0],
        "freq": pd.infer_freq(df.index),
        "item_id": "1",
    }

features = Features(
    dict(
        target=Sequence(Value("float32")),
        start=Value("date32")),
        freq=Value("string"),
        item_id=Value("string"),
    )

hf_dataset = Dataset.from_generator(data_generator, features=features)

hf_dataset.save_to_disk(Path("sales_dataset/"))

df = hf_dataset.to_pandas()

df.to_csv('sales_dataset/sales_data.csv', index=False)

Then, when I run python -m uni2ts.data.builder.simple sales_data sales_dataset/sales_data.csv --offset 40 --dataset_type long , I get the error:

IndexError: index 0 is out of bounds for axis 0 with size 0. Not sure why that happens, as my df is not empty, and the .csv is not empty either.

Here's the CSV I'm using: https://raw.githubusercontent.com/marcopeix/FoundationModelsForTimeSeriesForecasting/main/data/walmart_sales_small.csv

I'm only using data for Store==1 (143 rows of data) and the first three columns only (Store, Date, Weekly_Sales). Prior to running the function, I set the index as the Date column.

What am I missing?

The text was updated successfully, but these errors were encountered:

gorold · 2024-09-30T03:29:49Z

didn't look too deeply into this, but I'm guessing it's due to the format (column names) of your data frame?

uni2ts/src/uni2ts/data/builder/simple.py

Line 58 in 2ba614d

item_df = df.query(f'item_id == "{item_id}"').drop("item_id", axis=1)

chenghaoliu89 · 2024-12-04T07:54:27Z

Hi @marcopeix, have you solved this issue? If so, I will close this issue

marcopeix added the bug Something isn't working label Sep 11, 2024

chenghaoliu89 mentioned this issue Dec 4, 2024

Bug when trying to prepare custom dataset for finetuning #102

Closed

marcopeix closed this as completed Jan 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug when preparing data for finetuning #122

Bug when preparing data for finetuning #122

marcopeix commented Sep 11, 2024

gorold commented Sep 30, 2024

chenghaoliu89 commented Dec 4, 2024

Bug when preparing data for finetuning #122

Bug when preparing data for finetuning #122

Comments

marcopeix commented Sep 11, 2024

gorold commented Sep 30, 2024

chenghaoliu89 commented Dec 4, 2024