Skip to content

Conversation

inishchith
Copy link
Member

Changelog

  • to be added

Additional context (e.g. screenshots, logs, links)

  • to be added

Checklist

  • Additional tests added
  • All CI checks passed
  • Relevant documentation updated

Copyleft License Compliance

  • Have you used any code that is subject to a Copyleft license (e.g., GPL, AGPL, LGPL)?
  • If yes, have you modified the code in the context of this project? please share additional details.

description="Number of errors while writing to files",
)
logger.error(f"Error writing pandas dataframe to files: {str(e)}")
raise
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Bug: Missing Attributes in Output Class

The write_dataframe method in the Output base class, now a concrete implementation, attempts to use attributes like chunk_part and metrics, and calls methods such as path_gen, _flush_buffer, and _upload_file. These members are not defined in the Output base class, which causes AttributeErrors when the method executes.

Fix in Cursor Fix in Web

# Get the generated file path and rename to final location
result_dict = result.to_pydict()
generated_file = result_dict["path"][0]
os.rename(generated_file, consolidated_file_path)
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Bug: Consolidation Fails on Empty Daft Output

The consolidation logic in _consolidate_current_folder assumes daft_df.write_parquet always returns a dictionary with a "path" key containing a non-empty list. If Daft's output structure changes or is empty, accessing result_dict["path"][0] could cause a KeyError or IndexError.

Additional Locations (1)

Fix in Cursor Fix in Web

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants