Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

support nanosecond precision values in TimeDtype #19

Open
1 of 3 tasks
tswast opened this issue Sep 30, 2021 · 3 comments
Open
1 of 3 tasks

support nanosecond precision values in TimeDtype #19

tswast opened this issue Sep 30, 2021 · 3 comments
Labels
api: bigquery Issues related to the googleapis/python-db-dtypes-pandas API. priority: p3 Desirable enhancement or fix. May not be included in next release. type: feature request ‘Nice-to-have’ improvement, new feature or different behavior or design.

Comments

@tswast
Copy link
Collaborator

tswast commented Sep 30, 2021

datetime.time only supports microsecond precision.

Similarly, the BigQuery TIME data type only returns microsecond precision.

The pandas convention is to use nanosecond precision for timestamps via the pandas.Timestamp class. It may make sense for this extension dtype to also support this.

Subtasks:

  • Preserve nanoseconds when we are given them in the data (e.g. from numpy array or pyarrow array).
  • Add unit as an optional argument in the dtype constructor, similar to https://github.com/pandas-dev/pandas/blob/1c986d6213904fd7d9acc5622dc91d029d3f1218/pandas/core/dtypes/dtypes.py#L777 -- Note: maybe unnecessary since we always store as nanoseconds, but could be useful as a box/unbox hint (see below)
  • Return / accept something other than datetime.time that supports nanoseconds instead of only microsecond precision. This is similar to datetime.datetime versus pandas.Timestamp.
@tswast tswast added type: feature request ‘Nice-to-have’ improvement, new feature or different behavior or design. priority: p3 Desirable enhancement or fix. May not be included in next release. labels Sep 30, 2021
@product-auto-label product-auto-label bot added the api: bigquery Issues related to the googleapis/python-db-dtypes-pandas API. label Sep 30, 2021
@jbrockmendel
Copy link

Could wrap TimedeltaArray and disallow anything not between 0 and 24H

@tswast
Copy link
Collaborator Author

tswast commented Nov 4, 2021

#39 makes some tiny progress towards this. If going from arrow -> pandas -> arrow, nanosecond-precision times should be preserved.

@tswast
Copy link
Collaborator Author

tswast commented Dec 3, 2021

Could wrap TimedeltaArray and disallow anything not between 0 and 24H

That's more-or-less what we're doing already. We store the right precision data, but when folks access individual rows, they get back a datetime.time which doesn't have a nanosecond field. Need something similar to pandas.Timestamp.

@tswast tswast changed the title support nanosecond precision values support nanosecond precision values in TimeDtype Nov 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
api: bigquery Issues related to the googleapis/python-db-dtypes-pandas API. priority: p3 Desirable enhancement or fix. May not be included in next release. type: feature request ‘Nice-to-have’ improvement, new feature or different behavior or design.
Projects
None yet
Development

No branches or pull requests

2 participants