Parquet output format #29
Comments
Being able to output as parquet would be nice too, even if it's called twarc-csv :-)
Yeah I'm actually considering a different command as an alias, just for it to make semantic sense / good docs, so these would be the same:
But not sure how useful that is. It'll purely be an alias for a docs entry and for the command line.
I was going to say that pandas has many output formats. It might not be hard to add parquet, pickle, hdf, sql, excel, json, html, feather, latex, stata, gbq, markdown, ... :-) but like you said, figuring out the API is the hard part.
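For illustration, here is a rough sketch of how several pandas writers could sit behind one format option. The `WRITERS` table and the `write_frame` helper are made-up names for this example, not part of twarc-csv; only the `DataFrame.to_*` methods are real pandas calls, and the parquet and feather writers additionally need an engine such as pyarrow installed.

```python
import io

import pandas as pd

# Hypothetical mapping from a format name to the pandas writer method.
# This is an assumption for illustration, not twarc-csv's actual design.
WRITERS = {
    "csv": "to_csv",
    "json": "to_json",
    "pickle": "to_pickle",
    "parquet": "to_parquet",  # requires pyarrow or fastparquet
    "feather": "to_feather",  # requires pyarrow
}


def write_frame(df, target, fmt="csv", **kwargs):
    """Write df to target (a path or buffer) with the pandas writer for fmt."""
    if fmt not in WRITERS:
        raise ValueError(f"unsupported format: {fmt!r}")
    writer = getattr(df, WRITERS[fmt])
    # Extra keyword arguments are passed straight through to pandas.
    return writer(target, **kwargs)


# Small usage example with an in-memory buffer.
df = pd.DataFrame({"id": [1, 2], "text": ["hello", "world"]})
buf = io.StringIO()
write_frame(df, buf, fmt="csv", index=False)
csv_text = buf.getvalue()
```

The dispatch itself is short; the open question from the thread (what the command or option should be called) is untouched by this sketch.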
Yeah - still figuring out that part!
Still haven't figured this out, but for now, you can use Maybe an alias?
or
or
Instead of CSVs, append the parsed dataframes to parquet https://stackoverflow.com/a/47839247/11090908