- Added Multiple Reponse Data Sets for SAV files #259
- Fixed pyreadstat not raising error if folder does not exists when writing #269
- Fixed tests for numpy 2 changes # 266
- Readstat sources updated to commit ba4392e9d48c4d997d2737719f4cf6320fb66990 on dev branch
- Fixing warnings in new pandas version 2.2.1 fixes #252
- Remove string encoding for read_por fixes #253
- Fix read in chunks when output is dict, fixes #251
- correcting bug when writing files with character columns with all missing, character length set to 1
- Correcting bug for datetime64[us
- Added modification_time and creation_time to metadata
- added datetime64[us in addition to ns for improvement when writing.
- changes introduced for compatibility with cython 3.0.0. Cython 0.29 no longer supported. Python 2.7 no longer supported.
- added num_rows to multiprocessing to allow processing of xport, por and sav files not having the number of rows in the metadata.
- Readstat source updated to version 1.1.9
- introduced recognition for pandas datatype datetime64[ns, UTC] and other datetime64 types when writing, so that this column type gets correctly written as datetime
- introduced extra_datetime_formats and extra_date_formats arguments for read functions, cleaned the list of sas date, datetime and time formats to exclude those not directly convertible to python objects
- improved performace of writer when there are datetime64 columns
- Fixed #206, #207
- added pyproject.toml
- Adapted setup.py to work with conda forge libraries on windows
- Changed windows compilation to use MSVC 2019. Static compilation for windows, no longer need for dll files.
- Updated Readstat code to Release 1.1.8
- Solved issue with duplicated columns, #188
- Solved issue with no truncation of written files on windows #191
- Solved #183
- new parameter output_type to switch between returning a pandas dataframe or a dict of numpy arrays, partial solution for #184
- Implemented row_compression for writing sav files, solves #185
- Solved #159, #158, #163, #164, #172, #177
- Added new date formats for SAS and SPSS solves #170, #171
- Added check missing_range type against variable type solves #180
- Removed np.float, solves #160
- Updated Readstat source code to Release 1.1.8 candidate 0
- Included new pandas datatypes Float32Dtype and Float64Dtype when writing. Soves #150.
- Forcing cythonization when installing from source solves #151.
- Updated Readstat source to version 1.1.17
- Improved error when reading dates and finding a string (#145)
- repaired not reading correctly metadata with multiprocessing (#146)
- Using os.fsencode when reading file paths to support file system encoding and Path-like objects (#123)
- Updated Readstat source to version 1.1.6, fixes #117
- Accepting dict as the column_label parameter for writing functions.
- added meta.readstat_variable_types to track the type of binary variable in the original file.
- Adding better error message for not accepting certain files types in multiprocessing, solves #105
- Fixing #106 writing failing when row 0 is removed from dataframe.
- Removing numpy deprecated objects solves #107.
- Adding support for pathlib.Path, soves #110.
- fixing order of categories
- Added formats_as_ordered_category to get an ordered category.
- Added value_formats in order to be able to set the variable format when writing.
- Updated Readstat to version 1.1.5, this fixes: reading sas7bdat file labels, reading newer por files date-like columns, and few others.
- Corrected writing columns with numpy.float32 format.
- fixed sharing permissions when reading files on windows, fixes #87
- read_file_in_chunks can read chunks with multiprocessing.
- Improved performance specially for big files.
- added a method to read files in parallel
- added license information to setup.py
- Updated default widths for DATE and DATETIME formats (from Readstat src). That makes the files readable both in SPSS and PSPP, solves issue #69.
- added support for binary compressed sas7bdat files
- fixes writing XPT files v8 that can be read in SAS. Reading XPT v8 improved.
- fixes writing files with NaT values.
- fixes #72: variable display width with long strings.
- fixes #70: variable_measures not set when writing sav files
- updated Readstat to commit 02562413ded25e920b96bbbfc4d87ed062aacec8, version 1.1.4
- updated readstat to commit f867d75af24cb5ca355befc8607e3a9ff1898f15, fixes issue #67
- improved error messages when column names have spaces
- updated readstat to release 1.1.3: improved support for columns with utf-8 characters.
- bumped version to first stable release as the library looks stable
- corrected bug when writing and updating files fixes #17
- updated Readstat source to commit 3a53c088cb622f10a02723bbfd9b91d316995aab: better error message when the file is not a spss file
- added capability to set_value_labels to handle subset of dataframes, solves #58
- fixed bug when writing string columns with missing values (#54 and #55)
- Updated Readstat to commit a71b4e80dae729e236d63d42066e79ca646cd23b (on dev)
- Updated Readstat update fixes #3
- Updated Readstat to Release 1.1.2, this did not fix #3
- added capacity to read and write files with paths starting with tilde
- added pandas as dependency > 0.24.0
- copying windows dlls inside pyreadstat folder in site-packages
- implemented ability to read files in chunks
- implemented write_por
- Readstat sources updated to release 1.1.0
- write user defined missing values, value labels for spss and stata
- write variable_display_width and variable_measure for spss
- breaking change: missing_user_values is now dict instead of set both for reading and writing.
- windows wheels build with appveyor + m2w64-toolchain. Wheels for python 2.7 not produced anymore, instead setup.py raises an error if trying to compile with 2.7 on windows so that pip will raise this error when trying to install from source distribution.
- column_names_to_labels: new member in metadata object, a dict of column names to labels
- setup.py: ordering source files to make the compilation reproducible
- added writing capability.
- MANIFEST.in changed so that windows dlls get into the source distribution, setup.py changed so that those get installed only on windows but not unix.
- Implemented option disable_datetime_conversion for all reading functions in order to skip date/datetime and time object conversion.
- Updated Readstat Sources to Release Version 1.0.1
- Same as 0.2.4, just fixed issues in source distribution so that conda build succeeds.
- encoding support for xpt files.
- support for user defined missing character values in sav.
- Updated Readstat Sources to Release 1.0
- User defined values fixed for sas7bdat, implemented for sas7bcat and dta.
- Readstat sources updated.
- Implemented support for reading international characters in files.
- Implemented user defined missing values for read_sas7bdat. Currently not working for 64 bit files.
- Corrected datetime transformations for stata dta files, sample.dta updated
- cython updated from 26.1 to 29.1
- Readstat source update, latest version as 17.12.2018 (latest commit: b52b77447d293f0fd530aabdab3330909a61bb8d)
- The latest readstat brings: POR files support improved (sample.por updated), sas7bcat support improved, improvements for sav files.
- Implemented usecols to be able to read only selected columns.
- Readstat source updated to fix a bug in usecols, latest version as 17.10.2018 (latest commit: 727bfb87aa332da0df96b77d86694c0e9c464723)
- Implemented user defined missing values (user_missing keyword) for read_sav (only for SPSS files)
- Bug when reading sas7bcat files #4 fixed in readstat #152. Readstat src files updated with latest version as of 12.10.2018 (latest commit: 4c798478ed1eff35ed16c067a3b0b3fd0a9a0561)
- Installation on mac fixed (static linking iconv, #2)
- same as 0.1.8, had to bump the version in order to fix the description in pypi.
- First public release