sas7bdat

A sas7bdat file reader for Python

Development Status
- 5 - Production/Stable
Environment
- Console
Intended Audience
- Developers
- Science/Research
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
Topic
- Text Processing
- Utilities

Project description

sas7bdat.py

This module reads sas7bdat files using pure Python (2.6+, 3+). No SAS software required! The module started out as a port of the R script of the same name (https://github.com/BioStatMatt/sas7bdat) but has since been completely rewritten.

Also included with this library is a simple command line script, sas7bdat_to_csv, which converts sas7bdat files to csv files. With the --header option it prints header information and metadata, and it can batch convert files as well. Use the --help option for more information.
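For readers who want the same conversion from their own code, here is a minimal Python 3 sketch built on the standard csv module. It is not the implementation of sas7bdat_to_csv; the helper names write_rows_as_csv and convert are hypothetical, and the only library behaviour assumed is what this page describes (a SAS7BDAT reader that is iterable and usable as a context manager).

```python
import csv
import io


def write_rows_as_csv(rows, out):
    """Write an iterable of row lists to the text stream `out` as CSV.

    Returns the number of rows written.
    """
    writer = csv.writer(out, lineterminator="\n")
    count = 0
    for row in rows:
        writer.writerow(row)
        count += 1
    return count


def convert(sas_path, csv_path):
    """Hypothetical helper: convert one sas7bdat file to a CSV file."""
    # Imported here so write_rows_as_csv works without the library installed.
    from sas7bdat import SAS7BDAT

    with SAS7BDAT(sas_path) as reader, open(csv_path, "w", newline="") as out:
        # Without skip_header, the first row holds the SAS variable names,
        # so it becomes the CSV header line.
        return write_rows_as_csv(reader, out)


# Demonstration with in-memory rows standing in for a reader.
buf = io.StringIO()
n = write_rows_as_csv([["name", "age"], ["Ann", 34.0]], buf)
csv_text = buf.getvalue()
```

Keeping the CSV writing separate from the file handling makes it possible to try the logic on plain lists before pointing it at a real data file.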

As is, I’ve successfully tested the script on almost three hundred sample files I found on the internet. For the most part, it works well. We can now read compressed files!

I’m sure there are more issues that I haven’t come across yet. Please let me know if you come across a data file that isn’t supported and I’ll see if I can add support for the file.

Usage

To install, run:

pip install sas7bdat

To create a sas7bdat object, simply pass the constructor a file path. The object is iterable so you can read the contents like this:

#!python
from sas7bdat import SAS7BDAT

# skip_header=True omits the row of SAS variable names
with SAS7BDAT('foo.sas7bdat', skip_header=True) as reader:
    for row in reader:
        print(row)

Each row will be a list of values of type string, float, datetime.date, datetime.datetime, or datetime.time. Without skip_header, the first row returned will be the SAS variable names.
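When the header row is kept, it can be paired with each data row to get dictionaries keyed by variable name. The following is a minimal sketch under stated assumptions: rows_to_dicts and to_text are hypothetical helpers, not part of the library, and the code assumes that header names may arrive as either bytes or str and that UTF-8 is a suitable encoding for them.

```python
import datetime


def rows_to_dicts(rows):
    """Pair each data row with the header row that precedes it."""
    iterator = iter(rows)
    header = next(iterator, None)
    if header is None:
        return []
    # Assumption: names may be bytes; decode them so they work as dict keys.
    names = [h.decode("utf-8") if isinstance(h, bytes) else h for h in header]
    return [dict(zip(names, row)) for row in iterator]


def to_text(value):
    """Render date, datetime and time values in ISO 8601 form."""
    # datetime.datetime is a subclass of datetime.date, so it is covered too.
    if isinstance(value, (datetime.date, datetime.time)):
        return value.isoformat()
    return str(value)


# Plain lists standing in for the rows a reader would yield.
sample = [
    ["id", "visit"],
    [1.0, datetime.date(2014, 1, 4)],
    [2.0, datetime.date(2014, 1, 8)],
]
records = rows_to_dicts(sample)
```

With a real file, the reader object itself (opened without skip_header) would be passed in place of sample.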

If you’d like to get a pandas DataFrame, use the to_data_frame method:

#!python
from sas7bdat import SAS7BDAT
with SAS7BDAT('foo.sas7bdat') as reader:
    df = reader.to_data_frame()

Variable attributes are available from reader.columns. The order of these columns will be the same as the corresponding values in each row. Each Column has the following attributes:

- col_id (int) - the column number
- name (bytes)
- label (bytes)
- format (str)
- type (str)
- length (int)
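Because name and label are bytes, they usually need decoding before display. The sketch below shows one way to summarize the attributes listed above. FakeColumn is a hypothetical stand-in so the example runs without a data file; summarize_columns is likewise a hypothetical helper, the UTF-8 default is an assumption, and the sample values for format and type are purely illustrative.

```python
from collections import namedtuple

# Hypothetical stand-in for the library's Column objects; it carries only
# the attributes named in the list above.
FakeColumn = namedtuple("FakeColumn", "col_id name label format type length")


def _text(value, encoding):
    """Decode bytes to str; pass other values through unchanged."""
    return value.decode(encoding) if isinstance(value, bytes) else value


def summarize_columns(columns, encoding="utf-8"):
    """Return one (col_id, name, label, type, length) tuple per column."""
    return [
        (c.col_id, _text(c.name, encoding), _text(c.label, encoding),
         c.type, c.length)
        for c in columns
    ]


# Illustrative values only; real files will differ.
cols = [
    FakeColumn(0, b"AGE", b"Age in years", "BEST", "number", 8),
    FakeColumn(1, b"NAME", b"Patient name", "$", "string", 20),
]
summary = summarize_columns(cols)
```

With a real file, reader.columns would be passed in place of cols, and the encoding argument adjusted if the file's text is not UTF-8.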

Release history

- 2.2.3: Jul 15, 2019 (this version)
- 2.2.2: Dec 27, 2018
- 2.2.1: Nov 5, 2018
- 2.2.0: Nov 5, 2018
- 2.1.2: Oct 20, 2018
- 2.1.1: May 25, 2018
- 2.0.7: Jan 7, 2016
- 2.0.6: Sep 7, 2015
- 2.0.5: Aug 7, 2015
- 2.0.4: Jan 27, 2015
- 2.0.3: Jan 21, 2015
- 2.0.2: Jan 8, 2015
- 2.0.1: Jan 4, 2015
- 2.0.0: Jan 4, 2015
- 1.0.5: Dec 28, 2014
- 1.0.4: Dec 11, 2014
- 1.0.3: Dec 5, 2014
- 1.0.2: Nov 22, 2014
- 1.0.1: Nov 19, 2014
- 1.0.0: Nov 17, 2014
- 0.2.5: Oct 13, 2014
- 0.2.4: Sep 30, 2014
- 0.2.3: Sep 29, 2014
- 0.2.2: Jun 21, 2013
- 0.2.1: Jun 15, 2013
- 0.2.0: Jun 14, 2013
- 0.1.2: Mar 22, 2013
- 0.1.1: Mar 2, 2013
- 0.1.0: Feb 16, 2013

Download files

Source Distribution

sas7bdat-2.2.2.tar.gz (15.6 kB)

Uploaded Dec 27, 2018 Source

Hashes for sas7bdat-2.2.2.tar.gz

Algorithm	Hash digest
SHA256	`fea44a95e0db614088493de28b94afe8471b9e1fd4d6d5cabc56c83664793855`
MD5	`5532f63fa0b9b893452f32d295fda9c0`
BLAKE2b-256	`c77df6187c1233e05f340985cccd3541bc3a96d800f8d1e20d3ff36c1661e385`