Merge pull request #315 from cls1991/develop

DataBook().load function params error.
DataBook().load function params error
2026-06-05 23:10:17 +00:00 · 2019-03-02 12:13:11 +01:00 · 2017-10-24 19:33:35 +08:00 · 2017-09-11 11:51:53 -04:00 · 2017-09-11 13:05:31 +01:00 · 2017-09-11 12:47:51 +01:00
48 changed files with 4374 additions and 372 deletions
@@ -4,6 +4,7 @@ python:
  - 2.7
  - 3.2
  - 3.3
  - 3.4
 install:
  - python setup.py install
 script: python test_tablib.py
@@ -27,3 +27,8 @@ Patches and Suggestions
 - Jakub Janoszek
 - Marc Abramowitz
 - Alex Gaynor
 - James Douglass
 - Tommy Anthony
 - Rabin Nankhwa
 - Marco Dallagiacoma
 - Mathias Loesch
@@ -1,9 +1,60 @@
 History
 -------
-++++
+0.11.3 (2016-02-16)
 +++++++++++++++++++
 - Release fix.
 0.11.2 (2016-02-16)
 +++++++++++++++++++
 **Bugfixes**
 - Fix export only formats.
 - Fix for xlsx output.
 0.11.1 (2016-02-07)
 +++++++++++++++++++
 **Bugfixes**
 - Fixed packaging error on Python 3.
 0.11.0 (2016-02-07)
 +++++++++++++++++++
 **New Formats!**
 - Added LaTeX table export format (``Dataset.latex``).
 - Support for dBase (DBF) files (``Dataset.dbf``).
 **Improvements**
 - New import/export interface (``Dataset.export()``, ``Dataset.load()``).
 - CSV custom delimiter support (``Dataset.export('csv', delimiter='$')``).
 - Adding ability to remove duplicates to all rows in a dataset (``Dataset.remove_duplicates()``).
 - Added a mechanism to avoid ``datetime.datetime`` issues when serializing data.
 - New ``detect_format()`` function (mostly for internal use).
 - Update the vendored unicodecsv to fix ``None`` handling.
 - Only freeze the headers row, not the headers columns (xls).
 **Breaking Changes**
 - ``detect()`` function removed.
 **Bugfixes**
 - Fix XLSX import.
 - Bugfix for ``Dataset.transpose().transpose()``.
 0.10.0 (2014-05-27)
 +++++++++++++++++++
 * Unicode Column Headers
 * ALL the bugfixes!
 0.9.11 (2011-06-30)
 +++++++++++++++++++
@@ -1,4 +1,4 @@
-Copyright 2011 Kenneth Reitz
+Copyright 2016 Kenneth Reitz
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal
@@ -0,0 +1,6 @@
 test:
 	python test_tablib.py
 publish:
 	python setup.py register
 	python setup.py sdist upload
 	python setup.py bdist_wheel --universal upload
@@ -23,7 +23,9 @@ Output formats supported:
 - YAML (Sets + Books)
 - HTML (Sets)
 - TSV (Sets)
 - OSD (Sets)
 - CSV (Sets)
 - DBF (Sets)
 Note that tablib *purposefully* excludes XML support. It always will. (Note: This is a joke. Pull requests are welcome.)
@@ -31,7 +33,7 @@ Overview
 --------
 `tablib.Dataset()`
-	A Dataset is a table of tabular data. It may or may not have a header row. They can be build and manipulated as raw Python datatypes (Lists of tuples|dictionaries). Datasets can be imported from JSON, YAML, and CSV; they can be exported to XLSX, XLS, ODS, JSON, YAML, CSV, TSV, and HTML.
+	A Dataset is a table of tabular data. It may or may not have a header row. They can be build and manipulated as raw Python datatypes (Lists of tuples|dictionaries). Datasets can be imported from JSON, YAML, DBF, and CSV; they can be exported to XLSX, XLS, ODS, JSON, YAML, DBF, CSV, TSV, and HTML.
 `tablib.Databook()`
 	A Databook is a set of Datasets. The most common form of a Databook is an Excel file with multiple spreadsheets. Databooks can be imported from JSON and YAML; they can be exported to XLSX, XLS, ODS, JSON, and YAML.
@@ -123,6 +125,13 @@ EXCEL!
 	>>> with open('people.xls', 'wb') as f:
 	...     f.write(data.xls)
 DBF!
 ++++
 ::
    >>> with open('people.dbf', 'wb') as f:
    ...     f.write(data.dbf)
 It's that easy.
@@ -133,9 +142,8 @@ To install tablib, simply: ::
 	$ pip install tablib
-Or, if you absolutely must: ::
+Make sure to check out `Tablib on PyPi <https://pypi.python.org/pypi/tablib/>`_!
 	$ easy_install tablib
 Contribute
 ----------
@@ -1,7 +0,0 @@
 * Hooks System
  - pre/post-append
  - pre/post-import
  - pre/post-export
 * Add Tablib.ext namespace
 * Width detection for XLS output
 * Documentation Improvements
@@ -13,7 +13,7 @@
      &copy; Copyright {{ copyright }}.
    </div>
    <a href="https://github.com/kennethreitz/tablib">
-        <img style="position: absolute; top: 0; right: 0; border: 0;" src="http://s3.amazonaws.com/github/ribbons/forkme_right_darkblue_121621.png" alt="Fork me on GitHub" />
+        <img style="position: absolute; top: 0; right: 0; border: 0;" src="//s3.amazonaws.com/github/ribbons/forkme_right_darkblue_121621.png" alt="Fork me on GitHub" />
    </a>
    <script type="text/javascript" src="//www.hellobar.com/hellobar.js"></script>
@@ -14,8 +14,8 @@
 {% block relbar1 %}{% endblock %}
 {% block relbar2 %}
  {% if theme_github_fork %}
-    <a href="http://github.com/{{ theme_github_fork }}"><img style="position: fixed; top: 0; right: 0; border: 0;"
+    <a href="https://github.com/{{ theme_github_fork }}"><img style="position: fixed; top: 0; right: 0; border: 0;"
-    src="http://s3.amazonaws.com/github/ribbons/forkme_right_darkblue_121621.png" alt="Fork me on GitHub" /></a>
+    src="//s3.amazonaws.com/github/ribbons/forkme_right_darkblue_121621.png" alt="Fork me on GitHub" /></a>
  {% endif %}
 {% endblock %}
 {% block sidebar1 %}{% endblock %}
@@ -41,7 +41,7 @@ master_doc = 'index'
 # General information about the project.
 project = u'Tablib'
-copyright = u'2011. A <a href="http://kennethreitz.com/pages/open-projects.html">Kenneth Reitz</a> Project'
+copyright = u'2016. A <a href="http://kennethreitz.org/">Kenneth Reitz</a> Project'
 # The version info for the project you're documenting, acts as replacement for
 # |version| and |release|, also used in various other places throughout the
@@ -8,11 +8,6 @@ Tablib is under active development, and contributors are welcome.
 If you have a feature request, suggestion, or bug report, please open a new
 issue on GitHub_. To submit patches, please send a pull request on GitHub_.
 If you'd like to contribute, there's plenty to do. Here's a short todo list.
    .. include:: ../TODO.rst
 .. _GitHub: http://github.com/kennethreitz/tablib/
@@ -66,8 +61,6 @@ Feature / Hotfix / Release branches follow a `Successful Git Branching Model`_ .
    The "next release" branch. Likely unstable.
 ``master``
    Current production release (|version|) on PyPi.
 ``gh-pages``
    Current release of http://docs.python-tablib.org.
 Each release is tagged.
@@ -87,9 +80,7 @@ Adding New Formats
 Tablib welcomes new format additions! Format suggestions include:
 * Tab Separated Values
 * MySQL Dump
 * HTML Table
 Coding by Convention
@@ -164,7 +155,7 @@ Once installed, we can generate our xUnit report with a single command. ::
 This will generate a **nosetests.xml** file, which can then be analyzed.
-.. _Nose: http://somethingaboutorange.com/mrl/projects/nose/
+.. _Nose: https://github.com/nose-devs/nose
@@ -207,34 +198,9 @@ Your ``docs/_build/html`` directory will then contain an HTML representation of
 You can also generate the documentation in **epub**, **latex**, **json**, *&c* similarly.
 .. admonition:: GitHub Pages
 	To push the documentation up to `GitHub Pages`_, you will first need to run `sphinx-to-github`_ against your ``docs/_build/html`` directory.
 	GitHub Pages are powered by an HTML generation system called Jekyll_, which is configured to ignore files and folders that begin with "``_``" (*ie.* **_static**).
 	 and `sphinx-to-github`_. ::
 	Installing sphinx-to-github is simple. ::
 		$ pip install sphinx-to-github
 	Running it against the docs is even simpler. ::
 		$ sphinx-to-github _build/html
 	Move the resulting files to the **gh-pages** branch of your repository, and push it up to GitHub.
 .. _`reStructured Text`: http://docutils.sourceforge.net/rst.html
 .. _Sphinx: http://sphinx.pocoo.org
 .. _`GitHub Pages`: http://pages.github.com
 .. _Jekyll: http://github.com/mojombo/jekyll
 .. _`sphinx-to-github`: http://github.com/michaeljones/sphinx-to-github
 ----------
@@ -26,13 +26,13 @@ Tablib is an :ref:`MIT Licensed <mit>` format-agnostic tabular dataset library,
 ::
-   >>> data = tablib.Dataset(*[('Kenneth', 'Reitz', 23), ('Bessie', 'Monke', 22)],
+   >>> data = tablib.Dataset(headers=['First Name', 'Last Name', 'Age'])
-                             headers=['First Name', 'Last Name', 'Age'])
+   >>> map(data.append, [('Kenneth', 'Reitz', 22), ('Bessie', 'Monke', 21)])
-   >>> data.json
+   >>> print data.json
   [{"Last Name": "Reitz", "First Name": "Kenneth", "Age": 22}, {"Last Name": "Monke", "First Name": "Bessie", "Age": 21}]
-   >>> data.yaml
+   >>> print data.yaml
   - {Age: 22, First Name: Kenneth, Last Name: Reitz}
   - {Age: 21, First Name: Bessie, Last Name: Monke}
@@ -14,27 +14,10 @@ Installing Tablib
 Distribute & Pip
 ----------------
-Installing Tablib is simple with `pip <http://www.pip-installer.org/>`_::
+Of course, the recommended way to install Tablib is with `pip <http://www.pip-installer.org/>`_::
    $ pip install tablib
 or, with `easy_install <http://pypi.python.org/pypi/setuptools>`_::
    $ easy_install tablib
 But, you really `shouldn't do that <http://www.pip-installer.org/en/latest/other-tools.html#pip-compared-to-easy-install>`_.
 Cheeseshop Mirror
 -----------------
 If the Cheeseshop is down, you can also install Requests from Kenneth Reitz's personal `Cheeseshop mirror <pip.kreitz.co/>`_::
    $ pip install -i http://pip.kreitz.co/simple tablib
 -------------------
 Download the Source
@@ -89,4 +72,4 @@ When a new version is available, upgrading is simple::
    $ pip install tablib --upgrade
-Now, go get a :ref:`Quick Start <quickstart>`.
+Now, go get a :ref:`Quick Start <quickstart>`.
@@ -10,7 +10,7 @@ Advanced features include, segregation, dynamic columns, tags / filtering, and
 seamless format import/export.
-Philosphy
+Philosophy
 ---------
 Tablib was developed with a few :pep:`20` idioms in mind.
@@ -49,7 +49,7 @@ Tablib is released under terms of `The MIT License`_.
 Tablib License
 --------------
-Copyright 2011 Kenneth Reitz
+Copyright 2016 Kenneth Reitz
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal
@@ -90,4 +90,4 @@ Support for other Pythons will be rolled out soon.
-Now, go :ref:`Install Tablib <install>`.
+Now, go :ref:`Install Tablib <install>`.
@@ -39,6 +39,7 @@ You can now start filling this :class:`Dataset <tablib.Dataset>` object with dat
 -----------
 Adding Rows
 -----------
@@ -97,6 +98,15 @@ Let's view the data now. ::
 It's that easy.
 --------------
 Importing Data
 --------------
 Creating a :class:`tablib.Dataset` object by importing a pre-existing file is simple. ::
   imported_data = Dataset().load(open('data.csv').read())
 This detects what sort of data is being passed in, and uses an appropriate formatter to do the import. So you can import from a variety of different file types.
 --------------
 Exporting Data
 --------------
@@ -1,2 +0,0 @@
 [wheel]
 universal = 1
@@ -36,6 +36,32 @@ if sys.argv[-1] == 'test':
    errors = os.system('py.test test_tablib.py')
    sys.exit(bool(errors))
 packages = [
    'tablib', 'tablib.formats',
    'tablib.packages',
    'tablib.packages.omnijson',
    'tablib.packages.unicodecsv',
    'tablib.packages.xlwt',
    'tablib.packages.xlrd',
    'tablib.packages.odf',
    'tablib.packages.openpyxl',
    'tablib.packages.openpyxl.shared',
    'tablib.packages.openpyxl.reader',
    'tablib.packages.openpyxl.writer',
    'tablib.packages.yaml',
    'tablib.packages.dbfpy',
    'tablib.packages.xlwt3',
    'tablib.packages.xlrd3',
    'tablib.packages.odf3',
    'tablib.packages.openpyxl3',
    'tablib.packages.openpyxl3.shared',
    'tablib.packages.openpyxl3.reader',
    'tablib.packages.openpyxl3.writer',
    'tablib.packages.yaml3',
    'tablib.packages.dbfpy3'
 ]
 setup(
    name='tablib',
    version=tablib.__version__,
@@ -45,28 +71,7 @@ setup(
    author='Kenneth Reitz',
    author_email='me@kennethreitz.org',
    url='http://python-tablib.org',
-    packages=[
+    packages=packages,
        'tablib', 'tablib.formats',
        'tablib.packages',
        'tablib.packages.xlwt',
        'tablib.packages.xlwt3',
        'tablib.packages.xlrd',
        'tablib.packages.xlrd3',
        'tablib.packages.omnijson',
        'tablib.packages.odf',
        'tablib.packages.odf3',
        'tablib.packages.openpyxl',
        'tablib.packages.openpyxl.shared',
        'tablib.packages.openpyxl.reader',
        'tablib.packages.openpyxl.writer',
        'tablib.packages.openpyxl3',
        'tablib.packages.openpyxl3.shared',
        'tablib.packages.openpyxl3.reader',
        'tablib.packages.openpyxl3.writer',
        'tablib.packages.yaml',
        'tablib.packages.yaml3',
        'tablib.packages.unicodecsv'
    ],
    license='MIT',
    classifiers=(
        'Development Status :: 5 - Production/Stable',
@@ -1,7 +1,7 @@
 """ Tablib. """
 from tablib.core import (
-    Databook, Dataset, detect, import_set, import_book,
+    Databook, Dataset, detect_format, import_set, import_book,
    InvalidDatasetType, InvalidDimensions, UnsupportedFormat,
    __version__
 )
@@ -28,6 +28,7 @@ if is_py3:
    from tablib.packages import markup3 as markup
    from tablib.packages import openpyxl3 as openpyxl
    from tablib.packages.odf3 import opendocument, style, text, table
    import tablib.packages.dbfpy3 as dbfpy
    import csv
    from io import StringIO
@@ -36,6 +37,7 @@ if is_py3:
    unicode = str
    bytes = bytes
    basestring = str
    xrange = range
 else:
    from cStringIO import StringIO as BytesIO
@@ -49,5 +51,7 @@ else:
    from tablib.packages.odf import opendocument, style, text, table
    from tablib.packages import unicodecsv as csv
    import tablib.packages.dbfpy as dbfpy
    unicode = unicode
    xrange = xrange
@@ -5,7 +5,7 @@
    This module implements the central Tablib objects.
-    :copyright: (c) 2014 by Kenneth Reitz.
+    :copyright: (c) 2016 by Kenneth Reitz.
    :license: MIT, see LICENSE for more details.
 """
@@ -18,11 +18,11 @@ from tablib.compat import OrderedDict, unicode
 __title__ = 'tablib'
-__version__ = '0.10.0'
+__version__ = '0.11.3'
-__build__ = 0x001000
+__build__ = 0x001103
 __author__ = 'Kenneth Reitz'
 __license__ = 'MIT'
-__copyright__ = 'Copyright 2014 Kenneth Reitz'
+__copyright__ = 'Copyright 2016 Kenneth Reitz'
 __docformat__ = 'restructuredtext'
@@ -45,7 +45,7 @@ class Row(object):
        return repr(self._row)
    def __getslice__(self, i, j):
-        return self._row[i,j]
+        return self._row[i:j]
    def __getitem__(self, i):
        return self._row[i]
@@ -153,6 +153,8 @@ class Dataset(object):
    """
    _formats = {}
    def __init__(self, *args, **kwargs):
        self._data = list(Row(arg) for arg in args)
        self.__headers = None
@@ -163,15 +165,9 @@ class Dataset(object):
        # (column, callback) tuples
        self._formatters = []
-        try:
+        self.headers = kwargs.get('headers')
            self.headers = kwargs['headers']
        except KeyError:
            self.headers = None
-        try:
+        self.title = kwargs.get('title')
            self.title = kwargs['title']
        except KeyError:
            self.title = None
        self._register_formats()
@@ -224,12 +220,15 @@ class Dataset(object):
            return '<dataset object>'
    def __unicode__(self):
-        result = [self.__headers]
+        result = []
        # Add unicode representation of headers.
        result.append([unicode(h) for h in self.__headers])
        # Add unicode representation of rows.
        result.extend(list(map(unicode, row)) for row in self._data)
-        # here, we calculate max width for each column
+        lens = [list(map(len, row)) for row in result]
        lens = (list(map(len, row)) for row in result)
        field_lens = list(map(max, zip(*lens)))
        # delimiter between header and data
@@ -242,7 +241,6 @@ class Dataset(object):
    def __str__(self):
        return self.__unicode__()
    # ---------
    # Internals
    # ---------
@@ -254,11 +252,16 @@ class Dataset(object):
            try:
                try:
                    setattr(cls, fmt.title, property(fmt.export_set, fmt.import_set))
                    setattr(cls, 'get_%s' % fmt.title, fmt.export_set)
                    setattr(cls, 'set_%s' % fmt.title, fmt.import_set)
                    cls._formats[fmt.title] = (fmt.export_set, fmt.import_set)
                except AttributeError:
                    setattr(cls, fmt.title, property(fmt.export_set))
                    setattr(cls, 'get_%s' % fmt.title, fmt.export_set)
                    cls._formats[fmt.title] = (fmt.export_set, None)
            except AttributeError:
-                pass
+                cls._formats[fmt.title] = (None, None)
    def _validate(self, row=None, col=None, safety=False):
@@ -349,7 +352,7 @@ class Dataset(object):
        A dataset object can also be imported by setting the `Dataset.dict` attribute: ::
            data = tablib.Dataset()
-            data.json = '[{"last_name": "Adams","age": 90,"first_name": "John"}]'
+            data.dict = [{'age': 90, 'first_name': 'Kenneth', 'last_name': 'Reitz'}]
        """
        return self._package()
@@ -429,11 +432,41 @@ class Dataset(object):
                return 0
    def load(self, in_stream, format=None, **kwargs):
        """
        Import `in_stream` to the :class:`Dataset` object using the `format`.
        :param \*\*kwargs: (optional) custom configuration to the format `import_set`.
        """
        if not format:
            format = detect_format(in_stream)
        export_set, import_set = self._formats.get(format, (None, None))
        if not import_set:
            raise UnsupportedFormat('Format {0} cannot be imported.'.format(format))
        import_set(self, in_stream, **kwargs)
        return self
    def export(self, format, **kwargs):
        """
        Export :class:`Dataset` object to `format`.
        :param \*\*kwargs: (optional) custom configuration to the format `export_set`.
        """
        export_set, import_set = self._formats.get(format, (None, None))
        if not export_set:
            raise UnsupportedFormat('Format {0} cannot be exported.'.format(format))
        return export_set(self, **kwargs)
    # -------
    # Formats
    # -------
    @property
    def xls():
        """A Legacy Excel Spreadsheet representation of the :class:`Dataset` object, with :ref:`separators`. Cannot be set.
@@ -544,7 +577,7 @@ class Dataset(object):
        A dataset object can also be imported by setting the :class:`Dataset.json` attribute: ::
            data = tablib.Dataset()
-            data.json = '[{age: 90, first_name: "John", liast_name: "Adams"}]'
+            data.json = '[{"age": 90, "first_name": "John", "last_name": "Adams"}]'
        Import assumes (for now) that headers exist.
        """
@@ -559,6 +592,40 @@ class Dataset(object):
        """
        pass
    @property
    def dbf():
        """A dBASE representation of the :class:`Dataset` object.
        A dataset object can also be imported by setting the
        :class:`Dataset.dbf` attribute. ::
            # To import data from an existing DBF file:
            data = tablib.Dataset()
            data.dbf = open('existing_table.dbf').read()
            # to import data from an ASCII-encoded bytestring:
            data = tablib.Dataset()
            data.dbf = '<bytestring of tabular data>'
        .. admonition:: Binary Warning
            :class:`Dataset.dbf` contains binary data, so make sure to write in binary mode::
                with open('output.dbf', 'wb') as f:
                    f.write(data.dbf)
        """
        pass
    @property
    def latex():
        """A LaTeX booktabs representation of the :class:`Dataset` object. If a
        title has been set, it will be exported as the table caption.
        .. note:: This method can be used for export only.
        """
        pass
    # ----
    # Rows
@@ -839,17 +906,17 @@ class Dataset(object):
        new_headers = [self.headers[0]] + self[self.headers[0]]
        _dset.headers = new_headers
-        for column in self.headers:
+        for index, column in enumerate(self.headers):
            if column == self.headers[0]:
                # It's in the headers, so skip it
                continue
            # Adding the column name as now they're a regular column
-            row_data = [column] + self[column]
+            # Use `get_col(index)` in case there are repeated values
            row_data = [column] + self.get_col(index)
            row_data = Row(row_data)
            _dset.append(row=row_data)
        return _dset
@@ -910,17 +977,66 @@ class Dataset(object):
        return _dset
    def remove_duplicates(self):
        """Removes all duplicate rows from the :class:`Dataset` object
        while maintaining the original order."""
        seen = set()
        self._data[:] = [row for row in self._data if not (tuple(row) in seen or seen.add(tuple(row)))]
    def wipe(self):
        """Removes all content and headers from the :class:`Dataset` object."""
        self._data = list()
        self.__headers = None
    def subset(self, rows=None, cols=None):
        """Returns a new instance of the :class:`Dataset`,
        including only specified rows and columns.
        """
        # Don't return if no data
        if not self:
            return
        if rows is None:
            rows = list(range(self.height))
        if cols is None:
            cols = list(self.headers)
        #filter out impossible rows and columns
        rows = [row for row in rows if row in range(self.height)]
        cols = [header for header in cols if header in self.headers]
        _dset = Dataset()
        #filtering rows and columns
        _dset.headers = list(cols)
        _dset._data = []
        for row_no, row in enumerate(self._data):
            data_row = []
            for key in _dset.headers:
                if key in self.headers:
                    pos = self.headers.index(key)
                    data_row.append(row[pos])
                else:
                    raise KeyError
            if row_no in rows:
                _dset.append(row=Row(data_row))
        return _dset
 class Databook(object):
    """A book of :class:`Dataset` objects.
    """
    _formats = {}
    def __init__(self, sets=None):
        if sets is None:
@@ -936,7 +1052,6 @@ class Databook(object):
        except AttributeError:
            return '<databook object>'
    def wipe(self):
        """Removes all :class:`Dataset` objects from the :class:`Databook`."""
        self._datasets = []
@@ -949,11 +1064,13 @@ class Databook(object):
            try:
                try:
                    setattr(cls, fmt.title, property(fmt.export_book, fmt.import_book))
                    cls._formats[fmt.title] = (fmt.export_book, fmt.import_book)
                except AttributeError:
                    setattr(cls, fmt.title, property(fmt.export_book))
                    cls._formats[fmt.title] = (fmt.export_book, None)
            except AttributeError:
-                pass
+                cls._formats[fmt.title] = (None, None)
    def sheets(self):
        return self._datasets
@@ -988,42 +1105,55 @@ class Databook(object):
        """The number of the :class:`Dataset` objects within :class:`Databook`."""
        return len(self._datasets)
    def load(self, in_stream, format, **kwargs):
        """
        Import `in_stream` to the :class:`Databook` object using the `format`.
-def detect(stream):
+        :param \*\*kwargs: (optional) custom configuration to the format `import_book`.
-    """Return (format, stream) of given stream."""
+        """
        if not format:
            format = detect_format(in_stream)
        export_book, import_book = self._formats.get(format, (None, None))
        if not import_book:
            raise UnsupportedFormat('Format {0} cannot be loaded.'.format(format))
        import_book(self, in_stream, **kwargs)
        return self
    def export(self, format, **kwargs):
        """
        Export :class:`Databook` object to `format`.
        :param \*\*kwargs: (optional) custom configuration to the format `export_book`.
        """
        export_book, import_book = self._formats.get(format, (None, None))
        if not export_book:
            raise UnsupportedFormat('Format {0} cannot be exported.'.format(format))
        return export_book(self, **kwargs)
 def detect_format(stream):
    """Return format name of given stream."""
    for fmt in formats.available:
        try:
            if fmt.detect(stream):
-                return (fmt, stream)
+                return fmt.title
        except AttributeError:
            pass
    return (None, stream)
-
+def import_set(stream, format=None, **kwargs):
 def import_set(stream):
    """Return dataset of given stream."""
    (format, stream) = detect(stream)
-    try:
+    return Dataset().load(stream, format, **kwargs)
        data = Dataset()
        format.import_set(data, stream)
        return data
    except AttributeError:
        return None
-def import_book(stream):
+def import_book(stream, format=None, **kwargs):
    """Return dataset of given stream."""
    (format, stream) = detect(stream)
-    try:
+    return Databook().load(stream, format, **kwargs)
        databook = Databook()
        format.import_book(databook, stream)
        return databook
    except AttributeError:
        return None
 class InvalidDatasetType(Exception):
@@ -11,5 +11,7 @@ from . import _tsv as tsv
 from . import _html as html
 from . import _xlsx as xlsx
 from . import _ods as ods
 from . import _dbf as dbf
 from . import _latex as latex
-available = (json, xls, yaml, csv, tsv, html, xlsx, ods)
+available = (json, xls, yaml, csv, dbf, tsv, html, latex, xlsx, ods)
@@ -1,6 +1,6 @@
 # -*- coding: utf-8 -*-
-""" Tablib - CSV Support.
+""" Tablib - *SV Support.
 """
 from tablib.compat import is_py3, csv, StringIO
@@ -11,17 +11,18 @@ extensions = ('csv',)
 DEFAULT_ENCODING = 'utf-8'
 DEFAULT_DELIMITER = ','
-
+def export_set(dataset, **kwargs):
 def export_set(dataset):
    """Returns CSV representation of Dataset."""
    stream = StringIO()
-    if is_py3:
+    kwargs.setdefault('delimiter', DEFAULT_DELIMITER)
-        _csv = csv.writer(stream)
+    if not is_py3:
-    else:
+        kwargs.setdefault('encoding', DEFAULT_ENCODING)
-        _csv = csv.writer(stream, encoding=DEFAULT_ENCODING)
+
    _csv = csv.writer(stream, **kwargs)
    for row in dataset._package(dicts=False):
        _csv.writerow(row)
@@ -29,15 +30,16 @@ def export_set(dataset):
    return stream.getvalue()
-def import_set(dset, in_stream, headers=True):
+def import_set(dset, in_stream, headers=True, **kwargs):
    """Returns dataset from CSV stream."""
    dset.wipe()
-    if is_py3:
+    kwargs.setdefault('delimiter', DEFAULT_DELIMITER)
-        rows = csv.reader(StringIO(in_stream))
+    if not is_py3:
-    else:
+        kwargs.setdefault('encoding', DEFAULT_ENCODING)
-        rows = csv.reader(StringIO(in_stream), encoding=DEFAULT_ENCODING)
+
    rows = csv.reader(StringIO(in_stream), **kwargs)
    for i, row in enumerate(rows):
        if (i == 0) and (headers):
@@ -46,10 +48,10 @@ def import_set(dset, in_stream, headers=True):
            dset.append(row)
-def detect(stream):
+def detect(stream, delimiter=DEFAULT_DELIMITER):
    """Returns True if given stream is valid CSV."""
    try:
-        csv.Sniffer().sniff(stream, delimiters=',')
+        csv.Sniffer().sniff(stream, delimiters=delimiter)
        return True
    except (csv.Error, TypeError):
        return False
@@ -0,0 +1,93 @@
 # -*- coding: utf-8 -*-
 """ Tablib - DBF Support.
 """
 import tempfile
 import struct
 import os
 from tablib.compat import StringIO
 from tablib.compat import dbfpy
 from tablib.compat import is_py3
 if is_py3:
    from tablib.packages.dbfpy3 import dbf
    from tablib.packages.dbfpy3 import dbfnew
    from tablib.packages.dbfpy3 import record as dbfrecord
    import io
 else:
    from tablib.packages.dbfpy import dbf
    from tablib.packages.dbfpy import dbfnew
    from tablib.packages.dbfpy import record as dbfrecord
 title = 'dbf'
 extensions = ('csv',)
 DEFAULT_ENCODING = 'utf-8'
 def export_set(dataset):
    """Returns DBF representation of a Dataset"""
    new_dbf = dbfnew.dbf_new()
    temp_file, temp_uri = tempfile.mkstemp()
    # create the appropriate fields based on the contents of the first row
    first_row = dataset[0]
    for fieldname, field_value in zip(dataset.headers, first_row):
        if type(field_value) in [int, float]:
            new_dbf.add_field(fieldname, 'N', 10, 8)
        else:
            new_dbf.add_field(fieldname, 'C', 80)
    new_dbf.write(temp_uri)
    dbf_file = dbf.Dbf(temp_uri, readOnly=0)
    for row in dataset:
        record = dbfrecord.DbfRecord(dbf_file)
        for fieldname, field_value in zip(dataset.headers, row):
            record[fieldname] = field_value
        record.store()
    dbf_file.close()
    dbf_stream = open(temp_uri, 'rb')
    if is_py3:
        stream = io.BytesIO(dbf_stream.read())
    else:
        stream = StringIO(dbf_stream.read())
    dbf_stream.close()
    os.remove(temp_uri)
    return stream.getvalue()
 def import_set(dset, in_stream, headers=True):
    """Returns a dataset from a DBF stream."""
    dset.wipe()
    if is_py3:
        _dbf = dbf.Dbf(io.BytesIO(in_stream))
    else:
        _dbf = dbf.Dbf(StringIO(in_stream))
    dset.headers = _dbf.fieldNames
    for record in range(_dbf.recordCount):
        row = [_dbf[record][f] for f in _dbf.fieldNames]
        dset.append(row)
 def detect(stream):
    """Returns True if the given stream is valid DBF"""
    #_dbf = dbf.Table(StringIO(stream))
    try:
        if is_py3:
            if type(stream) is not bytes:
                stream = bytes(stream, 'utf-8')
            _dbf = dbf.Dbf(io.BytesIO(stream), readOnly=True)
        else:
            _dbf = dbf.Dbf(StringIO(stream), readOnly=True)
        return True
    except (ValueError, struct.error):
        # When we try to open up a file that's not a DBF, dbfpy raises a
        # ValueError.
        # When unpacking a string argument with less than 8 chars, struct.error is
        # raised.
        return False
@@ -23,45 +23,45 @@ extensions = ('html', )
 def export_set(dataset):
-	"""HTML representation of a Dataset."""
+    """HTML representation of a Dataset."""
-	stream = StringIO()
+    stream = StringIO()
-	page = markup.page()
+    page = markup.page()
-	page.table.open()
+    page.table.open()
-	if dataset.headers is not None:
+    if dataset.headers is not None:
-		new_header = [item if item is not None else '' for item in dataset.headers] 
+        new_header = [item if item is not None else '' for item in dataset.headers] 
-		page.thead.open()
+        page.thead.open()
-		headers = markup.oneliner.th(new_header)
+        headers = markup.oneliner.th(new_header)
-		page.tr(headers)
+        page.tr(headers)
-		page.thead.close()
+        page.thead.close()
-	for row in dataset:
+    for row in dataset:
-		new_row = [item if item is not None else '' for item in row] 
+        new_row = [item if item is not None else '' for item in row] 
-		html_row = markup.oneliner.td(new_row)
+        html_row = markup.oneliner.td(new_row)
-		page.tr(html_row)
+        page.tr(html_row)
-	page.table.close()
+    page.table.close()
    # Allow unicode characters in output
-	wrapper = codecs.getwriter("utf8")(stream)
+    wrapper = codecs.getwriter("utf8")(stream)
-	wrapper.writelines(unicode(page))
+    wrapper.writelines(unicode(page))
-	return stream.getvalue().decode('utf-8')
+    return stream.getvalue().decode('utf-8')
 def export_book(databook):
-	"""HTML representation of a Databook."""
+    """HTML representation of a Databook."""
-	stream = StringIO()
+    stream = StringIO()
-	for i, dset in enumerate(databook._datasets):
+    for i, dset in enumerate(databook._datasets):
-		title = (dset.title if dset.title else 'Set %s' % (i))
+        title = (dset.title if dset.title else 'Set %s' % (i))
-		stream.write('<%s>%s</%s>\n' % (BOOK_ENDINGS, title, BOOK_ENDINGS))
+        stream.write('<%s>%s</%s>\n' % (BOOK_ENDINGS, title, BOOK_ENDINGS))
-		stream.write(dset.html)
+        stream.write(dset.html)
-		stream.write('\n')
+        stream.write('\n')
-	return stream.getvalue()
+    return stream.getvalue()
@@ -13,14 +13,18 @@ title = 'json'
 extensions = ('json', 'jsn')
 def date_handler(obj):
    return obj.isoformat() if hasattr(obj, 'isoformat') else obj
 def export_set(dataset):
    """Returns JSON representation of Dataset."""
-    return json.dumps(dataset.dict)
+    return json.dumps(dataset.dict, default=date_handler)
 def export_book(databook):
    """Returns JSON representation of Databook."""
-    return json.dumps(databook._package())
+    return json.dumps(databook._package(), default=date_handler)
 def import_set(dset, in_stream):
@@ -0,0 +1,134 @@
 # -*- coding: utf-8 -*-
 """Tablib - LaTeX table export support.
   Generates a LaTeX booktabs-style table from the dataset.
 """
 import re
 from tablib.compat import unicode
 title = 'latex'
 extensions = ('tex',)
 TABLE_TEMPLATE = """\
 %% Note: add \\usepackage{booktabs} to your preamble
 %%
 \\begin{table}[!htbp]
  \\centering
  %(CAPTION)s
  \\begin{tabular}{%(COLSPEC)s}
    \\toprule
 %(HEADER)s
    %(MIDRULE)s
 %(BODY)s
    \\bottomrule
  \\end{tabular}
 \\end{table}
 """
 TEX_RESERVED_SYMBOLS_MAP = dict([
    ('\\', '\\textbackslash{}'),
    ('{', '\\{'),
    ('}', '\\}'),
    ('$', '\\$'),
    ('&', '\\&'),
    ('#', '\\#'),
    ('^', '\\textasciicircum{}'),
    ('_', '\\_'),
    ('~', '\\textasciitilde{}'),
    ('%', '\\%'),
 ])
 TEX_RESERVED_SYMBOLS_RE = re.compile(
    '(%s)' % '|'.join(map(re.escape, TEX_RESERVED_SYMBOLS_MAP.keys())))
 def export_set(dataset):
    """Returns LaTeX representation of dataset
    :param dataset: dataset to serialize
    :type dataset: tablib.core.Dataset
    """
    caption = '\\caption{%s}' % dataset.title if dataset.title else '%'
    colspec = _colspec(dataset.width)
    header = _serialize_row(dataset.headers) if dataset.headers else ''
    midrule = _midrule(dataset.width)
    body = '\n'.join([_serialize_row(row) for row in dataset])
    return TABLE_TEMPLATE % dict(CAPTION=caption, COLSPEC=colspec,
                                 HEADER=header, MIDRULE=midrule, BODY=body)
 def _colspec(dataset_width):
    """Generates the column specification for the LaTeX `tabular` environment
    based on the dataset width.
    The first column is justified to the left, all further columns are aligned
    to the right.
    .. note:: This is only a heuristic and most probably has to be fine-tuned
    post export. Column alignment should depend on the data type, e.g., textual
    content should usually be aligned to the left while numeric content almost
    always should be aligned to the right.
    :param dataset_width: width of the dataset
    """
    spec = 'l'
    for _ in range(1, dataset_width):
        spec += 'r'
    return spec
 def _midrule(dataset_width):
    """Generates the table `midrule`, which may be composed of several
    `cmidrules`.
    :param dataset_width: width of the dataset to serialize
    """
    if not dataset_width or dataset_width == 1:
        return '\\midrule'
    return ' '.join([_cmidrule(colindex, dataset_width) for colindex in
                     range(1, dataset_width + 1)])
 def _cmidrule(colindex, dataset_width):
    """Generates the `cmidrule` for a single column with appropriate trimming
    based on the column position.
    :param colindex: Column index
    :param dataset_width: width of the dataset
    """
    rule = '\\cmidrule(%s){%d-%d}'
    if colindex == 1:
        # Rule of first column is trimmed on the right
        return rule % ('r', colindex, colindex)
    if colindex == dataset_width:
        # Rule of last column is trimmed on the left
        return rule % ('l', colindex, colindex)
    # Inner columns are trimmed on the left and right
    return rule % ('lr', colindex, colindex)
 def _serialize_row(row):
    """Returns string representation of a single row.
    :param row: single dataset row
    """
    new_row = [_escape_tex_reserved_symbols(unicode(item)) if item else '' for
               item in row]
    return 6 * ' ' + ' & '.join(new_row) + ' \\\\'
 def _escape_tex_reserved_symbols(input):
    """Escapes all TeX reserved symbols ('_', '~', etc.) in a string.
    :param input: String to escape
    """
    def replace(match):
        return TEX_RESERVED_SYMBOLS_MAP[match.group()]
    return TEX_RESERVED_SYMBOLS_RE.sub(replace, input)
@@ -3,57 +3,28 @@
 """ Tablib - TSV (Tab Separated Values) Support.
 """
-from tablib.compat import is_py3, csv, StringIO
+from tablib.formats._csv import (
-
+    export_set as export_set_wrapper,
-
+    import_set as import_set_wrapper,
    detect as detect_wrapper,
 )
 title = 'tsv'
 extensions = ('tsv',)
 DEFAULT_ENCODING = 'utf-8'
 DELIMITER = '\t'
 def export_set(dataset):
-    """Returns a TSV representation of Dataset."""
+    """Returns TSV representation of Dataset."""
-
+    return export_set_wrapper(dataset, delimiter=DELIMITER)
    stream = StringIO()
    if is_py3:
        _tsv = csv.writer(stream, delimiter='\t')
    else:
        _tsv = csv.writer(stream, encoding=DEFAULT_ENCODING, delimiter='\t')
    for row in dataset._package(dicts=False):
        _tsv.writerow(row)
    return stream.getvalue()
 def import_set(dset, in_stream, headers=True):
    """Returns dataset from TSV stream."""
-
+    return import_set_wrapper(dset, in_stream, headers=headers, delimiter=DELIMITER)
    dset.wipe()
    if is_py3:
        rows = csv.reader(in_stream.splitlines(), delimiter='\t')
    else:
        rows = csv.reader(in_stream.splitlines(), delimiter='\t',
                          encoding=DEFAULT_ENCODING)
    for i, row in enumerate(rows):
        # Skip empty rows
        if not row:
            continue
        if (i == 0) and (headers):
            dset.headers = row
        else:
            dset.append(row)
 def detect(stream):
    """Returns True if given stream is valid TSV."""
-    try:
+    return detect_wrapper(stream, delimiter=DELIMITER)
        csv.Sniffer().sniff(stream, delimiters='\t')
        return True
    except (csv.Error, TypeError):
        return False
@@ -5,7 +5,7 @@
 import sys
-from tablib.compat import BytesIO, xlwt, xlrd, XLRDError
+from tablib.compat import BytesIO, xlwt, xlrd, XLRDError, xrange
 import tablib
 title = 'xls'
@@ -17,6 +17,7 @@ import tablib
 Workbook = openpyxl.workbook.Workbook
 ExcelWriter = openpyxl.writer.excel.ExcelWriter
 get_column_letter = openpyxl.cell.get_column_letter
 DataTypeException = openpyxl.shared.exc.DataTypeException
 from tablib.compat import unicode
@@ -33,21 +34,21 @@ def detect(stream):
    except openpyxl.shared.exc.InvalidFileException:
        pass
-def export_set(dataset):
+def export_set(dataset, freeze_panes=True):
    """Returns XLSX representation of Dataset."""
    wb = Workbook()
    ws = wb.worksheets[0]
    ws.title = dataset.title if dataset.title else 'Tablib Dataset'
-    dset_sheet(dataset, ws)
+    dset_sheet(dataset, ws, freeze_panes=freeze_panes)
    stream = BytesIO()
    wb.save(stream)
    return stream.getvalue()
-def export_book(databook):
+def export_book(databook, freeze_panes=True):
    """Returns XLSX representation of DataBook."""
    wb = Workbook()
@@ -56,7 +57,7 @@ def export_book(databook):
        ws = wb.create_sheet()
        ws.title = dset.title if dset.title else 'Sheet%s' % (i)
-        dset_sheet(dset, ws)
+        dset_sheet(dset, ws, freeze_panes=freeze_panes)
    stream = BytesIO()
@@ -69,7 +70,7 @@ def import_set(dset, in_stream, headers=True):
    dset.wipe()
-    xls_book = openpyxl.reader.excel.load_workbook(in_stream)
+    xls_book = openpyxl.reader.excel.load_workbook(BytesIO(in_stream))
    sheet = xls_book.get_active_sheet()
    dset.title = sheet.title
@@ -87,7 +88,7 @@ def import_book(dbook, in_stream, headers=True):
    dbook.wipe()
-    xls_book = openpyxl.reader.excel.load_workbook(in_stream)
+    xls_book = openpyxl.reader.excel.load_workbook(BytesIO(in_stream))
    for sheet in xls_book.worksheets:
        data = tablib.Dataset()
@@ -103,7 +104,7 @@ def import_book(dbook, in_stream, headers=True):
        dbook.add_sheet(data)
-def dset_sheet(dataset, ws):
+def dset_sheet(dataset, ws, freeze_panes=True):
    """Completes given worksheet from given Dataset."""
    _package = dataset._package(dicts=False)
@@ -115,38 +116,35 @@ def dset_sheet(dataset, ws):
        row_number = i + 1
        for j, col in enumerate(row):
            col_idx = get_column_letter(j + 1)
            # We want to freeze the column after the last column
            frzn_col_idx = get_column_letter(j + 2)
            # bold headers
            if (row_number == 1) and dataset.headers:
                # ws.cell('%s%s'%(col_idx, row_number)).value = unicode(
                    # '%s' % col, errors='ignore')
                ws.cell('%s%s'%(col_idx, row_number)).value = unicode(col)
                style = ws.get_style('%s%s' % (col_idx, row_number))
                style.font.bold = True
-                ws.freeze_panes = '%s%s' % (frzn_col_idx, row_number)
+                if freeze_panes:
-
+                    # As already done in #53, but after Merge lost:
                    #  Export Freeze only after first Line
                    ws.freeze_panes = 'A2'
            # bold separators
            elif len(row) < dataset.width:
                ws.cell('%s%s'%(col_idx, row_number)).value = unicode(
                    '%s' % col, errors='ignore')
                style = ws.get_style('%s%s' % (col_idx, row_number))
                style.font.bold = True
            # wrap the rest
            else:
                try:
-                    if '\n' in col:
+                    str_col_value = unicode(col)
-                        ws.cell('%s%s'%(col_idx, row_number)).value = unicode(
+                except TypeError:
-                            '%s' % col, errors='ignore')
+                    str_col_value = ''
                    if '\n' in str_col_value:
                        style = ws.get_style('%s%s' % (col_idx, row_number))
                        style.alignment.wrap_text
                    else:
                        ws.cell('%s%s'%(col_idx, row_number)).value = unicode(
                            '%s' % col, errors='ignore')
                except TypeError:
                    ws.cell('%s%s'%(col_idx, row_number)).value = unicode(col)
            try:
                ws.cell('%s%s' % (col_idx, row_number)).value = col
            except (ValueError, TypeError, DataTypeException):
                ws.cell('%s%s' % (col_idx, row_number)).value = unicode(col)
@@ -0,0 +1,292 @@
 #! /usr/bin/env python
 """DBF accessing helpers.
 FIXME: more documentation needed
 Examples:
    Create new table, setup structure, add records:
        dbf = Dbf(filename, new=True)
        dbf.addField(
            ("NAME", "C", 15),
            ("SURNAME", "C", 25),
            ("INITIALS", "C", 10),
            ("BIRTHDATE", "D"),
        )
        for (n, s, i, b) in (
            ("John", "Miller", "YC", (1980, 10, 11)),
            ("Andy", "Larkin", "", (1980, 4, 11)),
        ):
            rec = dbf.newRecord()
            rec["NAME"] = n
            rec["SURNAME"] = s
            rec["INITIALS"] = i
            rec["BIRTHDATE"] = b
            rec.store()
        dbf.close()
    Open existed dbf, read some data:
        dbf = Dbf(filename, True)
        for rec in dbf:
            for fldName in dbf.fieldNames:
                print '%s:\t %s (%s)' % (fldName, rec[fldName],
                    type(rec[fldName]))
            print
        dbf.close()
 """
 """History (most recent first):
 11-feb-2007 [als]   export INVALID_VALUE;
                    Dbf: added .ignoreErrors, .INVALID_VALUE
 04-jul-2006 [als]   added export declaration
 20-dec-2005 [yc]    removed fromStream and newDbf methods:
                    use argument of __init__ call must be used instead;
                    added class fields pointing to the header and
                    record classes.
 17-dec-2005 [yc]    split to several modules; reimplemented
 13-dec-2005 [yc]    adapted to the changes of the `strutil` module.
 13-sep-2002 [als]   support FoxPro Timestamp datatype
 15-nov-1999 [jjk]   documentation updates, add demo
 24-aug-1998 [jjk]   add some encodeValue methods (not tested), other tweaks
 08-jun-1998 [jjk]   fix problems, add more features
 20-feb-1998 [jjk]   fix problems, add more features
 19-feb-1998 [jjk]   add create/write capabilities
 18-feb-1998 [jjk]   from dbfload.py
 """
 __version__ = "$Revision: 1.7 $"[11:-2]
 __date__ = "$Date: 2007/02/11 09:23:13 $"[7:-2]
 __author__ = "Jeff Kunce <kuncej@mail.conservation.state.mo.us>"
 __all__ = ["Dbf"]
 from . import header
 from .import record
 from utils import INVALID_VALUE
 class Dbf(object):
    """DBF accessor.
    FIXME:
        docs and examples needed (dont' forget to tell
        about problems adding new fields on the fly)
    Implementation notes:
        ``_new`` field is used to indicate whether this is
        a new data table. `addField` could be used only for
        the new tables! If at least one record was appended
        to the table it's structure couldn't be changed.
    """
    __slots__ = ("name", "header", "stream",
        "_changed", "_new", "_ignore_errors")
    HeaderClass = header.DbfHeader
    RecordClass = record.DbfRecord
    INVALID_VALUE = INVALID_VALUE
    ## initialization and creation helpers
    def __init__(self, f, readOnly=False, new=False, ignoreErrors=False):
        """Initialize instance.
        Arguments:
            f:
                Filename or file-like object.
            new:
                True if new data table must be created. Assume
                data table exists if this argument is False.
            readOnly:
                if ``f`` argument is a string file will
                be opend in read-only mode; in other cases
                this argument is ignored. This argument is ignored
                even if ``new`` argument is True.
            headerObj:
                `header.DbfHeader` instance or None. If this argument
                is None, new empty header will be used with the
                all fields set by default.
            ignoreErrors:
                if set, failing field value conversion will return
                ``INVALID_VALUE`` instead of raising conversion error.
        """
        if isinstance(f, basestring):
            # a filename
            self.name = f
            if new:
                # new table (table file must be
                # created or opened and truncated)
                self.stream = file(f, "w+b")
            else:
                # tabe file must exist
                self.stream = file(f, ("r+b", "rb")[bool(readOnly)])
        else:
            # a stream
            self.name = getattr(f, "name", "")
            self.stream = f
        if new:
            # if this is a new table, header will be empty
            self.header = self.HeaderClass()
        else:
            # or instantiated using stream
            self.header = self.HeaderClass.fromStream(self.stream)
        self.ignoreErrors = ignoreErrors
        self._new = bool(new)
        self._changed = False
    ## properties
    closed = property(lambda self: self.stream.closed)
    recordCount = property(lambda self: self.header.recordCount)
    fieldNames = property(
        lambda self: [_fld.name for _fld in self.header.fields])
    fieldDefs = property(lambda self: self.header.fields)
    changed = property(lambda self: self._changed or self.header.changed)
    def ignoreErrors(self, value):
        """Update `ignoreErrors` flag on the header object and self"""
        self.header.ignoreErrors = self._ignore_errors = bool(value)
    ignoreErrors = property(
        lambda self: self._ignore_errors,
        ignoreErrors,
        doc="""Error processing mode for DBF field value conversion
        if set, failing field value conversion will return
        ``INVALID_VALUE`` instead of raising conversion error.
        """)
    ## protected methods
    def _fixIndex(self, index):
        """Return fixed index.
        This method fails if index isn't a numeric object
        (long or int). Or index isn't in a valid range
        (less or equal to the number of records in the db).
        If ``index`` is a negative number, it will be
        treated as a negative indexes for list objects.
        Return:
            Return value is numeric object maning valid index.
        """
        if not isinstance(index, (int, long)):
            raise TypeError("Index must be a numeric object")
        if index < 0:
            # index from the right side
            # fix it to the left-side index
            index += len(self) + 1
        if index >= len(self):
            raise IndexError("Record index out of range")
        return index
    ## iterface methods
    def close(self):
        self.flush()
        self.stream.close()
    def flush(self):
        """Flush data to the associated stream."""
        if self.changed:
            self.header.setCurrentDate()
            self.header.write(self.stream)
            self.stream.flush()
            self._changed = False
    def indexOfFieldName(self, name):
        """Index of field named ``name``."""
        # FIXME: move this to header class
        return self.header.fields.index(name)
    def newRecord(self):
        """Return new record, which belong to this table."""
        return self.RecordClass(self)
    def append(self, record):
        """Append ``record`` to the database."""
        record.index = self.header.recordCount
        record._write()
        self.header.recordCount += 1
        self._changed = True
        self._new = False
    def addField(self, *defs):
        """Add field definitions.
        For more information see `header.DbfHeader.addField`.
        """
        if self._new:
            self.header.addField(*defs)
        else:
            raise TypeError("At least one record was added, "
                "structure can't be changed")
    ## 'magic' methods (representation and sequence interface)
    def __repr__(self):
        return "Dbf stream '%s'\n" % self.stream + repr(self.header)
    def __len__(self):
        """Return number of records."""
        return self.recordCount
    def __getitem__(self, index):
        """Return `DbfRecord` instance."""
        return self.RecordClass.fromStream(self, self._fixIndex(index))
    def __setitem__(self, index, record):
        """Write `DbfRecord` instance to the stream."""
        record.index = self._fixIndex(index)
        record._write()
        self._changed = True
        self._new = False
    #def __del__(self):
    #    """Flush stream upon deletion of the object."""
    #    self.flush()
 def demoRead(filename):
    _dbf = Dbf(filename, True)
    for _rec in _dbf:
        print
        print(repr(_rec))
    _dbf.close()
 def demoCreate(filename):
    _dbf = Dbf(filename, new=True)
    _dbf.addField(
        ("NAME", "C", 15),
        ("SURNAME", "C", 25),
        ("INITIALS", "C", 10),
        ("BIRTHDATE", "D"),
    )
    for (_n, _s, _i, _b) in (
        ("John", "Miller", "YC", (1981, 1, 2)),
        ("Andy", "Larkin", "AL", (1982, 3, 4)),
        ("Bill", "Clinth", "", (1983, 5, 6)),
        ("Bobb", "McNail", "", (1984, 7, 8)),
    ):
        _rec = _dbf.newRecord()
        _rec["NAME"] = _n
        _rec["SURNAME"] = _s
        _rec["INITIALS"] = _i
        _rec["BIRTHDATE"] = _b
        _rec.store()
    print(repr(_dbf))
    _dbf.close()
 if (__name__=='__main__'):
    import sys
    _name = len(sys.argv) > 1 and sys.argv[1] or "county.dbf"
    demoCreate(_name)
    demoRead(_name)
 # vim: set et sw=4 sts=4 :
@@ -0,0 +1,188 @@
 #!/usr/bin/python
 """.DBF creation helpers.
 Note: this is a legacy interface.  New code should use Dbf class
    for table creation (see examples in dbf.py)
 TODO:
  - handle Memo fields.
  - check length of the fields accoring to the
    `http://www.clicketyclick.dk/databases/xbase/format/data_types.html`
 """
 """History (most recent first)
 04-jul-2006 [als]   added export declaration;
                    updated for dbfpy 2.0
 15-dec-2005 [yc]    define dbf_new.__slots__
 14-dec-2005 [yc]    added vim modeline; retab'd; added doc-strings;
                    dbf_new now is a new class (inherited from object)
 ??-jun-2000 [--]    added by Hans Fiby
 """
 __version__ = "$Revision: 1.4 $"[11:-2]
 __date__ = "$Date: 2006/07/04 08:18:18 $"[7:-2]
 __all__ = ["dbf_new"]
 from dbf import *
 from fields import *
 from header import *
 from record import *
 class _FieldDefinition(object):
    """Field definition.
    This is a simple structure, which contains ``name``, ``type``,
    ``len``, ``dec`` and ``cls`` fields.
    Objects also implement get/setitem magic functions, so fields
    could be accessed via sequence iterface, where 'name' has
    index 0, 'type' index 1, 'len' index 2, 'dec' index 3 and
    'cls' could be located at index 4.
    """
    __slots__ = "name", "type", "len", "dec", "cls"
    # WARNING: be attentive - dictionaries are mutable!
    FLD_TYPES = {
        # type: (cls, len)
        "C": (DbfCharacterFieldDef, None),
        "N": (DbfNumericFieldDef, None),
        "L": (DbfLogicalFieldDef, 1),
        # FIXME: support memos
        # "M": (DbfMemoFieldDef),
        "D": (DbfDateFieldDef, 8),
        # FIXME: I'm not sure length should be 14 characters!
        # but temporary I use it, cuz date is 8 characters
        # and time 6 (hhmmss)
        "T": (DbfDateTimeFieldDef, 14),
    }
    def __init__(self, name, type, len=None, dec=0):
        _cls, _len = self.FLD_TYPES[type]
        if _len is None:
            if len is None:
                raise ValueError("Field length must be defined")
            _len = len
        self.name = name
        self.type = type
        self.len = _len
        self.dec = dec
        self.cls = _cls
    def getDbfField(self):
        "Return `DbfFieldDef` instance from the current definition."
        return self.cls(self.name, self.len, self.dec)
    def appendToHeader(self, dbfh):
        """Create a `DbfFieldDef` instance and append it to the dbf header.
        Arguments:
            dbfh: `DbfHeader` instance.
        """
        _dbff = self.getDbfField()
        dbfh.addField(_dbff)
 class dbf_new(object):
    """New .DBF creation helper.
    Example Usage:
        dbfn = dbf_new()
        dbfn.add_field("name",'C',80)
        dbfn.add_field("price",'N',10,2)
        dbfn.add_field("date",'D',8)
        dbfn.write("tst.dbf")
    Note:
        This module cannot handle Memo-fields,
        they are special.
    """
    __slots__ = ("fields",)
    FieldDefinitionClass = _FieldDefinition
    def __init__(self):
        self.fields = []
    def add_field(self, name, typ, len, dec=0):
        """Add field definition.
        Arguments:
            name:
                field name (str object). field name must not
                contain ASCII NULs and it's length shouldn't
                exceed 10 characters.
            typ:
                type of the field. this must be a single character
                from the "CNLMDT" set meaning character, numeric,
                logical, memo, date and date/time respectively.
            len:
                length of the field. this argument is used only for
                the character and numeric fields. all other fields
                have fixed length.
                FIXME: use None as a default for this argument?
            dec:
                decimal precision. used only for the numric fields.
        """
        self.fields.append(self.FieldDefinitionClass(name, typ, len, dec))
    def write(self, filename):
        """Create empty .DBF file using current structure."""
        _dbfh = DbfHeader()
        _dbfh.setCurrentDate()
        for _fldDef in self.fields:
            _fldDef.appendToHeader(_dbfh)
        _dbfStream = file(filename, "wb")
        _dbfh.write(_dbfStream)
        _dbfStream.close()
    def write_stream(self, stream):
        _dbfh = DbfHeader()
        _dbfh.setCurrentDate()
        for _fldDef in self.fields:
            _fldDef.appendToHeader(_dbfh)
        _dbfh.write(stream)
 if (__name__=='__main__'):
    # create a new DBF-File
    dbfn=dbf_new()
    dbfn.add_field("name",'C',80)
    dbfn.add_field("price",'N',10,2)
    dbfn.add_field("date",'D',8)
    dbfn.write("tst.dbf")
    # test new dbf
    print "*** created tst.dbf: ***"
    dbft = Dbf('tst.dbf', readOnly=0)
    print repr(dbft)
    # add a record
    rec=DbfRecord(dbft)
    rec['name']='something'
    rec['price']=10.5
    rec['date']=(2000,1,12)
    rec.store()
    # add another record
    rec=DbfRecord(dbft)
    rec['name']='foo and bar'
    rec['price']=12234
    rec['date']=(1992,7,15)
    rec.store()
    # show the records
    print "*** inserted 2 records into tst.dbf: ***"
    print repr(dbft)
    for i1 in range(len(dbft)):
        rec = dbft[i1]
        for fldName in dbft.fieldNames:
            print '%s:\t %s'%(fldName, rec[fldName])
        print
    dbft.close()
 # vim: set et sts=4 sw=4 :
@@ -0,0 +1,466 @@
 """DBF fields definitions.
 TODO:
  - make memos work
 """
 """History (most recent first):
 26-may-2009 [als]   DbfNumericFieldDef.decodeValue: strip zero bytes
 05-feb-2009 [als]   DbfDateFieldDef.encodeValue: empty arg produces empty date
 16-sep-2008 [als]   DbfNumericFieldDef decoding looks for decimal point
                    in the value to select float or integer return type
 13-mar-2008 [als]   check field name length in constructor
 11-feb-2007 [als]   handle value conversion errors
 10-feb-2007 [als]   DbfFieldDef: added .rawFromRecord()
 01-dec-2006 [als]   Timestamp columns use None for empty values
 31-oct-2006 [als]   support field types 'F' (float), 'I' (integer)
                    and 'Y' (currency);
                    automate export and registration of field classes
 04-jul-2006 [als]   added export declaration
 10-mar-2006 [als]   decode empty values for Date and Logical fields;
                    show field name in errors
 10-mar-2006 [als]   fix Numeric value decoding: according to spec,
                    value always is string representation of the number;
                    ensure that encoded Numeric value fits into the field
 20-dec-2005 [yc]    use field names in upper case
 15-dec-2005 [yc]    field definitions moved from `dbf`.
 """
 __version__ = "$Revision: 1.14 $"[11:-2]
 __date__ = "$Date: 2009/05/26 05:16:51 $"[7:-2]
 __all__ = ["lookupFor",] # field classes added at the end of the module
 import datetime
 import struct
 import sys
 from . import utils
 ## abstract definitions
 class DbfFieldDef(object):
    """Abstract field definition.
    Child classes must override ``type`` class attribute to provide datatype
    infromation of the field definition. For more info about types visit
    `http://www.clicketyclick.dk/databases/xbase/format/data_types.html`
    Also child classes must override ``defaultValue`` field to provide
    default value for the field value.
    If child class has fixed length ``length`` class attribute must be
    overriden and set to the valid value. None value means, that field
    isn't of fixed length.
    Note: ``name`` field must not be changed after instantiation.
    """
    __slots__ = ("name", "length", "decimalCount",
        "start", "end", "ignoreErrors")
    # length of the field, None in case of variable-length field,
    # or a number if this field is a fixed-length field
    length = None
    # field type. for more information about fields types visit
    # `http://www.clicketyclick.dk/databases/xbase/format/data_types.html`
    # must be overriden in child classes
    typeCode = None
    # default value for the field. this field must be
    # overriden in child classes
    defaultValue = None
    def __init__(self, name, length=None, decimalCount=None,
        start=None, stop=None, ignoreErrors=False,
    ):
        """Initialize instance."""
        assert self.typeCode is not None, "Type code must be overriden"
        assert self.defaultValue is not None, "Default value must be overriden"
        ## fix arguments
        if len(name) >10:
            raise ValueError("Field name \"%s\" is too long" % name)
        name = str(name).upper()
        if self.__class__.length is None:
            if length is None:
                raise ValueError("[%s] Length isn't specified" % name)
            length = int(length)
            if length <= 0:
                raise ValueError("[%s] Length must be a positive integer"
                    % name)
        else:
            length = self.length
        if decimalCount is None:
            decimalCount = 0
        ## set fields
        self.name = name
        # FIXME: validate length according to the specification at
        # http://www.clicketyclick.dk/databases/xbase/format/data_types.html
        self.length = length
        self.decimalCount = decimalCount
        self.ignoreErrors = ignoreErrors
        self.start = start
        self.end = stop
    def __cmp__(self, other):
        return cmp(self.name, str(other).upper())
    def __hash__(self):
        return hash(self.name)
    def fromString(cls, string, start, ignoreErrors=False):
        """Decode dbf field definition from the string data.
        Arguments:
            string:
                a string, dbf definition is decoded from. length of
                the string must be 32 bytes.
            start:
                position in the database file.
            ignoreErrors:
                initial error processing mode for the new field (boolean)
        """
        assert len(string) == 32
        _length = ord(string[16])
        return cls(utils.unzfill(string)[:11], _length, ord(string[17]),
            start, start + _length, ignoreErrors=ignoreErrors)
    fromString = classmethod(fromString)
    def toString(self):
        """Return encoded field definition.
        Return:
            Return value is a string object containing encoded
            definition of this field.
        """
        if sys.version_info < (2, 4):
            # earlier versions did not support padding character
            _name = self.name[:11] + "\0" * (11 - len(self.name))
        else:
            _name = self.name.ljust(11, '\0')
        return (
            _name +
            self.typeCode +
            #data address
            chr(0) * 4 +
            chr(self.length) +
            chr(self.decimalCount) +
            chr(0) * 14
        )
    def __repr__(self):
        return "%-10s %1s %3d %3d" % self.fieldInfo()
    def fieldInfo(self):
        """Return field information.
        Return:
            Return value is a (name, type, length, decimals) tuple.
        """
        return (self.name, self.typeCode, self.length, self.decimalCount)
    def rawFromRecord(self, record):
        """Return a "raw" field value from the record string."""
        return record[self.start:self.end]
    def decodeFromRecord(self, record):
        """Return decoded field value from the record string."""
        try:
            return self.decodeValue(self.rawFromRecord(record))
        except:
            if self.ignoreErrors:
                return utils.INVALID_VALUE
            else:
                raise
    def decodeValue(self, value):
        """Return decoded value from string value.
        This method shouldn't be used publicly. It's called from the
        `decodeFromRecord` method.
        This is an abstract method and it must be overridden in child classes.
        """
        raise NotImplementedError
    def encodeValue(self, value):
        """Return str object containing encoded field value.
        This is an abstract method and it must be overriden in child classes.
        """
        raise NotImplementedError
 ## real classes
 class DbfCharacterFieldDef(DbfFieldDef):
    """Definition of the character field."""
    typeCode = "C"
    defaultValue = ""
    def decodeValue(self, value):
        """Return string object.
        Return value is a ``value`` argument with stripped right spaces.
        """
        return value.rstrip(" ")
    def encodeValue(self, value):
        """Return raw data string encoded from a ``value``."""
        return str(value)[:self.length].ljust(self.length)
 class DbfNumericFieldDef(DbfFieldDef):
    """Definition of the numeric field."""
    typeCode = "N"
    # XXX: now I'm not sure it was a good idea to make a class field
    # `defaultValue` instead of a generic method as it was implemented
    # previously -- it's ok with all types except number, cuz
    # if self.decimalCount is 0, we should return 0 and 0.0 otherwise.
    defaultValue = 0
    def decodeValue(self, value):
        """Return a number decoded from ``value``.
        If decimals is zero, value will be decoded as an integer;
        or as a float otherwise.
        Return:
            Return value is a int (long) or float instance.
        """
        value = value.strip(" \0")
        if "." in value:
            # a float (has decimal separator)
            return float(value)
        elif value:
            # must be an integer
            return int(value)
        else:
            return 0
    def encodeValue(self, value):
        """Return string containing encoded ``value``."""
        _rv = ("%*.*f" % (self.length, self.decimalCount, value))
        if len(_rv) > self.length:
            _ppos = _rv.find(".")
            if 0 <= _ppos <= self.length:
                _rv = _rv[:self.length]
            else:
                raise ValueError("[%s] Numeric overflow: %s (field width: %i)"
                    % (self.name, _rv, self.length))
        return _rv
 class DbfFloatFieldDef(DbfNumericFieldDef):
    """Definition of the float field - same as numeric."""
    typeCode = "F"
 class DbfIntegerFieldDef(DbfFieldDef):
    """Definition of the integer field."""
    typeCode = "I"
    length = 4
    defaultValue = 0
    def decodeValue(self, value):
        """Return an integer number decoded from ``value``."""
        return struct.unpack("<i", value)[0]
    def encodeValue(self, value):
        """Return string containing encoded ``value``."""
        return struct.pack("<i", int(value))
 class DbfCurrencyFieldDef(DbfFieldDef):
    """Definition of the currency field."""
    typeCode = "Y"
    length = 8
    defaultValue = 0.0
    def decodeValue(self, value):
        """Return float number decoded from ``value``."""
        return struct.unpack("<q", value)[0] / 10000.
    def encodeValue(self, value):
        """Return string containing encoded ``value``."""
        return struct.pack("<q", round(value * 10000))
 class DbfLogicalFieldDef(DbfFieldDef):
    """Definition of the logical field."""
    typeCode = "L"
    defaultValue = -1
    length = 1
    def decodeValue(self, value):
        """Return True, False or -1 decoded from ``value``."""
        # Note: value always is 1-char string
        if value == "?":
            return -1
        if value in "NnFf ":
            return False
        if value in "YyTt":
            return True
        raise ValueError("[%s] Invalid logical value %r" % (self.name, value))
    def encodeValue(self, value):
        """Return a character from the "TF?" set.
        Return:
            Return value is "T" if ``value`` is True
            "?" if value is -1 or False otherwise.
        """
        if value is True:
            return "T"
        if value == -1:
            return "?"
        return "F"
 class DbfMemoFieldDef(DbfFieldDef):
    """Definition of the memo field.
    Note: memos aren't currenly completely supported.
    """
    typeCode = "M"
    defaultValue = " " * 10
    length = 10
    def decodeValue(self, value):
        """Return int .dbt block number decoded from the string object."""
        #return int(value)
        raise NotImplementedError
    def encodeValue(self, value):
        """Return raw data string encoded from a ``value``.
        Note: this is an internal method.
        """
        #return str(value)[:self.length].ljust(self.length)
        raise NotImplementedError
 class DbfDateFieldDef(DbfFieldDef):
    """Definition of the date field."""
    typeCode = "D"
    defaultValue = utils.classproperty(lambda cls: datetime.date.today())
    # "yyyymmdd" gives us 8 characters
    length = 8
    def decodeValue(self, value):
        """Return a ``datetime.date`` instance decoded from ``value``."""
        if value.strip():
            return utils.getDate(value)
        else:
            return None
    def encodeValue(self, value):
        """Return a string-encoded value.
        ``value`` argument should be a value suitable for the
        `utils.getDate` call.
        Return:
            Return value is a string in format "yyyymmdd".
        """
        if value:
            return utils.getDate(value).strftime("%Y%m%d")
        else:
            return " " * self.length
 class DbfDateTimeFieldDef(DbfFieldDef):
    """Definition of the timestamp field."""
    # a difference between JDN (Julian Day Number)
    # and GDN (Gregorian Day Number). note, that GDN < JDN
    JDN_GDN_DIFF = 1721425
    typeCode = "T"
    defaultValue = utils.classproperty(lambda cls: datetime.datetime.now())
    # two 32-bits integers representing JDN and amount of
    # milliseconds respectively gives us 8 bytes.
    # note, that values must be encoded in LE byteorder.
    length = 8
    def decodeValue(self, value):
        """Return a `datetime.datetime` instance."""
        assert len(value) == self.length
        # LE byteorder
        _jdn, _msecs = struct.unpack("<2I", value)
        if _jdn >= 1:
            _rv = datetime.datetime.fromordinal(_jdn - self.JDN_GDN_DIFF)
            _rv += datetime.timedelta(0, _msecs / 1000.0)
        else:
            # empty date
            _rv = None
        return _rv
    def encodeValue(self, value):
        """Return a string-encoded ``value``."""
        if value:
            value = utils.getDateTime(value)
            # LE byteorder
            _rv = struct.pack("<2I", value.toordinal() + self.JDN_GDN_DIFF,
                (value.hour * 3600 + value.minute * 60 + value.second) * 1000)
        else:
            _rv = "\0" * self.length
        assert len(_rv) == self.length
        return _rv
 _fieldsRegistry = {}
 def registerField(fieldCls):
    """Register field definition class.
    ``fieldCls`` should be subclass of the `DbfFieldDef`.
    Use `lookupFor` to retrieve field definition class
    by the type code.
    """
    assert fieldCls.typeCode is not None, "Type code isn't defined"
    # XXX: use fieldCls.typeCode.upper()? in case of any decign
    # don't forget to look to the same comment in ``lookupFor`` method
    _fieldsRegistry[fieldCls.typeCode] = fieldCls
 def lookupFor(typeCode):
    """Return field definition class for the given type code.
    ``typeCode`` must be a single character. That type should be
    previously registered.
    Use `registerField` to register new field class.
    Return:
        Return value is a subclass of the `DbfFieldDef`.
    """
    # XXX: use typeCode.upper()? in case of any decign don't
    # forget to look to the same comment in ``registerField``
    return _fieldsRegistry[typeCode]
 ## register generic types
 for (_name, _val) in globals().items():
    if isinstance(_val, type) and issubclass(_val, DbfFieldDef) \
    and (_name != "DbfFieldDef"):
        __all__.append(_name)
        registerField(_val)
 del _name, _val
 # vim: et sts=4 sw=4 :
@@ -0,0 +1,275 @@
 """DBF header definition.
 TODO:
  - handle encoding of the character fields
    (encoding information stored in the DBF header)
 """
 """History (most recent first):
 16-sep-2010 [als]   fromStream: fix century of the last update field
 11-feb-2007 [als]   added .ignoreErrors
 10-feb-2007 [als]   added __getitem__: return field definitions
                    by field name or field number (zero-based)
 04-jul-2006 [als]   added export declaration
 15-dec-2005 [yc]    created
 """
 __version__ = "$Revision: 1.6 $"[11:-2]
 __date__ = "$Date: 2010/09/16 05:06:39 $"[7:-2]
 __all__ = ["DbfHeader"]
 try:
    import cStringIO
 except ImportError:
    # when we're in python3, we cStringIO has been replaced by io.StringIO
    import io as cStringIO
 import datetime
 import struct
 import time
 from . import fields
 from . import utils
 class DbfHeader(object):
    """Dbf header definition.
    For more information about dbf header format visit
    `http://www.clicketyclick.dk/databases/xbase/format/dbf.html#DBF_STRUCT`
    Examples:
        Create an empty dbf header and add some field definitions:
            dbfh = DbfHeader()
            dbfh.addField(("name", "C", 10))
            dbfh.addField(("date", "D"))
            dbfh.addField(DbfNumericFieldDef("price", 5, 2))
        Create a dbf header with field definitions:
            dbfh = DbfHeader([
                ("name", "C", 10),
                ("date", "D"),
                DbfNumericFieldDef("price", 5, 2),
            ])
    """
    __slots__ = ("signature", "fields", "lastUpdate", "recordLength",
        "recordCount", "headerLength", "changed", "_ignore_errors")
    ## instance construction and initialization methods
    def __init__(self, fields=None, headerLength=0, recordLength=0,
        recordCount=0, signature=0x03, lastUpdate=None, ignoreErrors=False,
    ):
        """Initialize instance.
        Arguments:
            fields:
                a list of field definitions;
            recordLength:
                size of the records;
            headerLength:
                size of the header;
            recordCount:
                number of records stored in DBF;
            signature:
                version number (aka signature). using 0x03 as a default meaning
                "File without DBT". for more information about this field visit
                ``http://www.clicketyclick.dk/databases/xbase/format/dbf.html#DBF_NOTE_1_TARGET``
            lastUpdate:
                date of the DBF's update. this could be a string ('yymmdd' or
                'yyyymmdd'), timestamp (int or float), datetime/date value,
                a sequence (assuming (yyyy, mm, dd, ...)) or an object having
                callable ``ticks`` field.
            ignoreErrors:
                error processing mode for DBF fields (boolean)
        """
        self.signature = signature
        if fields is None:
            self.fields = []
        else:
            self.fields = list(fields)
        self.lastUpdate = utils.getDate(lastUpdate)
        self.recordLength = recordLength
        self.headerLength = headerLength
        self.recordCount = recordCount
        self.ignoreErrors = ignoreErrors
        # XXX: I'm not sure this is safe to
        # initialize `self.changed` in this way
        self.changed = bool(self.fields)
    # @classmethod
    def fromString(cls, string):
        """Return header instance from the string object."""
        return cls.fromStream(cStringIO.StringIO(str(string)))
    fromString = classmethod(fromString)
    # @classmethod
    def fromStream(cls, stream):
        """Return header object from the stream."""
        stream.seek(0)
        _data = stream.read(32)
        (_cnt, _hdrLen, _recLen) = struct.unpack("<I2H", _data[4:12])
        #reserved = _data[12:32]
        _year = ord(_data[1])
        if _year < 80:
            # dBase II started at 1980.  It is quite unlikely
            # that actual last update date is before that year.
            _year += 2000
        else:
            _year += 1900
        ## create header object
        _obj = cls(None, _hdrLen, _recLen, _cnt, ord(_data[0]),
            (_year, ord(_data[2]), ord(_data[3])))
        ## append field definitions
        # position 0 is for the deletion flag
        _pos = 1
        _data = stream.read(1)
        # The field definitions are ended either by \x0D OR a newline
        # character, so we need to handle both when reading from a stream.
        # When writing, dbfpy appears to write newlines instead of \x0D.
        while _data[0] not in ["\x0D", "\n"]:
            _data += stream.read(31)
            _fld = fields.lookupFor(_data[11]).fromString(_data, _pos)
            _obj._addField(_fld)
            _pos = _fld.end
            _data = stream.read(1)
        return _obj
    fromStream = classmethod(fromStream)
    ## properties
    year = property(lambda self: self.lastUpdate.year)
    month = property(lambda self: self.lastUpdate.month)
    day = property(lambda self: self.lastUpdate.day)
    def ignoreErrors(self, value):
        """Update `ignoreErrors` flag on self and all fields"""
        self._ignore_errors = value = bool(value)
        for _field in self.fields:
            _field.ignoreErrors = value
    ignoreErrors = property(
        lambda self: self._ignore_errors,
        ignoreErrors,
        doc="""Error processing mode for DBF field value conversion
        if set, failing field value conversion will return
        ``INVALID_VALUE`` instead of raising conversion error.
        """)
    ## object representation
    def __repr__(self):
        _rv = """\
 Version (signature): 0x%02x
        Last update: %s
      Header length: %d
      Record length: %d
       Record count: %d
 FieldName Type Len Dec
 """ % (self.signature, self.lastUpdate, self.headerLength,
    self.recordLength, self.recordCount)
        _rv += "\n".join(
            ["%10s %4s %3s %3s" % _fld.fieldInfo() for _fld in self.fields]
        )
        return _rv
    ## internal methods
    def _addField(self, *defs):
        """Internal variant of the `addField` method.
        This method doesn't set `self.changed` field to True.
        Return value is a length of the appended records.
        Note: this method doesn't modify ``recordLength`` and
        ``headerLength`` fields. Use `addField` instead of this
        method if you don't exactly know what you're doing.
        """
        # insure we have dbf.DbfFieldDef instances first (instantiation
        # from the tuple could raise an error, in such a case I don't
        # wanna add any of the definitions -- all will be ignored)
        _defs = []
        _recordLength = 0
        for _def in defs:
            if isinstance(_def, fields.DbfFieldDef):
                _obj = _def
            else:
                (_name, _type, _len, _dec) = (tuple(_def) + (None,) * 4)[:4]
                _cls = fields.lookupFor(_type)
                _obj = _cls(_name, _len, _dec,
                    ignoreErrors=self._ignore_errors)
            _recordLength += _obj.length
            _defs.append(_obj)
        # and now extend field definitions and
        # update record length
        self.fields += _defs
        return _recordLength
    ## interface methods
    def addField(self, *defs):
        """Add field definition to the header.
        Examples:
            dbfh.addField(
                ("name", "C", 20),
                dbf.DbfCharacterFieldDef("surname", 20),
                dbf.DbfDateFieldDef("birthdate"),
                ("member", "L"),
            )
            dbfh.addField(("price", "N", 5, 2))
            dbfh.addField(dbf.DbfNumericFieldDef("origprice", 5, 2))
        """
        _oldLen = self.recordLength
        self.recordLength += self._addField(*defs)
        if not _oldLen:
            self.recordLength += 1
            # XXX: may be just use:
            # self.recordeLength += self._addField(*defs) + bool(not _oldLen)
        # recalculate headerLength
        self.headerLength = 32 + (32 * len(self.fields)) + 1
        self.changed = True
    def write(self, stream):
        """Encode and write header to the stream."""
        stream.seek(0)
        stream.write(self.toString())
        stream.write("".join([_fld.toString() for _fld in self.fields]))
        stream.write(chr(0x0D))   # cr at end of all hdr data
        self.changed = False
    def toString(self):
        """Returned 32 chars length string with encoded header."""
        return struct.pack("<4BI2H",
            self.signature,
            self.year - 1900,
            self.month,
            self.day,
            self.recordCount,
            self.headerLength,
            self.recordLength) + "\0" * 20
    def setCurrentDate(self):
        """Update ``self.lastUpdate`` field with current date value."""
        self.lastUpdate = datetime.date.today()
    def __getitem__(self, item):
        """Return a field definition by numeric index or name string"""
        if isinstance(item, basestring):
            _name = item.upper()
            for _field in self.fields:
                if _field.name == _name:
                    return _field
            else:
                raise KeyError(item)
        else:
            # item must be field index
            return self.fields[item]
 # vim: et sts=4 sw=4 :
@@ -0,0 +1,262 @@
 """DBF record definition.
 """
 """History (most recent first):
 11-feb-2007 [als]   __repr__: added special case for invalid field values
 10-feb-2007 [als]   added .rawFromStream()
 30-oct-2006 [als]   fix record length in .fromStream()
 04-jul-2006 [als]   added export declaration
 20-dec-2005 [yc]    DbfRecord.write() -> DbfRecord._write();
                    added delete() method.
 16-dec-2005 [yc]    record definition moved from `dbf`.
 """
 __version__ = "$Revision: 1.7 $"[11:-2]
 __date__ = "$Date: 2007/02/11 09:05:49 $"[7:-2]
 __all__ = ["DbfRecord"]
 from itertools import izip
 import utils
 class DbfRecord(object):
    """DBF record.
    Instances of this class shouldn't be created manualy,
    use `dbf.Dbf.newRecord` instead.
    Class implements mapping/sequence interface, so
    fields could be accessed via their names or indexes
    (names is a preffered way to access fields).
    Hint:
        Use `store` method to save modified record.
    Examples:
        Add new record to the database:
            db = Dbf(filename)
            rec = db.newRecord()
            rec["FIELD1"] = value1
            rec["FIELD2"] = value2
            rec.store()
        Or the same, but modify existed
        (second in this case) record:
            db = Dbf(filename)
            rec = db[2]
            rec["FIELD1"] = value1
            rec["FIELD2"] = value2
            rec.store()
    """
    __slots__ = "dbf", "index", "deleted", "fieldData"
    ## creation and initialization
    def __init__(self, dbf, index=None, deleted=False, data=None):
        """Instance initialiation.
        Arguments:
            dbf:
                A `Dbf.Dbf` instance this record belonogs to.
            index:
                An integer record index or None. If this value is
                None, record will be appended to the DBF.
            deleted:
                Boolean flag indicating whether this record
                is a deleted record.
            data:
                A sequence or None. This is a data of the fields.
                If this argument is None, default values will be used.
        """
        self.dbf = dbf
        # XXX: I'm not sure ``index`` is necessary
        self.index = index
        self.deleted = deleted
        if data is None:
            self.fieldData = [_fd.defaultValue for _fd in dbf.header.fields]
        else:
            self.fieldData = list(data)
    # XXX: validate self.index before calculating position?
    position = property(lambda self: self.dbf.header.headerLength + \
        self.index * self.dbf.header.recordLength)
    def rawFromStream(cls, dbf, index):
        """Return raw record contents read from the stream.
        Arguments:
            dbf:
                A `Dbf.Dbf` instance containing the record.
            index:
                Index of the record in the records' container.
                This argument can't be None in this call.
        Return value is a string containing record data in DBF format.
        """
        # XXX: may be write smth assuming, that current stream
        # position is the required one? it could save some
        # time required to calculate where to seek in the file
        dbf.stream.seek(dbf.header.headerLength +
            index * dbf.header.recordLength)
        return dbf.stream.read(dbf.header.recordLength)
    rawFromStream = classmethod(rawFromStream)
    def fromStream(cls, dbf, index):
        """Return a record read from the stream.
        Arguments:
            dbf:
                A `Dbf.Dbf` instance new record should belong to.
            index:
                Index of the record in the records' container.
                This argument can't be None in this call.
        Return value is an instance of the current class.
        """
        return cls.fromString(dbf, cls.rawFromStream(dbf, index), index)
    fromStream = classmethod(fromStream)
    def fromString(cls, dbf, string, index=None):
        """Return record read from the string object.
        Arguments:
            dbf:
                A `Dbf.Dbf` instance new record should belong to.
            string:
                A string new record should be created from.
            index:
                Index of the record in the container. If this
                argument is None, record will be appended.
        Return value is an instance of the current class.
        """
        return cls(dbf, index, string[0]=="*",
            [_fd.decodeFromRecord(string) for _fd in dbf.header.fields])
    fromString = classmethod(fromString)
    ## object representation
    def __repr__(self):
        _template = "%%%ds: %%s (%%s)" % max([len(_fld)
            for _fld in self.dbf.fieldNames])
        _rv = []
        for _fld in self.dbf.fieldNames:
            _val = self[_fld]
            if _val is utils.INVALID_VALUE:
                _rv.append(_template %
                    (_fld, "None", "value cannot be decoded"))
            else:
                _rv.append(_template % (_fld, _val, type(_val)))
        return "\n".join(_rv)
    ## protected methods
    def _write(self):
        """Write data to the dbf stream.
        Note:
            This isn't a public method, it's better to
            use 'store' instead publically.
            Be design ``_write`` method should be called
            only from the `Dbf` instance.
        """
        self._validateIndex(False)
        self.dbf.stream.seek(self.position)
        self.dbf.stream.write(self.toString())
        # FIXME: may be move this write somewhere else?
        # why we should check this condition for each record?
        if self.index == len(self.dbf):
            # this is the last record,
            # we should write SUB (ASCII 26)
            self.dbf.stream.write("\x1A")
    ## utility methods
    def _validateIndex(self, allowUndefined=True, checkRange=False):
        """Valid ``self.index`` value.
        If ``allowUndefined`` argument is True functions does nothing
        in case of ``self.index`` pointing to None object.
        """
        if self.index is None:
            if not allowUndefined:
                raise ValueError("Index is undefined")
        elif self.index < 0:
            raise ValueError("Index can't be negative (%s)" % self.index)
        elif checkRange and self.index <= self.dbf.header.recordCount:
            raise ValueError("There are only %d records in the DBF" %
                self.dbf.header.recordCount)
    ## interface methods
    def store(self):
        """Store current record in the DBF.
        If ``self.index`` is None, this record will be appended to the
        records of the DBF this records belongs to; or replaced otherwise.
        """
        self._validateIndex()
        if self.index is None:
            self.index = len(self.dbf)
            self.dbf.append(self)
        else:
            self.dbf[self.index] = self
    def delete(self):
        """Mark method as deleted."""
        self.deleted = True
    def toString(self):
        """Return string packed record values."""
        return "".join([" *"[self.deleted]] + [
            _def.encodeValue(_dat)
            for (_def, _dat) in izip(self.dbf.header.fields, self.fieldData)
        ])
    def asList(self):
        """Return a flat list of fields.
        Note:
            Change of the list's values won't change
            real values stored in this object.
        """
        return self.fieldData[:]
    def asDict(self):
        """Return a dictionary of fields.
        Note:
            Change of the dicts's values won't change
            real values stored in this object.
        """
        return dict([_i for _i in izip(self.dbf.fieldNames, self.fieldData)])
    def __getitem__(self, key):
        """Return value by field name or field index."""
        if isinstance(key, (long, int)):
            # integer index of the field
            return self.fieldData[key]
        # assuming string field name
        return self.fieldData[self.dbf.indexOfFieldName(key)]
    def __setitem__(self, key, value):
        """Set field value by integer index of the field or string name."""
        if isinstance(key, (int, long)):
            # integer index of the field
            return self.fieldData[key]
        # assuming string field name
        self.fieldData[self.dbf.indexOfFieldName(key)] = value
 # vim: et sts=4 sw=4 :
@@ -0,0 +1,170 @@
 """String utilities.
 TODO:
  - allow strings in getDateTime routine;
 """
 """History (most recent first):
 11-feb-2007 [als]   added INVALID_VALUE
 10-feb-2007 [als]   allow date strings padded with spaces instead of zeroes
 20-dec-2005 [yc]    handle long objects in getDate/getDateTime
 16-dec-2005 [yc]    created from ``strutil`` module.
 """
 __version__ = "$Revision: 1.4 $"[11:-2]
 __date__ = "$Date: 2007/02/11 08:57:17 $"[7:-2]
 import datetime
 import time
 def unzfill(str):
    """Return a string without ASCII NULs.
    This function searchers for the first NUL (ASCII 0) occurance
    and truncates string till that position.
    """
    try:
        return str[:str.index('\0')]
    except ValueError:
        return str
 def getDate(date=None):
    """Return `datetime.date` instance.
    Type of the ``date`` argument could be one of the following:
        None:
            use current date value;
        datetime.date:
            this value will be returned;
        datetime.datetime:
            the result of the date.date() will be returned;
        string:
            assuming "%Y%m%d" or "%y%m%dd" format;
        number:
            assuming it's a timestamp (returned for example
            by the time.time() call;
        sequence:
            assuming (year, month, day, ...) sequence;
    Additionaly, if ``date`` has callable ``ticks`` attribute,
    it will be used and result of the called would be treated
    as a timestamp value.
    """
    if date is None:
        # use current value
        return datetime.date.today()
    if isinstance(date, datetime.date):
        return date
    if isinstance(date, datetime.datetime):
        return date.date()
    if isinstance(date, (int, long, float)):
        # date is a timestamp
        return datetime.date.fromtimestamp(date)
    if isinstance(date, basestring):
        date = date.replace(" ", "0")
        if len(date) == 6:
            # yymmdd
            return datetime.date(*time.strptime(date, "%y%m%d")[:3])
        # yyyymmdd
        return datetime.date(*time.strptime(date, "%Y%m%d")[:3])
    if hasattr(date, "__getitem__"):
        # a sequence (assuming date/time tuple)
        return datetime.date(*date[:3])
    return datetime.date.fromtimestamp(date.ticks())
 def getDateTime(value=None):
    """Return `datetime.datetime` instance.
    Type of the ``value`` argument could be one of the following:
        None:
            use current date value;
        datetime.date:
            result will be converted to the `datetime.datetime` instance
            using midnight;
        datetime.datetime:
            ``value`` will be returned as is;
        string:
            *** CURRENTLY NOT SUPPORTED ***;
        number:
            assuming it's a timestamp (returned for example
            by the time.time() call;
        sequence:
            assuming (year, month, day, ...) sequence;
    Additionaly, if ``value`` has callable ``ticks`` attribute,
    it will be used and result of the called would be treated
    as a timestamp value.
    """
    if value is None:
        # use current value
        return datetime.datetime.today()
    if isinstance(value, datetime.datetime):
        return value
    if isinstance(value, datetime.date):
        return datetime.datetime.fromordinal(value.toordinal())
    if isinstance(value, (int, long, float)):
        # value is a timestamp
        return datetime.datetime.fromtimestamp(value)
    if isinstance(value, basestring):
        raise NotImplementedError("Strings aren't currently implemented")
    if hasattr(value, "__getitem__"):
        # a sequence (assuming date/time tuple)
        return datetime.datetime(*tuple(value)[:6])
    return datetime.datetime.fromtimestamp(value.ticks())
 class classproperty(property):
    """Works in the same way as a ``property``, but for the classes."""
    def __get__(self, obj, cls):
        return self.fget(cls)
 class _InvalidValue(object):
    """Value returned from DBF records when field validation fails
    The value is not equal to anything except for itself
    and equal to all empty values: None, 0, empty string etc.
    In other words, invalid value is equal to None and not equal
    to None at the same time.
    This value yields zero upon explicit conversion to a number type,
    empty string for string types, and False for boolean.
    """
    def __eq__(self, other):
        return not other
    def __ne__(self, other):
        return not (other is self)
    def __nonzero__(self):
        return False
    def __int__(self):
        return 0
    __long__ = __int__
    def __float__(self):
        return 0.0
    def __str__(self):
        return ""
    def __unicode__(self):
        return u""
    def __repr__(self):
        return "<INVALID>"
 # invalid value is a constant singleton
 INVALID_VALUE = _InvalidValue()
 # vim: set et sts=4 sw=4 :
@@ -0,0 +1,293 @@
 #! /usr/bin/env python
 """DBF accessing helpers.
 FIXME: more documentation needed
 Examples:
    Create new table, setup structure, add records:
        dbf = Dbf(filename, new=True)
        dbf.addField(
            ("NAME", "C", 15),
            ("SURNAME", "C", 25),
            ("INITIALS", "C", 10),
            ("BIRTHDATE", "D"),
        )
        for (n, s, i, b) in (
            ("John", "Miller", "YC", (1980, 10, 11)),
            ("Andy", "Larkin", "", (1980, 4, 11)),
        ):
            rec = dbf.newRecord()
            rec["NAME"] = n
            rec["SURNAME"] = s
            rec["INITIALS"] = i
            rec["BIRTHDATE"] = b
            rec.store()
        dbf.close()
    Open existed dbf, read some data:
        dbf = Dbf(filename, True)
        for rec in dbf:
            for fldName in dbf.fieldNames:
                print '%s:\t %s (%s)' % (fldName, rec[fldName],
                    type(rec[fldName]))
            print
        dbf.close()
 """
 """History (most recent first):
 11-feb-2007 [als]   export INVALID_VALUE;
                    Dbf: added .ignoreErrors, .INVALID_VALUE
 04-jul-2006 [als]   added export declaration
 20-dec-2005 [yc]    removed fromStream and newDbf methods:
                    use argument of __init__ call must be used instead;
                    added class fields pointing to the header and
                    record classes.
 17-dec-2005 [yc]    split to several modules; reimplemented
 13-dec-2005 [yc]    adapted to the changes of the `strutil` module.
 13-sep-2002 [als]   support FoxPro Timestamp datatype
 15-nov-1999 [jjk]   documentation updates, add demo
 24-aug-1998 [jjk]   add some encodeValue methods (not tested), other tweaks
 08-jun-1998 [jjk]   fix problems, add more features
 20-feb-1998 [jjk]   fix problems, add more features
 19-feb-1998 [jjk]   add create/write capabilities
 18-feb-1998 [jjk]   from dbfload.py
 """
 __version__ = "$Revision: 1.7 $"[11:-2]
 __date__ = "$Date: 2007/02/11 09:23:13 $"[7:-2]
 __author__ = "Jeff Kunce <kuncej@mail.conservation.state.mo.us>"
 __all__ = ["Dbf"]
 from . import header
 from . import record
 from .utils import INVALID_VALUE
 class Dbf(object):
    """DBF accessor.
    FIXME:
        docs and examples needed (dont' forget to tell
        about problems adding new fields on the fly)
    Implementation notes:
        ``_new`` field is used to indicate whether this is
        a new data table. `addField` could be used only for
        the new tables! If at least one record was appended
        to the table it's structure couldn't be changed.
    """
    __slots__ = ("name", "header", "stream",
        "_changed", "_new", "_ignore_errors")
    HeaderClass = header.DbfHeader
    RecordClass = record.DbfRecord
    INVALID_VALUE = INVALID_VALUE
    ## initialization and creation helpers
    def __init__(self, f, readOnly=False, new=False, ignoreErrors=False):
        """Initialize instance.
        Arguments:
            f:
                Filename or file-like object.
            new:
                True if new data table must be created. Assume
                data table exists if this argument is False.
            readOnly:
                if ``f`` argument is a string file will
                be opend in read-only mode; in other cases
                this argument is ignored. This argument is ignored
                even if ``new`` argument is True.
            headerObj:
                `header.DbfHeader` instance or None. If this argument
                is None, new empty header will be used with the
                all fields set by default.
            ignoreErrors:
                if set, failing field value conversion will return
                ``INVALID_VALUE`` instead of raising conversion error.
        """
        if isinstance(f, str):
            # a filename
            self.name = f
            if new:
                # new table (table file must be
                # created or opened and truncated)
                self.stream = open(f, "w+b")
            else:
                # tabe file must exist
                self.stream = open(f, ("r+b", "rb")[bool(readOnly)])
        else:
            # a stream
            self.name = getattr(f, "name", "")
            self.stream = f
        if new:
            # if this is a new table, header will be empty
            self.header = self.HeaderClass()
        else:
            # or instantiated using stream
            self.header = self.HeaderClass.fromStream(self.stream)
        self.ignoreErrors = ignoreErrors
        self._new = bool(new)
        self._changed = False
    ## properties
    closed = property(lambda self: self.stream.closed)
    recordCount = property(lambda self: self.header.recordCount)
    fieldNames = property(
        lambda self: [_fld.name for _fld in self.header.fields])
    fieldDefs = property(lambda self: self.header.fields)
    changed = property(lambda self: self._changed or self.header.changed)
    def ignoreErrors(self, value):
        """Update `ignoreErrors` flag on the header object and self"""
        self.header.ignoreErrors = self._ignore_errors = bool(value)
    ignoreErrors = property(
        lambda self: self._ignore_errors,
        ignoreErrors,
        doc="""Error processing mode for DBF field value conversion
        if set, failing field value conversion will return
        ``INVALID_VALUE`` instead of raising conversion error.
        """)
    ## protected methods
    def _fixIndex(self, index):
        """Return fixed index.
        This method fails if index isn't a numeric object
        (long or int). Or index isn't in a valid range
        (less or equal to the number of records in the db).
        If ``index`` is a negative number, it will be
        treated as a negative indexes for list objects.
        Return:
            Return value is numeric object maning valid index.
        """
        if not isinstance(index, int):
            raise TypeError("Index must be a numeric object")
        if index < 0:
            # index from the right side
            # fix it to the left-side index
            index += len(self) + 1
        if index >= len(self):
            raise IndexError("Record index out of range")
        return index
    ## iterface methods
    def close(self):
        self.flush()
        self.stream.close()
    def flush(self):
        """Flush data to the associated stream."""
        if self.changed:
            self.header.setCurrentDate()
            self.header.write(self.stream)
            self.stream.flush()
            self._changed = False
    def indexOfFieldName(self, name):
        """Index of field named ``name``."""
        # FIXME: move this to header class
        names = [f.name for f in self.header.fields]
        return names.index(name.upper())
    def newRecord(self):
        """Return new record, which belong to this table."""
        return self.RecordClass(self)
    def append(self, record):
        """Append ``record`` to the database."""
        record.index = self.header.recordCount
        record._write()
        self.header.recordCount += 1
        self._changed = True
        self._new = False
    def addField(self, *defs):
        """Add field definitions.
        For more information see `header.DbfHeader.addField`.
        """
        if self._new:
            self.header.addField(*defs)
        else:
            raise TypeError("At least one record was added, "
                "structure can't be changed")
    ## 'magic' methods (representation and sequence interface)
    def __repr__(self):
        return "Dbf stream '%s'\n" % self.stream + repr(self.header)
    def __len__(self):
        """Return number of records."""
        return self.recordCount
    def __getitem__(self, index):
        """Return `DbfRecord` instance."""
        return self.RecordClass.fromStream(self, self._fixIndex(index))
    def __setitem__(self, index, record):
        """Write `DbfRecord` instance to the stream."""
        record.index = self._fixIndex(index)
        record._write()
        self._changed = True
        self._new = False
    #def __del__(self):
    #    """Flush stream upon deletion of the object."""
    #    self.flush()
 def demoRead(filename):
    _dbf = Dbf(filename, True)
    for _rec in _dbf:
        print()
        print(repr(_rec))
    _dbf.close()
 def demoCreate(filename):
    _dbf = Dbf(filename, new=True)
    _dbf.addField(
        ("NAME", "C", 15),
        ("SURNAME", "C", 25),
        ("INITIALS", "C", 10),
        ("BIRTHDATE", "D"),
    )
    for (_n, _s, _i, _b) in (
        ("John", "Miller", "YC", (1981, 1, 2)),
        ("Andy", "Larkin", "AL", (1982, 3, 4)),
        ("Bill", "Clinth", "", (1983, 5, 6)),
        ("Bobb", "McNail", "", (1984, 7, 8)),
    ):
        _rec = _dbf.newRecord()
        _rec["NAME"] = _n
        _rec["SURNAME"] = _s
        _rec["INITIALS"] = _i
        _rec["BIRTHDATE"] = _b
        _rec.store()
    print(repr(_dbf))
    _dbf.close()
 if (__name__=='__main__'):
    import sys
    _name = len(sys.argv) > 1 and sys.argv[1] or "county.dbf"
    demoCreate(_name)
    demoRead(_name)
 # vim: set et sw=4 sts=4 :
@@ -0,0 +1,182 @@
 #!/usr/bin/python
 """.DBF creation helpers.
 Note: this is a legacy interface.  New code should use Dbf class
    for table creation (see examples in dbf.py)
 TODO:
  - handle Memo fields.
  - check length of the fields accoring to the
    `http://www.clicketyclick.dk/databases/xbase/format/data_types.html`
 """
 """History (most recent first)
 04-jul-2006 [als]   added export declaration;
                    updated for dbfpy 2.0
 15-dec-2005 [yc]    define dbf_new.__slots__
 14-dec-2005 [yc]    added vim modeline; retab'd; added doc-strings;
                    dbf_new now is a new class (inherited from object)
 ??-jun-2000 [--]    added by Hans Fiby
 """
 __version__ = "$Revision: 1.4 $"[11:-2]
 __date__ = "$Date: 2006/07/04 08:18:18 $"[7:-2]
 __all__ = ["dbf_new"]
 from .dbf import *
 from .fields import *
 from .header import *
 from .record import *
 class _FieldDefinition(object):
    """Field definition.
    This is a simple structure, which contains ``name``, ``type``,
    ``len``, ``dec`` and ``cls`` fields.
    Objects also implement get/setitem magic functions, so fields
    could be accessed via sequence iterface, where 'name' has
    index 0, 'type' index 1, 'len' index 2, 'dec' index 3 and
    'cls' could be located at index 4.
    """
    __slots__ = "name", "type", "len", "dec", "cls"
    # WARNING: be attentive - dictionaries are mutable!
    FLD_TYPES = {
        # type: (cls, len)
        "C": (DbfCharacterFieldDef, None),
        "N": (DbfNumericFieldDef, None),
        "L": (DbfLogicalFieldDef, 1),
        # FIXME: support memos
        # "M": (DbfMemoFieldDef),
        "D": (DbfDateFieldDef, 8),
        # FIXME: I'm not sure length should be 14 characters!
        # but temporary I use it, cuz date is 8 characters
        # and time 6 (hhmmss)
        "T": (DbfDateTimeFieldDef, 14),
    }
    def __init__(self, name, type, len=None, dec=0):
        _cls, _len = self.FLD_TYPES[type]
        if _len is None:
            if len is None:
                raise ValueError("Field length must be defined")
            _len = len
        self.name = name
        self.type = type
        self.len = _len
        self.dec = dec
        self.cls = _cls
    def getDbfField(self):
        "Return `DbfFieldDef` instance from the current definition."
        return self.cls(self.name, self.len, self.dec)
    def appendToHeader(self, dbfh):
        """Create a `DbfFieldDef` instance and append it to the dbf header.
        Arguments:
            dbfh: `DbfHeader` instance.
        """
        _dbff = self.getDbfField()
        dbfh.addField(_dbff)
 class dbf_new(object):
    """New .DBF creation helper.
    Example Usage:
        dbfn = dbf_new()
        dbfn.add_field("name",'C',80)
        dbfn.add_field("price",'N',10,2)
        dbfn.add_field("date",'D',8)
        dbfn.write("tst.dbf")
    Note:
        This module cannot handle Memo-fields,
        they are special.
    """
    __slots__ = ("fields",)
    FieldDefinitionClass = _FieldDefinition
    def __init__(self):
        self.fields = []
    def add_field(self, name, typ, len, dec=0):
        """Add field definition.
        Arguments:
            name:
                field name (str object). field name must not
                contain ASCII NULs and it's length shouldn't
                exceed 10 characters.
            typ:
                type of the field. this must be a single character
                from the "CNLMDT" set meaning character, numeric,
                logical, memo, date and date/time respectively.
            len:
                length of the field. this argument is used only for
                the character and numeric fields. all other fields
                have fixed length.
                FIXME: use None as a default for this argument?
            dec:
                decimal precision. used only for the numric fields.
        """
        self.fields.append(self.FieldDefinitionClass(name, typ, len, dec))
    def write(self, filename):
        """Create empty .DBF file using current structure."""
        _dbfh = DbfHeader()
        _dbfh.setCurrentDate()
        for _fldDef in self.fields:
            _fldDef.appendToHeader(_dbfh)
        _dbfStream = open(filename, "wb")
        _dbfh.write(_dbfStream)
        _dbfStream.close()
 if (__name__=='__main__'):
    # create a new DBF-File
    dbfn=dbf_new()
    dbfn.add_field("name",'C',80)
    dbfn.add_field("price",'N',10,2)
    dbfn.add_field("date",'D',8)
    dbfn.write("tst.dbf")
    # test new dbf
    print("*** created tst.dbf: ***")
    dbft = Dbf('tst.dbf', readOnly=0)
    print(repr(dbft))
    # add a record
    rec=DbfRecord(dbft)
    rec['name']='something'
    rec['price']=10.5
    rec['date']=(2000,1,12)
    rec.store()
    # add another record
    rec=DbfRecord(dbft)
    rec['name']='foo and bar'
    rec['price']=12234
    rec['date']=(1992,7,15)
    rec.store()
    # show the records
    print("*** inserted 2 records into tst.dbf: ***")
    print(repr(dbft))
    for i1 in range(len(dbft)):
        rec = dbft[i1]
        for fldName in dbft.fieldNames:
            print('%s:\t %s'%(fldName, rec[fldName]))
        print()
    dbft.close()
 # vim: set et sts=4 sw=4 :
@@ -0,0 +1,467 @@
 """DBF fields definitions.
 TODO:
  - make memos work
 """
 """History (most recent first):
 26-may-2009 [als]   DbfNumericFieldDef.decodeValue: strip zero bytes
 05-feb-2009 [als]   DbfDateFieldDef.encodeValue: empty arg produces empty date
 16-sep-2008 [als]   DbfNumericFieldDef decoding looks for decimal point
                    in the value to select float or integer return type
 13-mar-2008 [als]   check field name length in constructor
 11-feb-2007 [als]   handle value conversion errors
 10-feb-2007 [als]   DbfFieldDef: added .rawFromRecord()
 01-dec-2006 [als]   Timestamp columns use None for empty values
 31-oct-2006 [als]   support field types 'F' (float), 'I' (integer)
                    and 'Y' (currency);
                    automate export and registration of field classes
 04-jul-2006 [als]   added export declaration
 10-mar-2006 [als]   decode empty values for Date and Logical fields;
                    show field name in errors
 10-mar-2006 [als]   fix Numeric value decoding: according to spec,
                    value always is string representation of the number;
                    ensure that encoded Numeric value fits into the field
 20-dec-2005 [yc]    use field names in upper case
 15-dec-2005 [yc]    field definitions moved from `dbf`.
 """
 __version__ = "$Revision: 1.14 $"[11:-2]
 __date__ = "$Date: 2009/05/26 05:16:51 $"[7:-2]
 __all__ = ["lookupFor",] # field classes added at the end of the module
 import datetime
 import struct
 import sys
 from . import utils
 ## abstract definitions
 class DbfFieldDef(object):
    """Abstract field definition.
    Child classes must override ``type`` class attribute to provide datatype
    infromation of the field definition. For more info about types visit
    `http://www.clicketyclick.dk/databases/xbase/format/data_types.html`
    Also child classes must override ``defaultValue`` field to provide
    default value for the field value.
    If child class has fixed length ``length`` class attribute must be
    overriden and set to the valid value. None value means, that field
    isn't of fixed length.
    Note: ``name`` field must not be changed after instantiation.
    """
    __slots__ = ("name", "decimalCount",
        "start", "end", "ignoreErrors")
    # length of the field, None in case of variable-length field,
    # or a number if this field is a fixed-length field
    length = None
    # field type. for more information about fields types visit
    # `http://www.clicketyclick.dk/databases/xbase/format/data_types.html`
    # must be overriden in child classes
    typeCode = None
    # default value for the field. this field must be
    # overriden in child classes
    defaultValue = None
    def __init__(self, name, length=None, decimalCount=None,
        start=None, stop=None, ignoreErrors=False,
    ):
        """Initialize instance."""
        assert self.typeCode is not None, "Type code must be overriden"
        assert self.defaultValue is not None, "Default value must be overriden"
        ## fix arguments
        if len(name) >10:
            raise ValueError("Field name \"%s\" is too long" % name)
        name = str(name).upper()
        if self.__class__.length is None:
            if length is None:
                raise ValueError("[%s] Length isn't specified" % name)
            length = int(length)
            if length <= 0:
                raise ValueError("[%s] Length must be a positive integer"
                    % name)
        else:
            length = self.length
        if decimalCount is None:
            decimalCount = 0
        ## set fields
        self.name = name
        # FIXME: validate length according to the specification at
        # http://www.clicketyclick.dk/databases/xbase/format/data_types.html
        self.length = length
        self.decimalCount = decimalCount
        self.ignoreErrors = ignoreErrors
        self.start = start
        self.end = stop
    def __cmp__(self, other):
        return cmp(self.name, str(other).upper())
    def __hash__(self):
        return hash(self.name)
    def fromString(cls, string, start, ignoreErrors=False):
        """Decode dbf field definition from the string data.
        Arguments:
            string:
                a string, dbf definition is decoded from. length of
                the string must be 32 bytes.
            start:
                position in the database file.
            ignoreErrors:
                initial error processing mode for the new field (boolean)
        """
        assert len(string) == 32
        _length = string[16]
        return cls(utils.unzfill(string)[:11].decode('utf-8'), _length,
            string[17], start, start + _length, ignoreErrors=ignoreErrors)
    fromString = classmethod(fromString)
    def toString(self):
        """Return encoded field definition.
        Return:
            Return value is a string object containing encoded
            definition of this field.
        """
        if sys.version_info < (2, 4):
            # earlier versions did not support padding character
            _name = self.name[:11] + "\0" * (11 - len(self.name))
        else:
            _name = self.name.ljust(11, '\0')
        return (
            _name +
            self.typeCode +
            #data address
            chr(0) * 4 +
            chr(self.length) +
            chr(self.decimalCount) +
            chr(0) * 14
        )
    def __repr__(self):
        return "%-10s %1s %3d %3d" % self.fieldInfo()
    def fieldInfo(self):
        """Return field information.
        Return:
            Return value is a (name, type, length, decimals) tuple.
        """
        return (self.name, self.typeCode, self.length, self.decimalCount)
    def rawFromRecord(self, record):
        """Return a "raw" field value from the record string."""
        return record[self.start:self.end]
    def decodeFromRecord(self, record):
        """Return decoded field value from the record string."""
        try:
            return self.decodeValue(self.rawFromRecord(record))
        except:
            if self.ignoreErrors:
                return utils.INVALID_VALUE
            else:
                raise
    def decodeValue(self, value):
        """Return decoded value from string value.
        This method shouldn't be used publicly. It's called from the
        `decodeFromRecord` method.
        This is an abstract method and it must be overridden in child classes.
        """
        raise NotImplementedError
    def encodeValue(self, value):
        """Return str object containing encoded field value.
        This is an abstract method and it must be overriden in child classes.
        """
        raise NotImplementedError
 ## real classes
 class DbfCharacterFieldDef(DbfFieldDef):
    """Definition of the character field."""
    typeCode = "C"
    defaultValue = b''
    def decodeValue(self, value):
        """Return string object.
        Return value is a ``value`` argument with stripped right spaces.
        """
        return value.rstrip(b' ').decode('utf-8')
    def encodeValue(self, value):
        """Return raw data string encoded from a ``value``."""
        return str(value)[:self.length].ljust(self.length)
 class DbfNumericFieldDef(DbfFieldDef):
    """Definition of the numeric field."""
    typeCode = "N"
    # XXX: now I'm not sure it was a good idea to make a class field
    # `defaultValue` instead of a generic method as it was implemented
    # previously -- it's ok with all types except number, cuz
    # if self.decimalCount is 0, we should return 0 and 0.0 otherwise.
    defaultValue = 0
    def decodeValue(self, value):
        """Return a number decoded from ``value``.
        If decimals is zero, value will be decoded as an integer;
        or as a float otherwise.
        Return:
            Return value is a int (long) or float instance.
        """
        value = value.strip(b' \0')
        if b'.' in value:
            # a float (has decimal separator)
            return float(value)
        elif value:
            # must be an integer
            return int(value)
        else:
            return 0
    def encodeValue(self, value):
        """Return string containing encoded ``value``."""
        _rv = ("%*.*f" % (self.length, self.decimalCount, value))
        if len(_rv) > self.length:
            _ppos = _rv.find(".")
            if 0 <= _ppos <= self.length:
                _rv = _rv[:self.length]
            else:
                raise ValueError("[%s] Numeric overflow: %s (field width: %i)"
                    % (self.name, _rv, self.length))
        return _rv
 class DbfFloatFieldDef(DbfNumericFieldDef):
    """Definition of the float field - same as numeric."""
    typeCode = "F"
 class DbfIntegerFieldDef(DbfFieldDef):
    """Definition of the integer field."""
    typeCode = "I"
    length = 4
    defaultValue = 0
    def decodeValue(self, value):
        """Return an integer number decoded from ``value``."""
        return struct.unpack("<i", value)[0]
    def encodeValue(self, value):
        """Return string containing encoded ``value``."""
        return struct.pack("<i", int(value))
 class DbfCurrencyFieldDef(DbfFieldDef):
    """Definition of the currency field."""
    typeCode = "Y"
    length = 8
    defaultValue = 0.0
    def decodeValue(self, value):
        """Return float number decoded from ``value``."""
        return struct.unpack("<q", value)[0] / 10000.
    def encodeValue(self, value):
        """Return string containing encoded ``value``."""
        return struct.pack("<q", round(value * 10000))
 class DbfLogicalFieldDef(DbfFieldDef):
    """Definition of the logical field."""
    typeCode = "L"
    defaultValue = -1
    length = 1
    def decodeValue(self, value):
        """Return True, False or -1 decoded from ``value``."""
        # Note: value always is 1-char string
        if value == "?":
            return -1
        if value in "NnFf ":
            return False
        if value in "YyTt":
            return True
        raise ValueError("[%s] Invalid logical value %r" % (self.name, value))
    def encodeValue(self, value):
        """Return a character from the "TF?" set.
        Return:
            Return value is "T" if ``value`` is True
            "?" if value is -1 or False otherwise.
        """
        if value is True:
            return "T"
        if value == -1:
            return "?"
        return "F"
 class DbfMemoFieldDef(DbfFieldDef):
    """Definition of the memo field.
    Note: memos aren't currenly completely supported.
    """
    typeCode = "M"
    defaultValue = " " * 10
    length = 10
    def decodeValue(self, value):
        """Return int .dbt block number decoded from the string object."""
        #return int(value)
        raise NotImplementedError
    def encodeValue(self, value):
        """Return raw data string encoded from a ``value``.
        Note: this is an internal method.
        """
        #return str(value)[:self.length].ljust(self.length)
        raise NotImplementedError
 class DbfDateFieldDef(DbfFieldDef):
    """Definition of the date field."""
    typeCode = "D"
    defaultValue = utils.classproperty(lambda cls: datetime.date.today())
    # "yyyymmdd" gives us 8 characters
    length = 8
    def decodeValue(self, value):
        """Return a ``datetime.date`` instance decoded from ``value``."""
        if value.strip():
            return utils.getDate(value)
        else:
            return None
    def encodeValue(self, value):
        """Return a string-encoded value.
        ``value`` argument should be a value suitable for the
        `utils.getDate` call.
        Return:
            Return value is a string in format "yyyymmdd".
        """
        if value:
            return utils.getDate(value).strftime("%Y%m%d")
        else:
            return " " * self.length
 class DbfDateTimeFieldDef(DbfFieldDef):
    """Definition of the timestamp field."""
    # a difference between JDN (Julian Day Number)
    # and GDN (Gregorian Day Number). note, that GDN < JDN
    JDN_GDN_DIFF = 1721425
    typeCode = "T"
    defaultValue = utils.classproperty(lambda cls: datetime.datetime.now())
    # two 32-bits integers representing JDN and amount of
    # milliseconds respectively gives us 8 bytes.
    # note, that values must be encoded in LE byteorder.
    length = 8
    def decodeValue(self, value):
        """Return a `datetime.datetime` instance."""
        assert len(value) == self.length
        # LE byteorder
        _jdn, _msecs = struct.unpack("<2I", value)
        if _jdn >= 1:
            _rv = datetime.datetime.fromordinal(_jdn - self.JDN_GDN_DIFF)
            _rv += datetime.timedelta(0, _msecs / 1000.0)
        else:
            # empty date
            _rv = None
        return _rv
    def encodeValue(self, value):
        """Return a string-encoded ``value``."""
        if value:
            value = utils.getDateTime(value)
            # LE byteorder
            _rv = struct.pack("<2I", value.toordinal() + self.JDN_GDN_DIFF,
                (value.hour * 3600 + value.minute * 60 + value.second) * 1000)
        else:
            _rv = "\0" * self.length
        assert len(_rv) == self.length
        return _rv
 _fieldsRegistry = {}
 def registerField(fieldCls):
    """Register field definition class.
    ``fieldCls`` should be subclass of the `DbfFieldDef`.
    Use `lookupFor` to retrieve field definition class
    by the type code.
    """
    assert fieldCls.typeCode is not None, "Type code isn't defined"
    # XXX: use fieldCls.typeCode.upper()? in case of any decign
    # don't forget to look to the same comment in ``lookupFor`` method
    _fieldsRegistry[fieldCls.typeCode] = fieldCls
 def lookupFor(typeCode):
    """Return field definition class for the given type code.
    ``typeCode`` must be a single character. That type should be
    previously registered.
    Use `registerField` to register new field class.
    Return:
        Return value is a subclass of the `DbfFieldDef`.
    """
    # XXX: use typeCode.upper()? in case of any decign don't
    # forget to look to the same comment in ``registerField``
    return _fieldsRegistry[chr(typeCode)]
 ## register generic types
 for (_name, _val) in list(globals().items()):
    if isinstance(_val, type) and issubclass(_val, DbfFieldDef) \
    and (_name != "DbfFieldDef"):
        __all__.append(_name)
        registerField(_val)
 del _name, _val
 # vim: et sts=4 sw=4 :
@@ -0,0 +1,273 @@
 """DBF header definition.
 TODO:
  - handle encoding of the character fields
    (encoding information stored in the DBF header)
 """
 """History (most recent first):
 16-sep-2010 [als]   fromStream: fix century of the last update field
 11-feb-2007 [als]   added .ignoreErrors
 10-feb-2007 [als]   added __getitem__: return field definitions
                    by field name or field number (zero-based)
 04-jul-2006 [als]   added export declaration
 15-dec-2005 [yc]    created
 """
 __version__ = "$Revision: 1.6 $"[11:-2]
 __date__ = "$Date: 2010/09/16 05:06:39 $"[7:-2]
 __all__ = ["DbfHeader"]
 import io
 import datetime
 import struct
 import time
 import sys
 from . import fields
 from .utils import getDate
 class DbfHeader(object):
    """Dbf header definition.
    For more information about dbf header format visit
    `http://www.clicketyclick.dk/databases/xbase/format/dbf.html#DBF_STRUCT`
    Examples:
        Create an empty dbf header and add some field definitions:
            dbfh = DbfHeader()
            dbfh.addField(("name", "C", 10))
            dbfh.addField(("date", "D"))
            dbfh.addField(DbfNumericFieldDef("price", 5, 2))
        Create a dbf header with field definitions:
            dbfh = DbfHeader([
                ("name", "C", 10),
                ("date", "D"),
                DbfNumericFieldDef("price", 5, 2),
            ])
    """
    __slots__ = ("signature", "fields", "lastUpdate", "recordLength",
        "recordCount", "headerLength", "changed", "_ignore_errors")
    ## instance construction and initialization methods
    def __init__(self, fields=None, headerLength=0, recordLength=0,
        recordCount=0, signature=0x03, lastUpdate=None, ignoreErrors=False,
    ):
        """Initialize instance.
        Arguments:
            fields:
                a list of field definitions;
            recordLength:
                size of the records;
            headerLength:
                size of the header;
            recordCount:
                number of records stored in DBF;
            signature:
                version number (aka signature). using 0x03 as a default meaning
                "File without DBT". for more information about this field visit
                ``http://www.clicketyclick.dk/databases/xbase/format/dbf.html#DBF_NOTE_1_TARGET``
            lastUpdate:
                date of the DBF's update. this could be a string ('yymmdd' or
                'yyyymmdd'), timestamp (int or float), datetime/date value,
                a sequence (assuming (yyyy, mm, dd, ...)) or an object having
                callable ``ticks`` field.
            ignoreErrors:
                error processing mode for DBF fields (boolean)
        """
        self.signature = signature
        if fields is None:
            self.fields = []
        else:
            self.fields = list(fields)
        self.lastUpdate = getDate(lastUpdate)
        self.recordLength = recordLength
        self.headerLength = headerLength
        self.recordCount = recordCount
        self.ignoreErrors = ignoreErrors
        # XXX: I'm not sure this is safe to
        # initialize `self.changed` in this way
        self.changed = bool(self.fields)
    # @classmethod
    def fromString(cls, string):
        """Return header instance from the string object."""
        return cls.fromStream(io.StringIO(str(string)))
    fromString = classmethod(fromString)
    # @classmethod
    def fromStream(cls, stream):
        """Return header object from the stream."""
        stream.seek(0)
        first_32 = stream.read(32)
        if type(first_32) != bytes:
            _data = bytes(first_32, sys.getfilesystemencoding())
        _data = first_32
        (_cnt, _hdrLen, _recLen) = struct.unpack("<I2H", _data[4:12])
        #reserved = _data[12:32]
        _year = _data[1]
        if _year < 80:
            # dBase II started at 1980.  It is quite unlikely
            # that actual last update date is before that year.
            _year += 2000
        else:
            _year += 1900
        ## create header object
        _obj = cls(None, _hdrLen, _recLen, _cnt, _data[0],
            (_year, _data[2], _data[3]))
        ## append field definitions
        # position 0 is for the deletion flag
        _pos = 1
        _data = stream.read(1)
        while _data != b'\r':
            _data += stream.read(31)
            _fld = fields.lookupFor(_data[11]).fromString(_data, _pos)
            _obj._addField(_fld)
            _pos = _fld.end
            _data = stream.read(1)
        return _obj
    fromStream = classmethod(fromStream)
    ## properties
    year = property(lambda self: self.lastUpdate.year)
    month = property(lambda self: self.lastUpdate.month)
    day = property(lambda self: self.lastUpdate.day)
    def ignoreErrors(self, value):
        """Update `ignoreErrors` flag on self and all fields"""
        self._ignore_errors = value = bool(value)
        for _field in self.fields:
            _field.ignoreErrors = value
    ignoreErrors = property(
        lambda self: self._ignore_errors,
        ignoreErrors,
        doc="""Error processing mode for DBF field value conversion
        if set, failing field value conversion will return
        ``INVALID_VALUE`` instead of raising conversion error.
        """)
    ## object representation
    def __repr__(self):
        _rv = """\
 Version (signature): 0x%02x
        Last update: %s
      Header length: %d
      Record length: %d
       Record count: %d
 FieldName Type Len Dec
 """ % (self.signature, self.lastUpdate, self.headerLength,
    self.recordLength, self.recordCount)
        _rv += "\n".join(
            ["%10s %4s %3s %3s" % _fld.fieldInfo() for _fld in self.fields]
        )
        return _rv
    ## internal methods
    def _addField(self, *defs):
        """Internal variant of the `addField` method.
        This method doesn't set `self.changed` field to True.
        Return value is a length of the appended records.
        Note: this method doesn't modify ``recordLength`` and
        ``headerLength`` fields. Use `addField` instead of this
        method if you don't exactly know what you're doing.
        """
        # insure we have dbf.DbfFieldDef instances first (instantiation
        # from the tuple could raise an error, in such a case I don't
        # wanna add any of the definitions -- all will be ignored)
        _defs = []
        _recordLength = 0
        for _def in defs:
            if isinstance(_def, fields.DbfFieldDef):
                _obj = _def
            else:
                (_name, _type, _len, _dec) = (tuple(_def) + (None,) * 4)[:4]
                _cls = fields.lookupFor(_type)
                _obj = _cls(_name, _len, _dec,
                    ignoreErrors=self._ignore_errors)
            _recordLength += _obj.length
            _defs.append(_obj)
        # and now extend field definitions and
        # update record length
        self.fields += _defs
        return _recordLength
    ## interface methods
    def addField(self, *defs):
        """Add field definition to the header.
        Examples:
            dbfh.addField(
                ("name", "C", 20),
                dbf.DbfCharacterFieldDef("surname", 20),
                dbf.DbfDateFieldDef("birthdate"),
                ("member", "L"),
            )
            dbfh.addField(("price", "N", 5, 2))
            dbfh.addField(dbf.DbfNumericFieldDef("origprice", 5, 2))
        """
        _oldLen = self.recordLength
        self.recordLength += self._addField(*defs)
        if not _oldLen:
            self.recordLength += 1
            # XXX: may be just use:
            # self.recordeLength += self._addField(*defs) + bool(not _oldLen)
        # recalculate headerLength
        self.headerLength = 32 + (32 * len(self.fields)) + 1
        self.changed = True
    def write(self, stream):
        """Encode and write header to the stream."""
        stream.seek(0)
        stream.write(self.toString())
        fields = [_fld.toString() for _fld in self.fields]
        stream.write(''.join(fields).encode(sys.getfilesystemencoding()))
        stream.write(b'\x0D')   # cr at end of all header data
        self.changed = False
    def toString(self):
        """Returned 32 chars length string with encoded header."""
        return struct.pack("<4BI2H",
            self.signature,
            self.year - 1900,
            self.month,
            self.day,
            self.recordCount,
            self.headerLength,
            self.recordLength) + (b'\x00' * 20)
        #TODO: figure out if bytes(utf-8) is correct here.
    def setCurrentDate(self):
        """Update ``self.lastUpdate`` field with current date value."""
        self.lastUpdate = datetime.date.today()
    def __getitem__(self, item):
        """Return a field definition by numeric index or name string"""
        if isinstance(item, str):
            _name = item.upper()
            for _field in self.fields:
                if _field.name == _name:
                    return _field
            else:
                raise KeyError(item)
        else:
            # item must be field index
            return self.fields[item]
 # vim: et sts=4 sw=4 :
@@ -0,0 +1,266 @@
 """DBF record definition.
 """
 """History (most recent first):
 11-feb-2007 [als]   __repr__: added special case for invalid field values
 10-feb-2007 [als]   added .rawFromStream()
 30-oct-2006 [als]   fix record length in .fromStream()
 04-jul-2006 [als]   added export declaration
 20-dec-2005 [yc]    DbfRecord.write() -> DbfRecord._write();
                    added delete() method.
 16-dec-2005 [yc]    record definition moved from `dbf`.
 """
 __version__ = "$Revision: 1.7 $"[11:-2]
 __date__ = "$Date: 2007/02/11 09:05:49 $"[7:-2]
 __all__ = ["DbfRecord"]
 import sys
 from . import utils
 class DbfRecord(object):
    """DBF record.
    Instances of this class shouldn't be created manualy,
    use `dbf.Dbf.newRecord` instead.
    Class implements mapping/sequence interface, so
    fields could be accessed via their names or indexes
    (names is a preffered way to access fields).
    Hint:
        Use `store` method to save modified record.
    Examples:
        Add new record to the database:
            db = Dbf(filename)
            rec = db.newRecord()
            rec["FIELD1"] = value1
            rec["FIELD2"] = value2
            rec.store()
        Or the same, but modify existed
        (second in this case) record:
            db = Dbf(filename)
            rec = db[2]
            rec["FIELD1"] = value1
            rec["FIELD2"] = value2
            rec.store()
    """
    __slots__ = "dbf", "index", "deleted", "fieldData"
    ## creation and initialization
    def __init__(self, dbf, index=None, deleted=False, data=None):
        """Instance initialiation.
        Arguments:
            dbf:
                A `Dbf.Dbf` instance this record belonogs to.
            index:
                An integer record index or None. If this value is
                None, record will be appended to the DBF.
            deleted:
                Boolean flag indicating whether this record
                is a deleted record.
            data:
                A sequence or None. This is a data of the fields.
                If this argument is None, default values will be used.
        """
        self.dbf = dbf
        # XXX: I'm not sure ``index`` is necessary
        self.index = index
        self.deleted = deleted
        if data is None:
            self.fieldData = [_fd.defaultValue for _fd in dbf.header.fields]
        else:
            self.fieldData = list(data)
    # XXX: validate self.index before calculating position?
    position = property(lambda self: self.dbf.header.headerLength + \
        self.index * self.dbf.header.recordLength)
    def rawFromStream(cls, dbf, index):
        """Return raw record contents read from the stream.
        Arguments:
            dbf:
                A `Dbf.Dbf` instance containing the record.
            index:
                Index of the record in the records' container.
                This argument can't be None in this call.
        Return value is a string containing record data in DBF format.
        """
        # XXX: may be write smth assuming, that current stream
        # position is the required one? it could save some
        # time required to calculate where to seek in the file
        dbf.stream.seek(dbf.header.headerLength +
            index * dbf.header.recordLength)
        return dbf.stream.read(dbf.header.recordLength)
    rawFromStream = classmethod(rawFromStream)
    def fromStream(cls, dbf, index):
        """Return a record read from the stream.
        Arguments:
            dbf:
                A `Dbf.Dbf` instance new record should belong to.
            index:
                Index of the record in the records' container.
                This argument can't be None in this call.
        Return value is an instance of the current class.
        """
        return cls.fromString(dbf, cls.rawFromStream(dbf, index), index)
    fromStream = classmethod(fromStream)
    def fromString(cls, dbf, string, index=None):
        """Return record read from the string object.
        Arguments:
            dbf:
                A `Dbf.Dbf` instance new record should belong to.
            string:
                A string new record should be created from.
            index:
                Index of the record in the container. If this
                argument is None, record will be appended.
        Return value is an instance of the current class.
        """
        return cls(dbf, index, string[0]=="*",
            [_fd.decodeFromRecord(string) for _fd in dbf.header.fields])
    fromString = classmethod(fromString)
    ## object representation
    def __repr__(self):
        _template = "%%%ds: %%s (%%s)" % max([len(_fld)
            for _fld in self.dbf.fieldNames])
        _rv = []
        for _fld in self.dbf.fieldNames:
            _val = self[_fld]
            if _val is utils.INVALID_VALUE:
                _rv.append(_template %
                    (_fld, "None", "value cannot be decoded"))
            else:
                _rv.append(_template % (_fld, _val, type(_val)))
        return "\n".join(_rv)
    ## protected methods
    def _write(self):
        """Write data to the dbf stream.
        Note:
            This isn't a public method, it's better to
            use 'store' instead publically.
            Be design ``_write`` method should be called
            only from the `Dbf` instance.
        """
        self._validateIndex(False)
        self.dbf.stream.seek(self.position)
        self.dbf.stream.write(bytes(self.toString(),
            sys.getfilesystemencoding()))
        # FIXME: may be move this write somewhere else?
        # why we should check this condition for each record?
        if self.index == len(self.dbf):
            # this is the last record,
            # we should write SUB (ASCII 26)
            self.dbf.stream.write(b"\x1A")
    ## utility methods
    def _validateIndex(self, allowUndefined=True, checkRange=False):
        """Valid ``self.index`` value.
        If ``allowUndefined`` argument is True functions does nothing
        in case of ``self.index`` pointing to None object.
        """
        if self.index is None:
            if not allowUndefined:
                raise ValueError("Index is undefined")
        elif self.index < 0:
            raise ValueError("Index can't be negative (%s)" % self.index)
        elif checkRange and self.index <= self.dbf.header.recordCount:
            raise ValueError("There are only %d records in the DBF" %
                self.dbf.header.recordCount)
    ## interface methods
    def store(self):
        """Store current record in the DBF.
        If ``self.index`` is None, this record will be appended to the
        records of the DBF this records belongs to; or replaced otherwise.
        """
        self._validateIndex()
        if self.index is None:
            self.index = len(self.dbf)
            self.dbf.append(self)
        else:
            self.dbf[self.index] = self
    def delete(self):
        """Mark method as deleted."""
        self.deleted = True
    def toString(self):
        """Return string packed record values."""
 #        for (_def, _dat) in zip(self.dbf.header.fields, self.fieldData):
 #            
        return "".join([" *"[self.deleted]] + [
           _def.encodeValue(_dat)
            for (_def, _dat) in zip(self.dbf.header.fields, self.fieldData)
        ])
    def asList(self):
        """Return a flat list of fields.
        Note:
            Change of the list's values won't change
            real values stored in this object.
        """
        return self.fieldData[:]
    def asDict(self):
        """Return a dictionary of fields.
        Note:
            Change of the dicts's values won't change
            real values stored in this object.
        """
        return dict([_i for _i in zip(self.dbf.fieldNames, self.fieldData)])
    def __getitem__(self, key):
        """Return value by field name or field index."""
        if isinstance(key, int):
            # integer index of the field
            return self.fieldData[key]
        # assuming string field name
        return self.fieldData[self.dbf.indexOfFieldName(key)]
    def __setitem__(self, key, value):
        """Set field value by integer index of the field or string name."""
        if isinstance(key, int):
            # integer index of the field
            return self.fieldData[key]
        # assuming string field name
        self.fieldData[self.dbf.indexOfFieldName(key)] = value
 # vim: et sts=4 sw=4 :
@@ -0,0 +1,170 @@
 """String utilities.
 TODO:
  - allow strings in getDateTime routine;
 """
 """History (most recent first):
 11-feb-2007 [als]   added INVALID_VALUE
 10-feb-2007 [als]   allow date strings padded with spaces instead of zeroes
 20-dec-2005 [yc]    handle long objects in getDate/getDateTime
 16-dec-2005 [yc]    created from ``strutil`` module.
 """
 __version__ = "$Revision: 1.4 $"[11:-2]
 __date__ = "$Date: 2007/02/11 08:57:17 $"[7:-2]
 import datetime
 import time
 def unzfill(str):
    """Return a string without ASCII NULs.
    This function searchers for the first NUL (ASCII 0) occurance
    and truncates string till that position.
    """
    try:
        return str[:str.index(b'\0')]
    except ValueError:
        return str
 def getDate(date=None):
    """Return `datetime.date` instance.
    Type of the ``date`` argument could be one of the following:
        None:
            use current date value;
        datetime.date:
            this value will be returned;
        datetime.datetime:
            the result of the date.date() will be returned;
        string:
            assuming "%Y%m%d" or "%y%m%dd" format;
        number:
            assuming it's a timestamp (returned for example
            by the time.time() call;
        sequence:
            assuming (year, month, day, ...) sequence;
    Additionaly, if ``date`` has callable ``ticks`` attribute,
    it will be used and result of the called would be treated
    as a timestamp value.
    """
    if date is None:
        # use current value
        return datetime.date.today()
    if isinstance(date, datetime.date):
        return date
    if isinstance(date, datetime.datetime):
        return date.date()
    if isinstance(date, (int, float)):
        # date is a timestamp
        return datetime.date.fromtimestamp(date)
    if isinstance(date, str):
        date = date.replace(" ", "0")
        if len(date) == 6:
            # yymmdd
            return datetime.date(*time.strptime(date, "%y%m%d")[:3])
        # yyyymmdd
        return datetime.date(*time.strptime(date, "%Y%m%d")[:3])
    if hasattr(date, "__getitem__"):
        # a sequence (assuming date/time tuple)
        return datetime.date(*date[:3])
    return datetime.date.fromtimestamp(date.ticks())
 def getDateTime(value=None):
    """Return `datetime.datetime` instance.
    Type of the ``value`` argument could be one of the following:
        None:
            use current date value;
        datetime.date:
            result will be converted to the `datetime.datetime` instance
            using midnight;
        datetime.datetime:
            ``value`` will be returned as is;
        string:
            *** CURRENTLY NOT SUPPORTED ***;
        number:
            assuming it's a timestamp (returned for example
            by the time.time() call;
        sequence:
            assuming (year, month, day, ...) sequence;
    Additionaly, if ``value`` has callable ``ticks`` attribute,
    it will be used and result of the called would be treated
    as a timestamp value.
    """
    if value is None:
        # use current value
        return datetime.datetime.today()
    if isinstance(value, datetime.datetime):
        return value
    if isinstance(value, datetime.date):
        return datetime.datetime.fromordinal(value.toordinal())
    if isinstance(value, (int, float)):
        # value is a timestamp
        return datetime.datetime.fromtimestamp(value)
    if isinstance(value, str):
        raise NotImplementedError("Strings aren't currently implemented")
    if hasattr(value, "__getitem__"):
        # a sequence (assuming date/time tuple)
        return datetime.datetime(*tuple(value)[:6])
    return datetime.datetime.fromtimestamp(value.ticks())
 class classproperty(property):
    """Works in the same way as a ``property``, but for the classes."""
    def __get__(self, obj, cls):
        return self.fget(cls)
 class _InvalidValue(object):
    """Value returned from DBF records when field validation fails
    The value is not equal to anything except for itself
    and equal to all empty values: None, 0, empty string etc.
    In other words, invalid value is equal to None and not equal
    to None at the same time.
    This value yields zero upon explicit conversion to a number type,
    empty string for string types, and False for boolean.
    """
    def __eq__(self, other):
        return not other
    def __ne__(self, other):
        return not (other is self)
    def __bool__(self):
        return False
    def __int__(self):
        return 0
    __long__ = __int__
    def __float__(self):
        return 0.0
    def __str__(self):
        return ""
    def __unicode__(self):
        return ""
    def __repr__(self):
        return "<INVALID>"
 # invalid value is a constant singleton
 INVALID_VALUE = _InvalidValue()
 # vim: set et sts=4 sw=4 :
@@ -33,7 +33,7 @@ class element:
            self.tag = tag.lower( )
        else:
            self.tag = tag.upper( )
-    
+
    def __call__( self, *args, **kwargs ):
        if len( args ) > 1:
            raise ArgumentError( self.tag )
@@ -42,14 +42,14 @@ class element:
        if self.parent is not None and self.parent.class_ is not None:
            if 'class_' not in kwargs:
                kwargs['class_'] = self.parent.class_
-            
+
        if self.parent is None and len( args ) == 1:
            x = [ self.render( self.tag, False, myarg, mydict ) for myarg, mydict in _argsdicts( args, kwargs ) ]
            return '\n'.join( x )
        elif self.parent is None and len( args ) == 0:
            x = [ self.render( self.tag, True, myarg, mydict ) for myarg, mydict in _argsdicts( args, kwargs ) ]
            return '\n'.join( x )
-            
+
        if self.tag in self.parent.twotags:
            for myarg, mydict in _argsdicts( args, kwargs ):
                self.render( self.tag, False, myarg, mydict )
@@ -63,7 +63,7 @@ class element:
            raise DeprecationError( self.tag )
        else:
            raise InvalidElementError( self.tag, self.parent.mode )
-    
+
    def render( self, tag, single, between, kwargs ):
        """Append the actual tags to content."""
@@ -89,7 +89,7 @@ class element:
            self.parent.content.append( out )
        else:
            return out
-    
+
    def close( self ):
        """Append a closing tag unless element has only opening tag."""
@@ -128,11 +128,11 @@ class page:
                                these two keyword arguments may be used to select
                                the set of valid elements in 'xml' mode
                                invalid elements will raise appropriate exceptions
-        
+
        separator --            string to place between added elements, defaults to newline
-        
+
        class_ --               a class that will be added to every element if defined"""
-        
+
        valid_onetags = [ "AREA", "BASE", "BR", "COL", "FRAME", "HR", "IMG", "INPUT", "LINK", "META", "PARAM" ]
        valid_twotags = [ "A", "ABBR", "ACRONYM", "ADDRESS", "B", "BDO", "BIG", "BLOCKQUOTE", "BODY", "BUTTON",
                "CAPTION", "CITE", "CODE", "COLGROUP", "DD", "DEL", "DFN", "DIV", "DL", "DT", "EM", "FIELDSET",
@@ -163,7 +163,7 @@ class page:
            self.deptags += map( string.lower, self.deptags )
            self.mode = 'strict_html'
        elif mode == 'loose_html':
-            self.onetags = valid_onetags + deprecated_onetags 
+            self.onetags = valid_onetags + deprecated_onetags
            self.onetags += map( string.lower, self.onetags )
            self.twotags = valid_twotags + deprecated_twotags
            self.twotags += map( string.lower, self.twotags )
@@ -183,16 +183,16 @@ class page:
    def __getattr__( self, attr ):
        if attr.startswith("__") and attr.endswith("__"):
-            raise AttributeError, attr
+            raise AttributeError(attr)
        return element( attr, case=self.case, parent=self )
    def __str__( self ):
-        
+
        if self._full and ( self.mode == 'strict_html' or self.mode == 'loose_html' ):
            end = [ '</body>', '</html>' ]
        else:
            end = [ ]
-        
+
        return self.separator.join( self.header + self.content + self.footer + end )
    def __call__( self, escape=False ):
@@ -232,7 +232,7 @@ class page:
        lang --     language, usually a two character string, will appear
                    as <html lang='en'> in html mode (ignored in xml mode)
-        
+
        css --      Cascading Style Sheet filename as a string or a list of
                    strings for multiple css files (ignored in xml mode)
@@ -306,7 +306,7 @@ class page:
    def css( self, filelist ):
        """This convenience function is only useful for html.
        It adds css stylesheet(s) to the document via the <link> element."""
-      
+
        if isinstance( filelist, basestring ):
            self.link( href=filelist, rel='stylesheet', type='text/css', media='all' )
        else:
@@ -322,7 +322,7 @@ class page:
            for name, content in mydict.iteritems( ):
                self.meta( name=name, content=content )
        else:
-            raise TypeError, "Metainfo should be called with a dictionary argument of name:content pairs."
+            raise TypeError ("Metainfo should be called with a dictionary argument of name:content pairs.")
    def scripts( self, mydict ):
        """Only useful in html, mydict is dictionary of src:type pairs will
@@ -332,20 +332,20 @@ class page:
            for src, type in mydict.iteritems( ):
                self.script( '', src=src, type='text/%s' % type )
        else:
-            raise TypeError, "Script should be given a dictionary of src:type pairs."
+            raise TypeError ("Script should be given a dictionary of src:type pairs.")
 class _oneliner:
    """An instance of oneliner returns a string corresponding to one element.
    This class can be used to write 'oneliners' that return a string
    immediately so there is no need to instantiate the page class."""
-    
+
    def __init__( self, case='lower' ):
        self.case = case
-    
+
    def __getattr__( self, attr ):
        if attr.startswith("__") and attr.endswith("__"):
-            raise AttributeError, attr
+            raise AttributeError(attr)
        return element( attr, case=self.case, parent=None )
 oneliner = _oneliner( case='lower' )
@@ -353,13 +353,13 @@ upper_oneliner = _oneliner( case='upper' )
 def _argsdicts( args, mydict ):
    """A utility generator that pads argument list and dictionary values, will only be called with len( args ) = 0, 1."""
-    
+
    if len( args ) == 0:
-        args = None, 
+        args = None,
    elif len( args ) == 1:
        args = _totuple( args[0] )
    else:
-        raise Exception, "We should have never gotten here."
+        raise Exception("We should have never gotten here.")
    mykeys = mydict.keys( )
    myvalues = map( _totuple, mydict.values( ) )
@@ -418,7 +418,7 @@ _escape = escape
 def unescape( text ):
    """Inverse of escape."""
-    
+
    if isinstance( text, basestring ):
        if '&amp;' in text:
            text = text.replace( '&amp;', '&' )
@@ -481,4 +481,4 @@ class CustomizationError( MarkupError ):
        self.message = "If you customize the allowed elements, you must define both types 'onetags' and 'twotags'."
 if __name__ == '__main__':
-    print __doc__
+    print (__doc__)
@@ -1,22 +1,65 @@
 # -*- coding: utf-8 -*-
 import csv
-from csv import *
+try:
    from itertools import izip
 except ImportError:
    izip = zip
 #http://semver.org/
-VERSION = (0, 8, 0)
+VERSION = (0, 10, 1)
 __version__ = ".".join(map(str,VERSION))
-def _stringify(s, encoding):
+pass_throughs = [
-    if type(s)==unicode:
+    'register_dialect',
-        return s.encode(encoding)
+    'unregister_dialect',
    'get_dialect',
    'list_dialects',
    'field_size_limit',
    'Dialect',
    'excel',
    'excel_tab',
    'Sniffer',
    'QUOTE_ALL',
    'QUOTE_MINIMAL',
    'QUOTE_NONNUMERIC',
    'QUOTE_NONE',
    'Error'
 ]
 __all__ = [
    'reader',
    'writer',
    'DictReader',
    'DictWriter',
 ] + pass_throughs
 for prop in pass_throughs:
    globals()[prop]=getattr(csv, prop)
 def _stringify(s, encoding, errors):
    if s is None:
        return ''
    if isinstance(s, unicode):
        return s.encode(encoding, errors)
    elif isinstance(s, (int , float)):
        pass #let csv.QUOTE_NONNUMERIC do its thing.
-    elif type(s) != str:
+    elif not isinstance(s, str):
        s=str(s)
    return s
-def _stringify_list(l, encoding):
+def _stringify_list(l, encoding, errors='strict'):
-    return [_stringify(s, encoding) for s in l]
+    try:
        return [_stringify(s, encoding, errors) for s in iter(l)]
    except TypeError as e:
        raise csv.Error(str(e))
 def _unicodify(s, encoding):
    if s is None:
        return None
    if isinstance(s, (unicode, int, float)):
        return s
    elif isinstance(s, str):
        return s.decode(encoding)
    return s
 class UnicodeWriter(object):
    """
@@ -28,78 +71,127 @@ class UnicodeWriter(object):
    >>> f.seek(0)
    >>> r = unicodecsv.reader(f, encoding='utf-8')
    >>> row = r.next()
-    >>> print row[0], row[1]
+    >>> row[0] == u'é'
-    é ñ
+    True
    >>> row[1] == u'ñ'
    True
    """
-    def __init__(self, f, dialect=csv.excel, encoding="utf-8", **kwds):
+    def __init__(self, f, dialect=csv.excel, encoding='utf-8', errors='strict',
-        self.writer = csv.writer(f)
+                 *args, **kwds):
        self.dialect = dialect
        self.encoding = encoding
-        self.writer = csv.writer(f, dialect=dialect, **kwds)
+        self.writer = csv.writer(f, dialect, *args, **kwds)
        self.encoding_errors = errors
    def writerow(self, row):
-        self.writer.writerow(_stringify_list(row, self.encoding))
+        self.writer.writerow(_stringify_list(row, self.encoding, self.encoding_errors))
    def writerows(self, rows):
        for row in rows:
          self.writerow(row)
    @property
    def dialect(self):
        return self.writer.dialect
 writer = UnicodeWriter
 class UnicodeReader(object):
-    def __init__(self, f, dialect=csv.excel, encoding="utf-8", **kwds):
+    def __init__(self, f, dialect=None, encoding='utf-8', errors='strict',
-        self.reader = csv.reader(f, dialect=dialect, **kwds)
+                 **kwds):
        format_params = ['delimiter', 'doublequote', 'escapechar', 'lineterminator', 'quotechar', 'quoting', 'skipinitialspace']
        if dialect is None:
            if not any([kwd_name in format_params for kwd_name in kwds.keys()]):
                dialect = csv.excel
        self.reader = csv.reader(f, dialect, **kwds)
        self.encoding = encoding
        self.encoding_errors = errors
    def next(self):
        row = self.reader.next()
-        return [unicode(s, self.encoding) for s in row]
+        encoding = self.encoding
        encoding_errors = self.encoding_errors
        float_ = float
        unicode_ = unicode
        return [(value if isinstance(value, float_) else
                 unicode_(value, encoding, encoding_errors)) for value in row]
    def __iter__(self):
        return self
    @property
    def dialect(self):
        return self.reader.dialect
    @property
    def line_num(self):
        return self.reader.line_num
 reader = UnicodeReader
 class DictWriter(csv.DictWriter):
    """
    >>> from cStringIO import StringIO
    >>> f = StringIO()
-    >>> w = DictWriter(f, ['a', 'b'], restval=u'î')
+    >>> w = DictWriter(f, ['a', u'ñ', 'b'], restval=u'î')
-    >>> w.writerow({'a':'1'})
+    >>> w.writerow({'a':'1', u'ñ':'2'})
-    >>> w.writerow({'a':'1', 'b':u'ø'})
+    >>> w.writerow({'a':'1', u'ñ':'2', 'b':u'ø'})
-    >>> w.writerow({'a':u'é'})
+    >>> w.writerow({'a':u'é', u'ñ':'2'})
    >>> f.seek(0)
-    >>> r = DictReader(f, fieldnames=['a'], restkey='r')
+    >>> r = DictReader(f, fieldnames=['a', u'ñ'], restkey='r')
-    >>> r.next() == {'a':u'1', 'r':[u"î"]}
+    >>> r.next() == {'a': u'1', u'ñ':'2', 'r': [u'î']}
    True
-    >>> r.next() == {'a':u'1', 'r':[u"ø"]}
+    >>> r.next() == {'a': u'1', u'ñ':'2', 'r': [u'\xc3\xb8']}
    True
    >>> r.next() == {'a': u'\xc3\xa9', u'ñ':'2', 'r': [u'\xc3\xae']}
    True
    >>> r.next() == {'a':u'é', 'r':[u"î"]}
    """
-    def __init__(self, csvfile, fieldnames, restval='', extrasaction='raise', dialect='excel', encoding='utf-8', *args, **kwds):
+    def __init__(self, csvfile, fieldnames, restval='', extrasaction='raise', dialect='excel', encoding='utf-8', errors='strict', *args, **kwds):
        self.fieldnames = fieldnames
        self.encoding = encoding
-        self.restval = restval
+        csv.DictWriter.__init__(self, csvfile, fieldnames, restval, extrasaction, dialect, *args, **kwds)
-        self.writer = csv.DictWriter(csvfile, fieldnames, restval, extrasaction, dialect, *args, **kwds)
+        self.writer = UnicodeWriter(csvfile, dialect, encoding=encoding, errors=errors, *args, **kwds)
-    def writerow(self, d):
+        self.encoding_errors = errors
-        for fieldname in self.fieldnames:
+
-            if fieldname in d:
+    def writeheader(self):
-                d[fieldname] = _stringify(d[fieldname], self.encoding)
+        fieldnames = _stringify_list(self.fieldnames, self.encoding, self.encoding_errors)
-            else:
+        header = dict(zip(self.fieldnames, self.fieldnames))
-                d[fieldname] = _stringify(self.restval, self.encoding)
+        self.writerow(header)
        self.writer.writerow(d)
 class DictReader(csv.DictReader):
-    def __init__(self, csvfile, fieldnames=None, restkey=None, restval=None, dialect='excel', encoding='utf-8', *args, **kwds):
+    """
-        self.restkey = restkey
+    >>> from cStringIO import StringIO
-        self.encoding = encoding
+    >>> f = StringIO()
-        self.reader = csv.DictReader(csvfile, fieldnames, restkey, restval, dialect, *args, **kwds)
+    >>> w = DictWriter(f, fieldnames=['name', 'place'])
    >>> w.writerow({'name': 'Cary Grant', 'place': 'hollywood'})
    >>> w.writerow({'name': 'Nathan Brillstone', 'place': u'øLand'})
    >>> w.writerow({'name': u'Willam ø. Unicoder', 'place': u'éSpandland'})
    >>> f.seek(0)
    >>> r = DictReader(f, fieldnames=['name', 'place'])
    >>> print r.next() == {'name': 'Cary Grant', 'place': 'hollywood'}
    True
    >>> print r.next() == {'name': 'Nathan Brillstone', 'place': u'øLand'}
    True
    >>> print r.next() == {'name': u'Willam ø. Unicoder', 'place': u'éSpandland'}
    True
    """
    def __init__(self, csvfile, fieldnames=None, restkey=None, restval=None,
                 dialect='excel', encoding='utf-8', errors='strict', *args,
                 **kwds):
        if fieldnames is not None:
            fieldnames = _stringify_list(fieldnames, encoding)
        csv.DictReader.__init__(self, csvfile, fieldnames, restkey, restval, dialect, *args, **kwds)
        self.reader = UnicodeReader(csvfile, dialect, encoding=encoding,
                                    errors=errors, *args, **kwds)
        if fieldnames is None and not hasattr(csv.DictReader, 'fieldnames'):
            # Python 2.5 fieldnames workaround. (http://bugs.python.org/issue3436)
            reader = UnicodeReader(csvfile, dialect, encoding=encoding, *args, **kwds)
            self.fieldnames = _stringify_list(reader.next(), reader.encoding)
        self.unicode_fieldnames = [_unicodify(f, encoding) for f in
                                   self.fieldnames]
        self.unicode_restkey = _unicodify(restkey, encoding)
    def next(self):
-        d = self.reader.next()
+        row = csv.DictReader.next(self)
-        for k, v in d.items():
+        result = dict((uni_key, row[str_key]) for (str_key, uni_key) in
-            if k == self.restkey:
+                      izip(self.fieldnames, self.unicode_fieldnames))
-                rest = v
+        rest = row.get(self.restkey)
-                if rest:
+        if rest:
-                    d[self.restkey] = [unicode(v, self.encoding) for v in rest]
+            result[self.unicode_restkey] = rest
-            else:
+        return result
                if v is not None:
                    d[k] = unicode(v, self.encoding)
        return d
@@ -8,6 +8,7 @@ import sys
 import os
 import tablib
 from tablib.compat import markup, unicode, is_py3
 from tablib.core import Row
@@ -206,6 +207,18 @@ class TablibTestCase(unittest.TestCase):
        self.assertEqual(self.founders[2:], [self.tom])
    def test_row_slicing(self):
        """Verify Row's __getslice__ method. Issue #184."""
        john = Row(self.john)
        self.assertEqual(john[:], list(self.john[:]))
        self.assertEqual(john[0:], list(self.john[0:]))
        self.assertEqual(john[:2], list(self.john[:2]))
        self.assertEqual(john[0:2], list(self.john[0:2]))
        self.assertEqual(john[0:-1], list(self.john[0:-1]))
    def test_delete(self):
        """Verify deleting from dataset works."""
@@ -228,7 +241,6 @@ class TablibTestCase(unittest.TestCase):
        # Delete from invalid index
        self.assertRaises(IndexError, self.founders.__delitem__, 3)
    def test_csv_export(self):
        """Verify exporting dataset object as CSV."""
@@ -306,6 +318,67 @@ class TablibTestCase(unittest.TestCase):
        self.assertEqual(html, d.html)
    def test_latex_export(self):
        """LaTeX export"""
        expected = """\
 % Note: add \\usepackage{booktabs} to your preamble
 %
 \\begin{table}[!htbp]
  \\centering
  \\caption{Founders}
  \\begin{tabular}{lrr}
    \\toprule
      first\\_name & last\\_name & gpa \\\\
    \\cmidrule(r){1-1} \\cmidrule(lr){2-2} \\cmidrule(l){3-3}
      John & Adams & 90 \\\\
      George & Washington & 67 \\\\
      Thomas & Jefferson & 50 \\\\
    \\bottomrule
  \\end{tabular}
 \\end{table}
 """
        output = self.founders.latex
        self.assertEqual(output, expected)
    def test_latex_export_empty_dataset(self):
        self.assertTrue(tablib.Dataset().latex is not None)
    def test_latex_export_no_headers(self):
        d = tablib.Dataset()
        d.append(('one', 'two', 'three'))
        self.assertTrue('one' in d.latex)
    def test_latex_export_caption(self):
        d = tablib.Dataset()
        d.append(('one', 'two', 'three'))
        self.assertFalse('caption' in d.latex)
        d.title = 'Title'
        self.assertTrue('\\caption{Title}' in d.latex)
    def test_latex_export_none_values(self):
        headers = ['foo', None, 'bar']
        d = tablib.Dataset(['foo', None, 'bar'], headers=headers)
        output = d.latex
        self.assertTrue('foo' in output)
        self.assertFalse('None' in output)
    def test_latex_escaping(self):
        d = tablib.Dataset(['~', '^'])
        output = d.latex
        self.assertFalse('~' in output)
        self.assertTrue('textasciitilde' in output)
        self.assertFalse('^' in output)
        self.assertTrue('textasciicircum' in output)
    def test_unicode_append(self):
        """Passes in a single unicode character and exports."""
@@ -325,6 +398,7 @@ class TablibTestCase(unittest.TestCase):
        data.xlsx
        data.ods
        data.html
        data.latex
    def test_book_export_no_exceptions(self):
@@ -406,6 +480,17 @@ class TablibTestCase(unittest.TestCase):
        self.assertEqual(_csv, data.csv)
    def test_csv_import_set_semicolons(self):
        """Test for proper output with semicolon separated CSV."""
        data.append(self.john)
        data.append(self.george)
        data.headers = self.headers
        _csv = data.get_csv(delimiter=';')
        data.set_csv(_csv, delimiter=';')
        self.assertEqual(_csv, data.get_csv(delimiter=';'))
    def test_csv_import_set_with_spaces(self):
        """Generate and import CSV set serialization when row values have
@@ -420,6 +505,19 @@ class TablibTestCase(unittest.TestCase):
        self.assertEqual(_csv, data.csv)
    def test_csv_import_set_semicolon_with_spaces(self):
        """Generate and import semicolon separated CSV set serialization when row values have
        spaces."""
        data.append(('Bill Gates', 'Microsoft'))
        data.append(('Steve Jobs', 'Apple'))
        data.headers = ('Name', 'Company')
        _csv = data.get_csv(delimiter=';')
        data.set_csv(_csv, delimiter=';')
        self.assertEqual(_csv, data.get_csv(delimiter=';'))
    def test_csv_import_set_with_newlines(self):
        """Generate and import CSV set serialization when row values have
@@ -431,7 +529,6 @@ class TablibTestCase(unittest.TestCase):
        data.headers = ('title', 'body')
        _csv = data.csv
        data.csv = _csv
        self.assertEqual(_csv, data.csv)
@@ -450,6 +547,108 @@ class TablibTestCase(unittest.TestCase):
        self.assertEqual(_tsv, data.tsv)
    def test_dbf_import_set(self):
        data.append(self.john)
        data.append(self.george)
        data.headers = self.headers
        _dbf = data.dbf
        data.dbf = _dbf
        #self.assertEqual(_dbf, data.dbf)
        try:
            self.assertEqual(_dbf, data.dbf)
        except AssertionError:
            index = 0
            so_far = ''
            for reg_char, data_char in zip(_dbf, data.dbf):
                so_far += chr(data_char)
                if reg_char != data_char and index not in [1, 2, 3]:
                    raise AssertionError('Failing at char %s: %s vs %s %s' % (
                        index, reg_char, data_char, so_far))
                index += 1
    def test_dbf_export_set(self):
        """Test DBF import."""
        data.append(self.john)
        data.append(self.george)
        data.append(self.tom)
        data.headers = self.headers
        _regression_dbf = (b'\x03r\x06\x06\x03\x00\x00\x00\x81\x00\xab\x00\x00'
            b'\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00'
            b'\x00\x00\x00FIRST_NAME\x00C\x00\x00\x00\x00P\x00\x00\x00\x00\x00'
            b'\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00LAST_NAME\x00\x00C\x00'
            b'\x00\x00\x00P\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00'
            b'\x00\x00GPA\x00\x00\x00\x00\x00\x00\x00\x00N\x00\x00\x00\x00\n'
            b'\x08\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\r'
        )
        _regression_dbf += b' John' + (b' ' * 75)
        _regression_dbf += b' Adams' + (b' ' * 74)
        _regression_dbf += b' 90.0000000'
        _regression_dbf += b' George' + (b' ' * 73)
        _regression_dbf += b' Washington' + (b' ' * 69)
        _regression_dbf += b' 67.0000000'
        _regression_dbf += b' Thomas' + (b' ' * 73)
        _regression_dbf += b' Jefferson' + (b' ' * 70)
        _regression_dbf += b' 50.0000000'
        _regression_dbf += b'\x1a'
        if is_py3:
            # If in python3, decode regression string to binary.
            #_regression_dbf = bytes(_regression_dbf, 'utf-8')
            #_regression_dbf = _regression_dbf.replace(b'\n', b'\r')
            pass
        try:
            self.assertEqual(_regression_dbf, data.dbf)
        except AssertionError:
            index = 0
            found_so_far = ''
            for reg_char, data_char in zip(_regression_dbf, data.dbf):
                #found_so_far += chr(data_char)
                if reg_char != data_char and index not in [1, 2, 3]:
                    raise AssertionError(
                        'Failing at char %s: %s vs %s (found %s)' % (
                        index, reg_char, data_char, found_so_far))
                index += 1
    def test_dbf_format_detect(self):
        """Test the DBF format detection."""
        _dbf = (b'\x03r\x06\x03\x03\x00\x00\x00\x81\x00\xab\x00\x00'
            b'\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00'
            b'\x00\x00\x00FIRST_NAME\x00C\x00\x00\x00\x00P\x00\x00\x00\x00\x00'
            b'\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00LAST_NAME\x00\x00C\x00'
            b'\x00\x00\x00P\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00'
            b'\x00\x00GPA\x00\x00\x00\x00\x00\x00\x00\x00N\x00\x00\x00\x00\n'
            b'\x08\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\r'
        )
        _dbf += b' John' + (b' ' * 75)
        _dbf += b' Adams' + (b' ' * 74)
        _dbf += b' 90.0000000'
        _dbf += b' George' + (b' ' * 73)
        _dbf += b' Washington' + (b' ' * 69)
        _dbf += b' 67.0000000'
        _dbf += b' Thomas' + (b' ' * 73)
        _dbf += b' Jefferson' + (b' ' * 70)
        _dbf += b' 50.0000000'
        _dbf += b'\x1a'
        _yaml = '- {age: 90, first_name: John, last_name: Adams}'
        _tsv = 'foo\tbar'
        _csv = '1,2,3\n4,5,6\n7,8,9\n'
        _json = '[{"last_name": "Adams","age": 90,"first_name": "John"}]'
        _bunk = (
            '¡¡¡¡¡¡¡¡£™∞¢£§∞§¶•¶ª∞¶•ªº••ª–º§•†•§º¶•†¥ª–º•§ƒø¥¨©πƒø†ˆ¥ç©¨√øˆ¥≈†ƒ¥ç©ø¨çˆ¥ƒçø¶'
        )
        self.assertTrue(tablib.formats.dbf.detect(_dbf))
        self.assertFalse(tablib.formats.dbf.detect(_yaml))
        self.assertFalse(tablib.formats.dbf.detect(_tsv))
        self.assertFalse(tablib.formats.dbf.detect(_csv))
        self.assertFalse(tablib.formats.dbf.detect(_json))
        self.assertFalse(tablib.formats.dbf.detect(_bunk))
    def test_csv_format_detect(self):
        """Test CSV format detection."""
@@ -517,11 +716,11 @@ class TablibTestCase(unittest.TestCase):
        _tsv = '1\t2\t3\n4\t5\t6\n7\t8\t9\n'
        _bunk = '¡¡¡¡¡¡---///\n\n\n¡¡£™∞¢£§∞§¶•¶ª∞¶•ªº••ª–º§•†•§º¶•†¥ª–º•§ƒø¥¨©πƒø†ˆ¥ç©¨√øˆ¥≈†ƒ¥ç©ø¨çˆ¥ƒçø¶'
-        self.assertEqual(tablib.detect(_yaml)[0], tablib.formats.yaml)
+        self.assertEqual(tablib.detect_format(_yaml), 'yaml')
-        self.assertEqual(tablib.detect(_csv)[0], tablib.formats.csv)
+        self.assertEqual(tablib.detect_format(_csv), 'csv')
-        self.assertEqual(tablib.detect(_tsv)[0], tablib.formats.tsv)
+        self.assertEqual(tablib.detect_format(_tsv), 'tsv')
-        self.assertEqual(tablib.detect(_json)[0], tablib.formats.json)
+        self.assertEqual(tablib.detect_format(_json), 'json')
-        self.assertEqual(tablib.detect(_bunk)[0], None)
+        self.assertEqual(tablib.detect_format(_bunk), None)
    def test_transpose(self):
@@ -538,6 +737,15 @@ class TablibTestCase(unittest.TestCase):
        self.assertEqual(second_row,
                   ("gpa",90, 67, 50))
    def test_transpose_multiple_headers(self):
        data = tablib.Dataset()
        data.headers = ("first_name", "last_name", "age")
        data.append(('John', 'Adams', 90))
        data.append(('George', 'Washington', 67))
        data.append(('John', 'Tyler', 71))
        self.assertEqual(data.transpose().transpose().dict, data.dict)
    def test_row_stacking(self):
        """Row stacking."""
@@ -594,6 +802,25 @@ class TablibTestCase(unittest.TestCase):
        self.assertEqual(third_row, expected_third)
    def test_remove_duplicates(self):
        """Unique Rows."""
        self.founders.append(self.john)
        self.founders.append(self.george)
        self.founders.append(self.tom)
        self.assertEqual(self.founders[0], self.founders[3])
        self.assertEqual(self.founders[1], self.founders[4])
        self.assertEqual(self.founders[2], self.founders[5])
        self.assertEqual(self.founders.height, 6)
        self.founders.remove_duplicates()
        self.assertEqual(self.founders[0], self.john)
        self.assertEqual(self.founders[1], self.george)
        self.assertEqual(self.founders[2], self.tom)
        self.assertEqual(self.founders.height, 3)
    def test_wipe(self):
        """Purge a dataset."""
@@ -611,6 +838,26 @@ class TablibTestCase(unittest.TestCase):
        self.assertTrue(data[0] == new_row)
    def test_subset(self):
        """Create a subset of a dataset"""
        rows = (0, 2)
        columns = ('first_name','gpa')
        data.headers = self.headers
        data.append(self.john)
        data.append(self.george)
        data.append(self.tom)
        #Verify data is truncated
        subset = data.subset(rows=rows, cols=columns)
        self.assertEqual(type(subset), tablib.Dataset)
        self.assertEqual(subset.headers, list(columns))
        self.assertEqual(subset._data[0].list, ['John', 90])
        self.assertEqual(subset._data[1].list, ['Thomas', 50])
    def test_formatters(self):
        """Confirm formatters are being triggered."""
@@ -680,18 +927,7 @@ class TablibTestCase(unittest.TestCase):
        # add another entry to test right field width for
        # integer
        self.founders.append(('Old', 'Man', 100500))
-
+        self.assertEqual('first_name|last_name |gpa   ', unicode(self.founders).split('\n')[0])
        self.assertEqual(
            """
 first_name|last_name |gpa   
 ----------|----------|------
 John      |Adams     |90    
 George    |Washington|67    
 Thomas    |Jefferson |50    
 Old       |Man       |100500
 """.strip(),
            unicode(self.founders)
        )
    def test_databook_add_sheet_accepts_only_dataset_instances(self):
@@ -719,5 +955,30 @@ Old       |Man       |100500
        except tablib.InvalidDatasetType:
            self.fail("Subclass of tablib.Dataset should be accepted by Databook.add_sheet")
    def test_csv_formatter_support_kwargs(self):
        """Test CSV import and export with formatter configuration."""
        data.append(self.john)
        data.append(self.george)
        data.headers = self.headers
        expected = 'first_name;last_name;gpa\nJohn;Adams;90\nGeorge;Washington;67\n'
        kwargs = dict(delimiter=';', lineterminator='\n')
        _csv = data.export('csv', **kwargs)
        self.assertEqual(expected, _csv)
        # the import works but consider default delimiter=','
        d1 = tablib.import_set(_csv, format="csv")
        self.assertEqual(1, len(d1.headers))
        d2 = tablib.import_set(_csv, format="csv", **kwargs)
        self.assertEqual(3, len(d2.headers))
    def test_databook_formatter_support_kwargs(self):
        """Test XLSX export with formatter configuration."""
        self.founders.export('xlsx', freeze_panes=False)
 if __name__ == '__main__':
    unittest.main()
@@ -4,12 +4,8 @@
 # and then run "tox" from this directory.
 [tox]
-envlist = py26, py27, py32, py33, pypy
+envlist = py26, py27, py32, py33, py34, pypy
 [testenv]
 commands = python setup.py test
-deps =
+deps = pytest
    pytest
    PyYAML
    xlrd
    omnijson
@@ -1,15 +0,0 @@
 # -*- coding: utf-8 -*-
 import tablib
 d = tablib.Dataset()
 with open('/Users/kreitz/Desktop/test.json') as f:
    d.json = f.read()
 # del d[900:]
 # print d.height
 print len(d.ods)
Author	SHA1	Message	Date
Timo Furrer	3c66697280	Merge pull request #315 from cls1991/develop DataBook().load function params error.	2019-03-02 12:13:11 +01:00
cls1991	13334b7996	DataBook().load function params error	2017-10-24 19:33:35 +08:00
kennethreitz	77a5c8d3fb	Merge pull request #309 from cristiano2lopes/delegate-type-coercion-to-openpyxl Delegate type coercion to openpyxl	2017-09-11 11:51:53 -04:00
Cristiano Lopes	79d66cd250	Handle raised exceptions while setting the value on the cell Convert the value to unicode, if not enough i think the bubbling of the exception is the best thing to do	2017-09-11 13:05:31 +01:00
Cristiano Lopes	77469ef655	Delegate type coercion to openpyxl	2017-09-11 12:47:51 +01:00
kennethreitz	e21676d3bd	Merge pull request #302 from kirsn/patch-2 `Nose` link corrected	2017-08-29 22:58:35 -04:00
Kiran Subbaraman	7c59e1ae86	`Nose` link corrected It now points to https://github.com/nose-devs/nose	2017-08-28 13:41:10 +05:30
kennethreitz	2814fbc381	v0.11.3	2016-02-16 08:49:28 -05:00
kennethreitz	9ca1d4ec54	Merge pull request #220 from kennethreitz/master Master	2016-02-16 08:46:37 -05:00
kennethreitz	abbb4e32d8	update footer in docs	2016-02-16 08:29:17 -05:00
kennethreitz	f6e757d569	v0.11.2	2016-02-16 08:27:59 -05:00
kennethreitz	9ba0451843	Merge pull request #219 from timofurrer/bugfix/export-only Fix export only formats	2016-02-16 08:17:56 -05:00
Timo Furrer	d99db57d75	Fix export only formats Formats like LaTeX could have never been exported because `setattr(cls, set_%s % fmt.title, fmt.import_set)` always failed for export-only formats and with that the exception was caught in the outer try/except and the format tuple was set to (None, None) with `cls._formats[fmt.title] = (None, None)`	2016-02-15 19:29:46 -08:00
kennethreitz	2299c00883	Merge pull request #216 from cmhofer/develop Error: frzn_col_idx not defined	2016-02-08 18:27:39 -05:00
Claudio Mike Hofer	5ba6f5d91a	frzn_col_idx not defined As in #53 already solved. Freeze panes set to A2 again.	2016-02-08 17:30:33 +01:00
kennethreitz	bbdf5f11ab	v0.11.1, fix packaging error	2016-02-07 13:46:03 -05:00
kennethreitz	851ba25702	Update README.rst	2016-02-07 11:00:14 -05:00
kennethreitz	039272b274	docs cleanup	2016-02-07 10:56:29 -05:00
kennethreitz	d6a7832e60	v0.11.0	2016-02-07 10:53:25 -05:00
kennethreitz	e51c4faec7	smarter detect_format function	2016-02-07 10:43:38 -05:00
kennethreitz	f7fc3244ee	updated history	2016-02-07 10:43:19 -05:00
kennethreitz	53d69bd3ea	fix __unicode__	2016-02-07 08:09:10 -05:00
kennethreitz	fcc9700d11	Fix for transpose().transpose() with duplicate keys #199	2016-02-07 07:29:08 -05:00
kennethreitz	1ec9c18a66	$ make test	2016-02-07 07:09:34 -05:00
kennethreitz	99c28fa560	Merge pull request #206 from kontza/develop Two 'raise AttributeError' converted to Python 3 -friendly format.	2016-02-07 07:09:12 -05:00
kennethreitz	fa7fb579fd	Merge pull request #193 from jhermann/patch-1 Formats .tsv and .html are implemented by now	2016-02-07 07:03:25 -05:00
kennethreitz	be24de19dc	Merge remote-tracking branch 'origin/develop' into develop	2016-02-07 07:01:46 -05:00
kennethreitz	1d4f4b68ca	cleanup	2016-02-07 07:01:13 -05:00
kennethreitz	8debeb26ac	Merge branch 'develop' into import_export # Conflicts: # tablib/core.py # tablib/formats/_csv.py # tablib/formats/_xlsx.py	2016-02-07 07:00:55 -05:00
kennethreitz	38e1ee6c3d	Merge pull request #186 from hdzierz/develop Added a mechanism to avoid datetime.datetime issues when serializing dat...	2016-02-07 06:44:20 -05:00
kennethreitz	a774789252	/s/unique/remove_duplicates #182	2016-02-07 06:40:46 -05:00
kennethreitz	995eabad37	Merge pull request #182 from cherepski/develop Adding ability to unique all rows in a dataset.	2016-02-07 06:38:58 -05:00
kennethreitz	d90358bf69	Merge branch 'develop' of https://github.com/rabinnankhwa/tablib into develop # Conflicts: # AUTHORS	2016-02-07 06:36:12 -05:00
kennethreitz	c5920249de	python 3.2 is terrible	2016-02-07 06:32:10 -05:00
kennethreitz	9b6a73c97c	fixed stuipid test	2016-02-07 06:29:07 -05:00
kennethreitz	679bd115b6	Merge branch 'develop' of https://github.com/papisz/tablib into develop	2016-02-07 06:09:55 -05:00
kennethreitz	32cbc36fc1	Merge branch 'latex-export' of https://github.com/mloesch/tablib into develop	2016-02-07 06:08:23 -05:00
kennethreitz	8bded88559	update development guide	2016-02-07 06:01:56 -05:00
kennethreitz	f8f57a467e	updates to install guide	2016-02-07 05:56:19 -05:00
kennethreitz	a11a993955	fix documentation	2016-02-07 05:52:45 -05:00
kennethreitz	25894f2948	remove bunk file	2016-02-07 05:47:28 -05:00
kennethreitz	591b89693e	remove TODO.rst	2016-02-07 05:46:45 -05:00
kennethreitz	85d9c2497e	--universal	2016-02-07 05:46:20 -05:00
kennethreitz	eaf52b691e	Merge pull request #204 from dallagi/notabs Replace tabs with whitespaces	2016-01-27 17:19:37 -05:00
kennethreitz	6f53c5d2b9	Merge pull request #209 from jdharms/develop Small documentation fix in Dataset class	2016-01-27 17:17:38 -05:00
kennethreitz	90ee799576	Merge pull request #208 from stclair/develop Fix XLSX import	2016-01-27 17:16:22 -05:00
Iuri de Silvio	c02a21ccd2	Merge pull request #213 from go8ose/develop Add section on importing to tutorial.	2016-01-20 10:48:06 -02:00
Geoff Crompton	fa045ca114	Add section on importing to tutorial.	2016-01-18 12:13:15 +11:00
Daniel Harms	65703550c3	Small documentation fix in Dataset class	2015-11-10 14:15:37 -05:00
Wes	1fcb98f9ae	Fix XLSX import Calling import_set on an XLSX file was throwing a TypeError from Openpyxl. Openpyxl Reader load_workbook requires a file-like object as the first argument. This commit fixes the error by passing in a file-like object instead of a string.	2015-11-09 06:45:28 -07:00
Rumpu-Jussi	e2d45ecff7	More Python 3 -friendly formatting.	2015-10-27 14:46:43 +02:00
Rumpu-Jussi	47d92277cc	More Python 3 -friendly formatting.	2015-10-27 14:45:55 +02:00
Rumpu-Jussi	fdd74b5b0c	More Python 3 -friendly formatting.	2015-10-27 14:44:07 +02:00
Rumpu-Jussi	de052f0fac	Two 'raise AttributeError' converted to Python 3 -friendly format.	2015-10-27 14:33:44 +02:00
Marco Dalla G	2f3acf5af4	Added myself to authors, as indicated in README	2015-10-07 11:31:26 +03:00
Marco Dalla G	c4e8755cd2	Replaced tabs with whitespaces	2015-10-07 11:25:56 +03:00
Mathias Loesch	79dc4524a0	Added LaTeX table export format	2015-06-04 09:26:35 +02:00
Iuri de Silvio	a785d77901	Merge pull request #194 from tommyanthony/develop Fixed a compatibility bug for Python 3	2015-05-27 13:19:36 -03:00
Thomas Anthony	b3485ec942	Fixed a compatibility bug for Python 3 by adding xrange to compat.py. The code in tablib/formats/_xls.py used xrange in parsing excel spreadsheets. xrange is not a builtin for Python 3, so I've added xrange = range in compat.py and imported it in tablib/formats/_xls.py.	2015-05-26 20:06:42 -07:00
Jürgen Hermann	28b358c9da	Formats .tsv and .html are implemented by now Removed mentioning of "wanted" formats that exist.	2015-04-08 15:59:55 +02:00
Iuri de Silvio	24657520e9	Merge pull request #189 from tsroten/issue_184 Fixes Row slicing. Fixes #184.	2015-04-05 20:05:31 -03:00
Iuri de Silvio	66d9e50984	New import/export interface with dataset and databook `import_` and `export` methods and overloaded `import_set` and `import_book` functions.	2015-04-05 19:51:56 -03:00
Thomas Roten	541fba6786	Fixes Row slicing. Fixes #184 .	2015-03-28 16:14:27 -04:00
Helge	bc6398ffb0	Added a mechanism to avoid datetime.datetime issues when serializing data	2015-03-02 15:06:31 +13:00
Kevin Cherepski	dca7bc9a7d	Adding ability to unique all rows in a dataset.	2015-02-04 11:53:14 -05:00
Iuri de Silvio	2fbda0f43d	Merge pull request #176 from sramana/develop Fix import errors when installed from source	2014-11-15 16:28:57 -02:00
Ramana Varanasi	e350f9428b	Fix import errors when installed from source	2014-11-10 16:03:10 +05:30
Iuri de Silvio	68dba0a77d	Merge pull request #173 from amarandon/develop Fix JSON import example	2014-10-04 11:26:55 -03:00
Alex Marandon	028be03c2c	Fix JSON import example The example was triggering this error: JSONError: Expecting property name: line 1 column 3 (char 2) This is because JSON property names should be wrapped in double quotes. While at it, I've fixed the typo in "last_name"	2014-10-03 09:17:38 +02:00
Iuri de Silvio	e1d65ba3c8	Merge pull request #172 from thibault/patch-1 Minor typo correction	2014-09-26 16:35:29 -03:00
Thibault J.	e4cb3bcd9b	Minor typo correction Requests -> Tablib	2014-09-23 11:46:05 +02:00
Iuri de Silvio	bf9510e0c7	Merge pull request #170 from phargogh/dbf_docs_repair Cleaning up DBF API documentation	2014-09-06 09:54:21 -03:00
James Douglass	82ae3ca507	Cleaning up DBF documentation Fixing indentation issues (off by one space), which caused problems with the sphinx rendering of the DBF docstring and otherwise cleaning up the sphinx docstring.	2014-09-05 14:56:33 -07:00
rabinnankhwa	5fbdd56fba	filter row and column values	2014-08-31 00:12:44 +05:45
rabinnankhwa	f187cef5f4	adding support for creating subset of a dataset.	2014-08-30 23:52:35 +05:45
rabinnankhwa	87892d7266	used get method of dictionary instead of exception handling	2014-08-30 08:56:17 +05:45
rabinnankhwa	20e2ce5ba0	__getslice__ method of Row classcorrected	2014-08-30 08:26:08 +05:45
Iuri de Silvio	48e576954d	Merge pull request #153 from phargogh/dbf-support Support for dBase (DBF) files	2014-08-26 08:24:36 -03:00
James Douglass	a21f8187f8	Adding DBF support. Squashing two squashes. Adding DBF support Adding the DBFpy python package The DBFpy package provides basic dbf support for python. Still need to write an interface format file for tablib. Adding DBF format and imports in compat.py Adding DBF format to formats.__init__ DBF format had not been committed to formats.__init__, so I’m adding it. Adding a dbf import test Adding at test to check whether a DBF can be created properly and compare it against a regression binary string. Adding an import_set test (and renaming another) Adding an import_set test that conforms with the other import_set tests for other formats. I’m also adding an export_set function. Fixing system site-packages import Importing dbfpy from tab lib.packages instead of system site packages. Fixing a syntaxError in dbfpy/dbfnew.py Fixing an issue with ending field definitions DBFPY, when writing a DBF, terminates the field definitions with a newline character. When importing a DBF from a stream, however, DBFPY was looking only for the \x0D character rather than the newline. Now we consider both cases. Adding a test for dbf format detection Adding DBF filetype detection tests Adding tests for YAML, JSON, TSV, CSV using the DBF detection function. Handling extra exceptions in dbf detection Adding exception handling for struct.error, an exception that DBFPY raises when trying to unpack a TSV table. Since it’s not a DBF file, we know it’s not a DBF and return False. Fixing an issue with the DBF set exporting test The DBF set export test needed a bit enabled (probably the writeable bit?) before the test would match the regression output. Updating dbf interface Updating the int/float class/type checking in the dbf format file. This allows for python2 and python3 compatibility. Tweaking dbfpy to work with python3 Altering a couple of imports. Updating dbf tests for binary data compatibility Making regression strings binary and improving debug messages for dbf assertion errors. Improving file handling for python 2 and 3 Updating DBF file handling for both python 2 and 3 in the _dbf interface. Adding a (seemingly) functional dbfpy for python3 I’ve made dbfpy python3 compatible! Tests appear to pass. A significant change was made to the format detection test whereby I made the input string a binary (bytes) string. If the string is not a bytes string by the time we try to detect the format, we try to decode the string as utf-8 (which admittedly might not be the safest thing to do) and try to decode anyways. Updating imports for tablib dbf interface Now importing python2 or python3 versions as appropriate. Updating dbf package references in compat.py Cleaning up debugging print statements Updating stream handling in dbf interface Factoring the open() call out of the py3 conditional and removing the temp file before returning the stream value. Adding dbfpy3 init.py I had apparently missed the dbfpy3 init file when committing dbfpy3. Adding dbfpy and dbfpy3 to setup.py's package list Switching test order of formats Putting dbf format testing ahead of TSV. In some of my tests with numeric DBF files, I encountered an issue where the ASCII horizontal tab character (0x09) would appear in a numeric DBF. Because of the order of tabular format imports, though, format detection would recognize it as a TSV and not as a DBF. Adding my name to AUTHORS. Adding a DBF property to tab lib core Documentation includes examples on how to explicitly load a DBF straight from a file and how to load a DBF from a binary string. Also, how to write the binary data to a file. Adding DBF format notes to README Adding exclamation point to DBF section title Matching formatting of XLS section Updating setup.py to match current dev state Setup.py had been updated since I forked the tablib repo, so I’m updating setup.py to match its current structure while still maintaining DBF compatibility. Fixed callable collumn test the test was sending a list instead of a function CORE CONTRIBUTORS 🍰 @iurisilvio v0.10.0 WHEELS 3.3, 3.4 makefile for WHEELS v0.10.0 history ALL Separate py2 and py3 packages to avoid installation errors. Fix #151 Running travis and tox with python 3.4. Adding DBF support Adding the DBFpy python package The DBFpy package provides basic dbf support for python. Still need to write an interface format file for tablib. Adding DBF format and imports in compat.py Adding DBF format to formats.__init__ DBF format had not been committed to formats.__init__, so I’m adding it. Adding a dbf import test Adding at test to check whether a DBF can be created properly and compare it against a regression binary string. Adding an import_set test (and renaming another) Adding an import_set test that conforms with the other import_set tests for other formats. I’m also adding an export_set function. Fixing system site-packages import Importing dbfpy from tab lib.packages instead of system site packages. Fixing a syntaxError in dbfpy/dbfnew.py Fixing an issue with ending field definitions DBFPY, when writing a DBF, terminates the field definitions with a newline character. When importing a DBF from a stream, however, DBFPY was looking only for the \x0D character rather than the newline. Now we consider both cases. Adding a test for dbf format detection Adding DBF filetype detection tests Adding tests for YAML, JSON, TSV, CSV using the DBF detection function. Handling extra exceptions in dbf detection Adding exception handling for struct.error, an exception that DBFPY raises when trying to unpack a TSV table. Since it’s not a DBF file, we know it’s not a DBF and return False. Fixing an issue with the DBF set exporting test The DBF set export test needed a bit enabled (probably the writeable bit?) before the test would match the regression output. Updating dbf interface Updating the int/float class/type checking in the dbf format file. This allows for python2 and python3 compatibility. Tweaking dbfpy to work with python3 Altering a couple of imports. Updating dbf tests for binary data compatibility Making regression strings binary and improving debug messages for dbf assertion errors. Improving file handling for python 2 and 3 Updating DBF file handling for both python 2 and 3 in the _dbf interface. Adding a (seemingly) functional dbfpy for python3 I’ve made dbfpy python3 compatible! Tests appear to pass. A significant change was made to the format detection test whereby I made the input string a binary (bytes) string. If the string is not a bytes string by the time we try to detect the format, we try to decode the string as utf-8 (which admittedly might not be the safest thing to do) and try to decode anyways. Updating imports for tablib dbf interface Now importing python2 or python3 versions as appropriate. Updating dbf package references in compat.py Cleaning up debugging print statements Updating stream handling in dbf interface Factoring the open() call out of the py3 conditional and removing the temp file before returning the stream value. Adding dbfpy3 init.py I had apparently missed the dbfpy3 init file when committing dbfpy3. Adding dbfpy and dbfpy3 to setup.py's package list Switching test order of formats Putting dbf format testing ahead of TSV. In some of my tests with numeric DBF files, I encountered an issue where the ASCII horizontal tab character (0x09) would appear in a numeric DBF. Because of the order of tabular format imports, though, format detection would recognize it as a TSV and not as a DBF. Adding my name to AUTHORS. Adding a DBF property to tab lib core Documentation includes examples on how to explicitly load a DBF straight from a file and how to load a DBF from a binary string. Also, how to write the binary data to a file. Adding DBF format notes to README Adding exclamation point to DBF section title Matching formatting of XLS section Updating setup.py to match current dev state Setup.py had been updated since I forked the tablib repo, so I’m updating setup.py to match its current structure while still maintaining DBF compatibility. Fixed callable collumn test the test was sending a list instead of a function CORE CONTRIBUTORS 🍰 @iurisilvio v0.10.0 WHEELS 3.3, 3.4 makefile for WHEELS v0.10.0 history ALL Separate py2 and py3 packages to avoid installation errors. Fix #151 Running travis and tox with python 3.4.	2014-08-21 22:06:42 -07:00
Iuri de Silvio	8479df725e	Fix some http schemes to follow page scheme.	2014-08-10 11:47:13 -03:00
Iuri de Silvio	333deb2311	Merge pull request #160 from ustun/patch-1 Typo	2014-07-30 08:55:06 -03:00
Ustun Ozgur	0b714f21e1	Typo	2014-07-30 14:46:50 +03:00
Iuri de Silvio	ae730b00b1	Merge pull request #154 from fusionbox/freeze-panes Only freeze the headers row, not the headers columns	2014-06-26 14:00:12 -03:00
Iuri de Silvio	84e8b0384f	Merge pull request #155 from fusionbox/update-unicodecsv Update the vendored unicodecsv to fix None handling	2014-06-24 22:42:56 -03:00
Gavin Wahl	7a2842a8af	Update the vendored unicodecsv to fix None handling The old version of unicodecsv incorrectly (according https://docs.python.org/2/library/csv.html#csv.writer) encoding None values as the string 'None', instead of the string '' as the python documentation specifies. The newest version of unicodecsv has fixed this. Fixes #121	2014-06-24 15:22:12 -06:00
Gavin Wahl	954bbdccf3	Only freeze the headers row, not the headers columns Fixes #53	2014-06-16 15:31:00 -06:00
Iuri de Silvio	7acaa8460d	Running travis and tox with python 3.4.	2014-05-27 21:18:14 -03:00
Iuri de Silvio	84e7e251ae	Separate py2 and py3 packages to avoid installation errors. Fix #151	2014-05-27 19:25:15 -03:00
Kenneth Reitz	dc868eff31	ALL	2014-05-27 12:52:57 -04:00
Kenneth Reitz	43356e908c	v0.10.0 history	2014-05-27 12:52:43 -04:00
Kenneth Reitz	f7acc19523	makefile for WHEELS	2014-05-27 12:51:51 -04:00
Kenneth Reitz	c5972db8f0	Merge branch 'develop'	2014-05-27 12:51:30 -04:00
papisz	70716fdd21	CSV custom delimiter support	2014-04-09 22:35:56 +02:00
kennethreitz	2bc6122ee8	Merge pull request #113 from dec0dedab0de/master Fixed callable column test	2014-01-08 11:44:06 -08:00
James Patrick Robinson Jr	1aafc7e2f4	Fixed callable collumn test the test was sending a list instead of a function	2013-08-28 14:03:58 -04:00