Changelog

3.0.1 #322

Technical changes
- Fix build.

3.0.0 #320

Technical changes
- Migrate config to pyproject.toml.
- Comment out conda build.
- Add a openfisa_survey_manager.paths module to store paths (breaking change).

2.3.5 #319

Technical changes
- Fix the conda build by moving the Numpy pinning to install_requires.
- Add a conda build step in CI to inform the dev about a problem before merging.

2.3.4 #317

Technical changes
- add "numpy >=1.24.2, <2.0" in setup.py to solve conda build

2.3.3 #316

Technical changes
- Solve an issue caused by #299 when there is a target_entity_count without variables of the same entity.
- Correct the CHANGELOG which had two errors.

2.3.2 #315

Technical changes
- Transform input arrays of Enums variables in EnumArray type to improve computation speed due to changes in Openfisca-core 42

2.3.1 #300

New features
- Add a new distance function method, the hyperbolic sinus. It allows to cap the weight ratio with a one dimensionnal parametring, and with less computation issues than with the logit method.
- It is described in Calmar 2 documentation

2.3.0 #299

New features
- Allows for calibration of variables defined in two different entities ; it requires that one is included in the other (ex : individuals and fiscal household : each individual is in only one fiscal household). To use it, a variable identifying in the smaller entity the identifier of the larger one is needed (the name must be given in parameter "id_variable_link").
- Reshape the input of calmar and calibration (with dicts more than dataframes to allow several entities)
- Adapt the tests The idea comes from Insee's Calmar 2 documentation.

2.2.7 #311

Bug fix
- Fix filepath and table fetcher when using parquet format.

2.2.6 #308

Minor change.
- Update OpenFisca-Core to 43.0.0

2.2.5 #305

Minor change.
- Add docstrings and types to some tests.

2.2.4 #304

Minor change.
- Add docstrings and types to some tests.

2.2.3 #297

Minor change.
- Update configarser to 8.0.0

2.2.2 #302

Minor change.
- Update OpenFisca-Core to 42.0.0

2.2.1 #298

For compute_aggregate, replace the warning by an assert when weighted = True (the default) and no weight is defined for the variable. Before, a warning was displayed and the aggregate was computed using uniform weights.

2.2.0 #295

Improve support for parquet file format : - If a parquet file is provided to build-collection it will be cleaned an added to the collection as a parquet file. - If the option keep_original_parquet_file is passed the parquet file is kept and added to the collection as is as is.

2.1.0 #277

New features
- Support parquet file format :
  - if a parquet file is provided to build-collection it won't be converted in HDF5 but added to the collection as is.
  - It is possible to provide a folder containing many files : each files will be used as a separate table.
- Run a simulation on a part of the input datasets (for the moment it works only for two entities simulations) :
  - With a batch size option
  - With a filter_by option

2.0.10 #285

Technical changes

Some arguments for calibration were not consistently named.

2.0.9 #294

Technical changes
- Add config files directory option in set_table_in_survey

2.0.8 #292

Technical change
- With the security enforcement of PyPi, we have to use token instead of login/password.

2.0.7 #293

Technical changes
- Improve correction done in https://github.com/openfisca/openfisca-survey-manager/pull/286/files

2.0.6 #291

Technical changes
- Fix the script check-version-and-changelog.sh to fix issue #288
- Upgrade CI actions

2.0.5 #286

New features
- Allows sub-annual weighted aggregates in compute_aggregate when the weights are annual.

2.0.4 #283

Correction function compute_pivot_tables in ReformScenario

2.0.3 #282

Correction typo error in create_data_frame_by_entity

2.0.2 #280

Add a try/except for the tests config in openfisca_survey_manager/init.py

2.0.1 #279

Technical changes

Fix names of ids columns when the merge option is True in create_data_frame_by_entity.

2.0.O #273

Breaking changes

This is a major refactoring of the AbstractSurveyScenario object and affects other related objects.

Refactor AbstractSurveyScenario
Create ReformScenario
Monkey patch openfiscca_core.simulations.Simulation and openfisca_core.simulations.simulation_builder.SimulationBuilder.
Adapt AbstractAggregates accordingly

Rationale

The main goal was to separate the different steps to produce an impact analysis on survey or administrative data and to create a more flexible tools to deal with different use case. To do so, we performed the following changes:

Create a generic AbstractSurveyScenario that can hande as many simulations as needed.
Move to the appropriate (lower) level the methods to load the data or perform some calculation, mainly:
- Monkey patch the Simulation objects to deal all loading and calculation using pandas that are not available in the original openfisca_core.simulations.Simulation object which rely solely on numpy (and will not change anytime soon for good reason)
- Monkey patch the SimulationBuilder to add the needed methods to init the simulation from tabular data.
Create a ReformScenario that retains the main characteristics of the old AbstractSurveyScenario
Adapt AbstractAggregates to these new scenarios. Might need more refactoring to be more generic, but works with actual use case mainly openfisca-france-data.

Migration

Users of AbstractSurveyScebario should use ReformScenario.
Use attribute period instead of year.
The generic simulation initialisation from survey data goes through the method Simulation.new_from_tax_benefit_system with a data dict argument with new keys as collection, id_variable_by_entity_key, role_variable_by_entity_key, used_as_input_variables to mimic at the simulation level what was done before this PR at the scenario level.

1.1.9 #274

Technical changes
- Adapt config files location to private Ci-runs of LexImpact (hack like everything dealing with config files location definition).

1.1.8 #272

Technical changes
- Use openfisca-core version >= 41.0.3.

1.1.7 #271

Technical changes
- Set pandas dependency to version >= 2.0.3, <3.0.

1.1.5 #265

Technical changes
- Use find_namespace_packages and instead of find_packages in setup.py

1.1.4 #264

Technical changes
- Change importlib metadata import to work with all Python version

1.1.3 #263

Technical changes
- Use importlib instead of pkg_resources to avoid deprecation warnings

1.1.2 #262

Technical changes
- Remove old CircleCI continuous integration configuration
- Set README CI badge to current GitHub Actions CI

1.1.1 #261

Technical changes
- Fix Conda build step in publish-to-conda GitHub Actions job
  - Define OpenFisca-Survey-Manager package dependencies once for PyPI and conda
    - Use setup.py general requirement and extra requirements for conda package
    - Adapt tables library name to pytables for conda
- Build conda package from repository sources instead of PyPI .tar.gz

1.1.0 #260

New features

Add options in inflate_parameters and inflate_parameter_leaf:
- start_update_instant : Instant of the year when the inflation should start, if different from January 1st
- round_ndigits : number of digits in the rounded result
Adjustment of inflate_parameters to use it with parameter leaf

1.0.2 #259

Technical changes
- A parameter config_files_directory exist but it is not used evrywhere, this PR generalize it.
- Add tests using this parameter.

1.0.1 #257

Technical changes
- In GitHub Actions workflow, fixes the check-for-functional-changes → deploy → publish-to-conda jobs sequence
  - Fix the activation of the deploy job by fixing how it gets check-for-functional-changes output status
  - Allow the activation of publish-to-conda job that needs the deploy job
- Add conda configuration files to non functional files for CI

1.0.0 #252

Technical improvement
- Impacted periods: all.
- Impacted areas: all.
- Details:
  - Upgrade every dependencies & use their latest versions

0.47.2 #249

Technical changes
- Fix default_config_directory for use with openfisca-france-data in a CI

0.47.1 #246

Bug fix
- Debug france data ci (fixes 0.47.0)

0.47.0 #245

Technical changes
- Fix default_config_directory for use with openfisca-france-data in a CI

0.46.19 #244

Technical changes
- Bump to publish package

0.46.18 #243

Technical changes
- Bump to publish package

0.46.17 #242

Technical changes
- Bug fix in SurveyCollection.load

0.46.16

CI test

0.46.15 #236

Technical changes
- Put back test in CI
- Fix coveralls config fot GitHub Actions
- Add a test for create_data_frame_by_entity
- Bump Actions and Python version to fix warnings

0.46.14 #234

Technical changes
- Convert every cells of a column to string.

0.46.13 #233

Technical changes
- Correcting the code asking for the period before it's instated
- Checking the new period assignment

0.46.12 #232

Technical changes
- Deal with Nan in Enum variables

0.46.11 #227

Technical changes
- Add build of a tar.gz
- Add a make entry for build
- Move CI from Circle CI to GitHub Action (Except make test that run only on CircleCI)

0.46.10 #229

Technical changes
- Add tar.gz to PyPi
- Add display readme to PyPi

0.46.9 #228

Technical changes
- Refactor tables method to mutualize code
- Save variables in table survey data

0.46.8 #226

Technical changes
- Add a set seed in mark_weighted_percentiles, so that when a survey scenario with a baseline and a reform is run, variables which use this function take the same value for a given entity between the baseline and the reform.

0.46.7 #227

Technical changes
- Handle explicitly SAS related dependecy.

0.46.6 #224

Bug fix
- Using pyreadstat instead of SAS7BDAT which is no more the canonical way to read sas files into pandas dataframes.

0.46.5 #223

Bug fix
- Deal with HDF5 file opening strict policy in build-collection

0.46.4 #219

Technical changes
- Better handling of CategoricalDtype in input data

0.46.3 #217

Bug fix
- Deal with HDF5 file opening strict policy

0.46.2 #214

New features
- Introduce AbsstractSurveyScenario.calculate_series

0.46.1 #211

Technical changes
- Improve dialect detection for csv files

0.46 #210

Technical changes
- Hack to deal with encodings and delimiter not detected by pandas.read_csv

0.45 #143

Technical changes
- In compute_marginal_tax_rate allow for automatic aggregation on group entity when target and varying variables entity are not the same and the varying variable entity is a person one.

0.44.2 #208

Bug fix
- Fix typo.

0.44.1 #207

Bug fix
- Fix aggregates export to html.

0.44 #206

New feature
- Ability to export aggregates to html.

0.43 #135

New feature
- Introduce aggregates.

0.42.3 #189

Technical changes
- Accept categorical columns in input data frames to initialize Enum variables.

0.42.2 #204

Technical changes
- Add on sub-periods when creating a quantile on a larger period

0.42.1 #200

Bug fix
- Let numpy dependence come from openfisca-core

0.42.0 #198

New feature
- Allow to build collections/surveys from csv files

0.41.3 #196

Bug fix
- Enforce HDF store closing when done

0.41.2 #194

Bug fix
- Enforce us of np.array for weights and filters when computing aggregates

0.41.1 #187

Update dependencies

0.41.0 #185

New features
- Add a method to compute quantile
- Extend the computation of marginal tax rate

0.40.1 #185

Technical improvement
- Introduce weighted option in compute_aggregate and compute_pivot_table
- Change weights to alternative_weights in compute_aggregate and compute_pivot_table

0.40.0 #184

Technical improvement
- Add weights keyword argument to compute_aggregate and compute_pivot_table
Improve documentation
- Use googl style in docstring
- Add some docstring

0.39.1 #178

Bug fix
- Fix inflate that inflated twice when baseline_simulation == simulation

0.39.0 #170

Add statistical helpers to compute top and bottom shares

0.38.3 #XXX

Fix _set_used_as_input_variables_by_entity

0.38.2 #162

Update pytables and numpy dependencies

0.38.1 #158

Clarify documentation on configuration directory and build-collection command

0.38.0 #156

New features
- Introduce survey_scenario.generate_performance_data(output_dir)
  - This generates a performance graph and CSV tables containing details about execution times of OpenFisca formulas

0.37.3 #157

Technical changes
- Add tables library to default requirements
Add documentation for users installing, configuring and running the module for the first time

0.37.2 #155

Technical changes
- Improve error mesage in build_collection (fix previous version)

0.37.1 #154

Technical changes
- Improve error mesage in build_collection

0.37.0

Technical changes
- Add ignorecase argument to Survey.get_values

0.36.3 #152

Technical changes
- Fix asof for TaxScale
- Use simulation.get_known_periods instead of Holder's method in summariaze_variable

0.36.0 #152

Technical changes
- Create collections directory when it is missing

0.35.2 #150

Technical changes
- Fix assets inclusion

0.35.1 #149

Technical changes
- Fix deprecation in pandas.
- Fix stripping of coicop categories

0.35 #148

Introduce some functions to deal with coicop nomenclature

0.34 #147

Better handling of categorical variables

0.33 #145

Convert string-like columns to category and save to HDF files in table mode

0.32.1 #144

Fix typo (remove quotes) in inflate

0.32 #143

Remove python 2 unicode marks u" and u'.

0.31 #140

Group column dropping since DataFrame.drop is expensive.

0.30.1 #137

Fix bug in input data loader

0.30.0 #136

Adding description
Adding function documentation.

0.29.0 #134

New features
- Introduce compute_marginal_tax_rate.

0.28.0 #133

Fix _set_used_as_input_variables_by_entity
Add missing custom_input_data_frame before initializing the data
Fix entity ids setting

0.27.0 #132

Technical changes
- Fix create_data_frame_by_entity
- Fix some deprecations

0.26.0

New features
- Neutralized variables are now correctly handled by summarize_variable
- Extend testing to doctest

0.25.0 #126

New features
- create_data_frame_by_entity is able to handle expressions for filtering (filter_by can be an expression)
- This allow compute_aggregate and compute_pivot_table to handle expressions as well for filter_by.
Deprecations
- Deprecate helper get_entity
- Deprecate helper get_weights

0.24.0 #127

Fix a bug in create_data_frame_by_entity

0.23.0 #124

Rename weight_column_name_by_entity to weight_variable_by_entity

0.22.0 #123

Add github templates

0.21.0 #122

Use SimulationBuilder.join_with_persons to initialize entites

0.20.0 #120

Adapt to SimulatioBuilder shipping with openfisa-core v34

0.19.1 #107

Fix set_table_in_survey

0.19.0 #103

Add a --path PATH option to build-collection

0.18.5 #101

Add documentation to init_from_data
Split setters to gain readability

0.18.4

Add badges to help and reassure users/contributors

0.18.3

Update setup.py with missing dependencies

0.18.2

Incrementally fixing Calibration

0.18.1

Lower version for pandas dependency

0.18.0

Add a way of creating a raw_data.ini file on Google Colab

0.17.5

Add pyxdg as a core dependency

0.17.4

Fix config.ini initialisation
Remove ptyhon2 tests

0.17.3

Update MANIFEST.in

0.17.2

Minor change in dependencies to allow more flexibility and use in Binder

0.17.1

Minor Python 2/3 compatibility string issue fixed

0.17

Fix, test and document calmar

0.16.5

Fix asof
Clean Makefile
Clean style

0.16.4

Use simulation.delete_arrays introduced by openfisca-core version 24.10.0

0.16.2, 0.16.3

Add stata file conversion helper for survey scenarios

0.16.1

Rename test_random_data_generator method to create_randomly_initialized_survey_scenario
Improve doc tests
use pytest instead of nosetest

0.16.0

Provide summarize_variable for enums

0.15.3

Fix asof

0.15.2

Test tagging

0.15.1

Remove travis config file

0.15.0

openfisca-survey-manager can be used with both python 2 and 3

0.14.2

Use simulation.set_input introduced by openfisca-core version 24.5.0

0.14.1

Use weightedcalcs to compute quantiles

0.14.0

Introduce new tools: asof extract from any tax_benefit_system its variables and parameters as of some date

0.13.0

Introduce new option : add the count_non_zero value in the aggfunc argument of compute_aggregate

0.12

Introduce new SurveyScenario methods:
- dump_simulations: dumps the `survey_scenario simulations
- restore_simulations: retores previously dumped `survey_scenario simulations

0.11.1

Fix travis tests

0.11.0

Add legislation parameters inflator

0.10.1

Cleaner checks for travis use

0.10

Migrate to a new method to pass data to SurveyScenario

0.9.10

Add a difference argument for compute_aggregate (fixes #45)

0.9.9

Add trace and debug attributes to AbstractSurveyScenario to use with new_simulation

0.9.8

Create directory for config templates files

0.9.7

Remove unused imports

0.9.6

Add a Quantile class inheriting for Variable

0.9.5

Pandas deprecates the use of sort_index for sort_values
Numpy deprecates use of np.float with issubdtype

0.9.4

Fix bug when initialising mono-entity (person-only) TaxBenefitSystem

0.9.3

Fix difference pivot_table computation
Clarify code (use variable instead of column) and add some doctring

0.9.2

Hack to custom default_config_files_diretory at CASD when using taxipp

0.9.0

Migrate to openfisca-core v20 syntax
Fix a bug in create_data_frame_by_entity

0.8.13

Migrate to openfisca-core v14.1.2 syntax

0.8.12

Fix a bug resulting from pandas v0.20 pivot_table fix

0.8.11

Decrease logs verbosity by starting using the DEBUG level more often

0.8.10

Fix a bug in compute_pivot_table which was no more able to compute non-difference pivot-table

0.8.9

Fix a bug when variables are missing form the tax and benefit system in create_entity_by_dataframe

0.8.8

Improve handling of difference option in create_entity_by_dataframe

0.8.7

Improve create_entity_by_dataframe by adding expressions and merge options and enhancing filter_by

0.8.4

Fix summarize_variable when dealing with neutralized variables

0.8.3

Add humanize to dependencies

0.8.2

Fix a bug when output_cache is unset

0.8.1

Add automatic push to PyPi

0.8.0

Improve compute_aggregates and compute_pivot_table

0.6.1

Fix config_files_directory default in utils

0.6.0

Adapat to new syntax (progressive elimination of entity_key_plural)

0.5.2

Fix path of entry point build-collection
Add entry point build-collection

### 0.5.1

Fix tagging

## 0.5

Create Changelog.md
Check version and changelog when pushing

Files

CHANGELOG.md

Latest commit

History