INI-Tax-Analyzer

Data Preparation
Login

Tax Data Preparation Overview

This document provides an overview of how the CSV-formatted input files required by the model have been prepared. The methods used in this preparation can be refined after gaining experience using the model. This overview contains two sections:

Preparing Model Input Data for 2017

The INI-Tax-Analyzer input files (tax.csv and tax_weights.csv) are derived from publicly available data files in the TPRU-India/taxcalc repository. The details of this data derivation process can be found in the data/prep.py Python script.

Extrapolating Input Data beyond 2017

The prior section describes how the 2017 variables file and the 2017 weights file have been generated. In order to use the model in years after 2017, the weights and values of each monetary variable need to be extrapolated to subsequent years.

The variable and weight extrapolation assumptions are included in the growth_factors.csv file. These are the same assumptions as used in the TPRU-India/taxcalc model. These assumptions can be change as new historical data become available and as economic projections change.