Skip to content

Latest commit

 

History

History
37 lines (18 loc) · 2.04 KB

README.md

File metadata and controls

37 lines (18 loc) · 2.04 KB

Permitted Perjury: finding lying landlords in NYC

This repo contains the code that helped produce the September HRI report about building permit fraud. We found over 10,000 falsified permits!

Read about our report in the nytimes, politico, and amNY.

How to recreate the entire report and dataset

Requirements

make, fish, psql, python3, pip, pyvenv, csvkit, jq, pandoc, bbler, nycdb

See the Makefile for a better sense of what's going on. Note that downloading the data will take WEEKS!

  1. make jobs.csv

This produces a csv of all department of buildings jobs for likely rent-stabilized buildings since 2016. It requires setting up a running instance of nycdb.

  1. make possible_liars.csv

This will generate a list of potential liars by downloading tax bills and job filings pages from city websites. Expect this task to produce LOTS of errors and take many weeks. See bbler for the scripts to parse and download tax bills and job filings.

  1. make liars.csv

Filter the list of liars to those whose permit applications contain falsified or suspicious information.

  1. make liars.zip

This will create a zip file with a folder for each permit containing all the PDFs and documents so each lying landlord can be manually verify as needed.

To build the report make report

To download a list of buildings with lead paint violations: make lead_paint_bbls.csv. This is used for statistics in the report.

In the folder notebook there is a Jupiter notebook used to derive the statistics in the report.