项目作者: cbare

项目描述 :
A project to look at filings to the FCC on net neutrality
高级语言: Python
项目地址: git://github.com/cbare/FCC-filings.git
创建时间: 2017-05-17T04:47:49Z
项目社区:https://github.com/cbare/FCC-filings

开源协议:GNU General Public License v3.0

下载


FCC-filings

A project to look at public comments to the FCC on net neutrality.

The FCC takes public comment through its Electronic Comment Filing Systems and also makes available a nice API for the ECFS, at least for now. To use it, you’ll need to sign up for an API key.

Getting the filings

You know you want them. Here’s how to get them as a series of .json files with 1000 filings each.

  1. python3 python/get_filings.py --limit 1000 --max 10000 --start 500000 -o my_directory

Read in the filings

OK, now fire up ipython and read them and concatenate them into an array.

  1. import glob
  2. import json
  3. import os
  4. def read_filings(output_dir):
  5. filings = []
  6. path = os.path.join(output_dir, '*.json')
  7. for filename in glob.iglob(path):
  8. print(filename)
  9. with open(filename, encoding='utf-8') as f:
  10. filings += json.load(f)['filings']
  11. return filings
  12. filings = read_filings('data/may_15')

There are some nice analyses of this data set by these folks:

  • Jeffrey Fossett
  • @nhf/whats-up-with-all-of-those-identical-comments-on-the-fcc-net-neutrality-docket-105835f59c3e">Nathaniel Fruchter
  • @csinchok/an-analysis-of-the-anti-title-ii-bots-463f184829bc">Chris Sinchok

The issue of spam-bots in the comment filings got some press coverage:

More general information on net neutrality: