1130 points by danso 2626 days ago | 154 comments on HN
| Mild positive
Contested
Editorial · v3.7· 2026-02-28 11:41:05 0
Summary Information Access & Democratic Transparency Champions
This article celebrates the Congressional passage of the OPEN Government Data Act (H.R. 4174) in December 2018, positioning public information access as foundational to democratic governance. The content directly champions Article 19 (freedom of information) and Article 21 (democratic participation) of the UDHR, framing government data transparency as a canonical principle enabling informed citizenship. The editorial and structural signals strongly align with human rights values of accountability and transparency.
Making public data open by default can arguable be an imporant step towards fostering societal equity. However, it needs to be not only "open", which typically means stashed away in some corner as a spreadsheet or database file, but accessible and useful to people. The UK has been pushing open data for years now and more and more institutions are now realizing this. Shameless plug for a research project that is aiming to make open data more accessible and to democratize data-science: https://data-in.place/ ...
The first problem here is the data formats state and federal governments use. You'll see a hodgepodge, but primarily MS Office $version .
The biggest problem with this terrible binary format is that metadata can leak a great deal that should have not been released. So this leads to PDF output of word/excel.
The next area is that especially local government offices have no way of setting up a data portal. I'm working on this right now, where the only way to get data out of Bloomington,IN is to do FOIA requests every week/month over the data you want. This absolutely should be available via a portal, and not locked behind "in person, mail, fax at cost of .10$ a page".
As a next step I would love to see an exploration of a legal system where a change of a law or regulation is backed by both data (data and methodology directly referenced by the law) and a description of the expected impact. Something such as:
> By changing law A related to B, we expect the increase of C to be at least D in the next E months.
If not achieved, the change is reversed/reduced. Hopefully that would allow experimentation without taking the risk of creating a system too bad in case the implementation or policy isn’t good enough.
It's good to see the US govt making an effort to step up its technical level. A vast if hidden problem in the political sphere is that most politicians do not have a technical education. This creates a serious misconfiguration of the govt alongside other centers of soft power like large cap tech companies.
There is no way around it that these companies have to work with the government to secure public interests from 21st century threats. Just today I read an article about how black hat hackers are targeting outdated industrial control systems more vigorously than ever before. The government on its own without technical upgrades cannot face down this problem in its current condition. Which is why opening up data is a beneficial thing.
Openness of data is a double edged sword. It will make malicious agents' job easier to have as much data as possible in a consistently machine readable format, but it will also help those on the other side.
If tech is one of the things that can bolster and improve government, tech needs to work in the optimal environment. Which is one with open data.
All I want for this Christmass is an authoritative list of all US federal agencies. I kid you not, there is no such list and the number of federal agencies is uncertain. From Wikipedia:
>Legislative definitions of a federal agency are varied, and even contradictory, and the official United States Government Manual offers no definition. While the Administrative Procedure Act definition of "agency" applies to most executive branch agencies, Congress may define an agency however it chooses in enabling legislation, and subsequent litigation, often involving the Freedom of Information Act and the Government in the Sunshine Act. These further cloud attempts to enumerate a list of agencies.
"The Open, Public, Electronic, and Necessary Government Data Act (AKA the OPEN Government Data Act)"
Please, could we end this obsession with backronyms in congress. Perhaps some congressperson could create a suitably titled backronym act to rid us of backronym acts. This should just be the one singular "Government Data Act" and it should be amended as needed to cover any law changes around "Government Data".
I'm biased (I work on the BigQuery team) but I'm always excited to see more public datasets made available in BigQuery: https://cloud.google.com/bigquery/public-data/. It would be great to have government data available through a variety of cloud services with free exports.
Some personal favorites among BigQuery public datasets include NOAA GHCN[0], the Census Bureau's Zip Code Tabulation Area [1], and FEC Campaign Finance [2].
public information should be open by default to the public in a machine-readable format, where such publication doesn’t harm privacy or security
federal agencies should use evidence when they make public policy
The word “should” is used in both one and two. If my time in the government taught me anything, it’s that “should” is only slightly stronger than “may”. If an instruction says “shall”, then it is required to be done.
I've been working with the GPOs api. The engineer is quite responsive on github and the api is pretty snappy. Constantly asking for feedback and releasing new features. I think we're headed in the right direction. A shame my project isn't further a long for a shameless plug.
If you're interested in informing the public on legislation, have experience on the hill or UI/UX experience hit me up. I'm just some dude with an idea who lives in a terminal. Money is secondary.
Too bad thing related to "security" won't be included.
All I want for Christmas: accountability for the DoD budget, which for 2019 will be $717 Billion, the majority of the USG's discretionary spending.
In 2015, an audit of the budget revealed $125 Billion in wasteful spending, and this was covered up. In 2016, the Office of Inspector General for DoD said that the Army made 6.5 Trillion in wrongful adjustments to its 2015 accounting. https://en.wikipedia.org/wiki/United_States_Department_of_De...
We still do not know how many contractors the DOD employs, or how much money they are paid, because the numbers are just not recorded. We do know, though, that the numbers and budgets we do have are often inaccurately reported, according to the DOD OIG.
People whine a lot about paying taxes, but the politicians that always complain about taxes are extracting record amounts of tax money for a military that is mismanaged, doesn't do its accounting properly, can't build modern fighting vehicles, and doesn't record basic information like how many people they employ.
One has to be careful here. Lots of internal details may give trolls and conspiracy theorists fodder to generate controversy and fake news, often by taking things out of context. It may slow work-flow because workers will be hesitant to write anything without expensive pre-vetting.
FWIW, the author, Alex Howard (who I'm friends-via-Twitter with), is as familiar with this as anyone. He was previously a senior analyst at the Sunlight Foundation, which is a prominent open-government organization: https://sunlightfoundation.com/author/ahoward/
The submitted post includes a link [0] to an article he wrote a couple days back, which provides more context on "How did open government data get into the US Code?", including the nitty-gritty of how the original bill was proposed in the last session but ultimately left out of legislation. Howard writes that the legislation was "one of the primary legislative priorities for me during my years as a senior analyst and then deputy director at the Sunlight Foundation"
The Federal Reserve [board] bases its data on data from the Reserve Banks (think Federal Reserve of New York)- which are private and get data from banks chartered in the region (so the New York Fed is partially owned by JP Morgan). Requiring the Federal Reserve [board] to release underlying data could get sticky and be a legal minefield.
I’ll defend this practice. It’s the only way of knowing for sure that you’re transmitting exactly the information you intend to send. Even copy/paste often picks up other stuff you don’t intend.
As long as those spreadsheets/database files are accessible to someone with technical skill, people can pull in the data and use tools to make it more accessible and useful. Ideally, yes, the data is useful to begin with, but as long as it's available, there's nothing stopping individuals with the skills from making it useful.
Of course, there are exceptions: the PDFs that are often provided by the prosecution as part of the discovery process are prohibitively difficult to deal with, and should be considered a violation of Brady vs. Maryland, IMO.
In case you haven't seen it, I think https://www.data.gov/ is an attempt to answer your point about making it "accessible and useful to people." There's room for improvement, but it's a start.
> federal agencies should use evidence when they make public policy
the cynic in me just figures that this moves the goal post such that special interest groups will adapt to produce the right evidence for the desired outcomes
>Making public data open by default can arguable be an imporant step towards fostering societal equity
I think this was one of my biggest shocks doing work for the government, collecting public data, payed by tax funded grants. Public data isn't for the public.
We went into this project with all these starry eyed dreams of making a public online database and freely posting everything we collected, with maps and interactive tools, status reports. It was part of our grant proposal.
Then reality came and we found out public data meant a government password protected database with access fees where our data would be available to people willing to pay for it or we'd lose our funding. The data were for companies or individuals willing to pay the government not for the public.
This still doesn't sit well with me nearly 6 years later. That was never what we wanted out of that project and it wasn't what was planned or accepted when we wrote our proposal.
Is it? My understanding is that although the board is appointed by the President and Congress, once appointed they don't answer to the government, so their data isn't really "government data".
But I'll be the first to admit I know very little about how the Federal Reserve works.
It's worth noting that easy to access isn't always the best. A great example of this is mugshots, which are open data, but there are now websites that automatically scrape and index people's mugshots and use SEO to rank highly on searches. They then blackmail the person to have it removed.
As long as the data is accessible and reasonable easy to get, where journalists and data scientists who really care about the data can get it, then I think we're in a good place.
I was kind of concerned right off the bat with that numbered list:
1. public information should be open by default to the public in a machine-readable format, where such publication doesn’t harm privacy or security
I'm sure literally everything that they wish to keep opaque will declared to be covered under one or both of these incredibly vague categories and nothing will significantly change. Is there any elaboration in the bill that defines what they can call a matter of privacy or security? Even if there is, it wouldn't matter much because how are people going to tell if they keep it locked down in the first place? And they would not risk any sort of real blowback for abusing this and getting caught. Tell me there have not been far, far worse scandals that resulted in no consequences for the perps and cowed silence from the public. I don't think they're hiding the X-Files in there or anything, but this won't magically cause a more transparent, just, or equitable government unless it has serious teeth and tight language.
And 2. federal agencies should use evidence when they make public policy
Somehow I wonder if the data from the Kansas experiment will be taken into consideration and turned into public policy by this current administration, or if they will cherry-pick evidence selectively to justify only wildly unpopular legislation because someone (possibly an industry with a conflict of interest?) contrives some p-hacked research to back it up. Just because something is scientific doesn't necessarily mean it's good government. It is often so, but I'm always very wary when they trot out a bill with lots of bold language touting justice and democracy, truth, stuff like that. If the US legislature passes a bill called "protect innocent puppies from being kicked in the name of god and freedom" you can be 95% sure that this bill will enable a great wave of puppy-kicking despite its holy name.
Since every law is a trade-off, having both positive and negative effects, I wish each law would enumerate the expected/possible positive and negative effects. In other words, I want the trade-off to be explicit and for the lawmakers to express why they believe the positive effects outweigh the negatives ones.
It seems like the solution here would be to have a government agency that counts how many government agencies there are. I believe that could be done with an executive order but correct me if I'm wrong.
You can access the Reserve's information online as part of the Federal Reserve System's Open Government webpage implementing the Open Government Directive from 2009: https://www.federalreserve.gov/open/open.htm
Despite all that, the Federal Reserve is not a government institution, so such legislation would be similar to singling out your business to post all information in machine readable format online.
Shameless plug: does anybody know where I could get detailed spending and receipts data for US federal budget? Including all federal agencies? There is very limited data at https://www.whitehouse.gov/omb/supplemental-materials/ but I am looking for more detailed data.
CORE ALIGNMENT: Article is centered entirely on codifying the right to public information access. Explicitly states 'public information should be open by default to the public in a machine-readable format' and celebrates this as foundational law. Multiple sources quote support for 'open data' as principle and practice.
FW Ratio: 63%
Observable Facts
Central thesis: 'Congress has passed a law to make open data part of the US Code'
Explicit principle: 'public information should be open by default to the public in a machine-readable format, where such publication doesn't harm privacy or security'
Daniel Castro quote: 'The OPEN Government Data Act will ensure that the federal government releases valuable data sets'
Sarah Joy Hays quote: 'The bill sets a presumption that all government information should be open data by default: machine-readable and freely-reusable'
Site provides free public access to all content and allows public comments
Inferences
The legislation directly instantiates the UDHR Article 19 right to seek and receive information, elevating it to statutory law
The celebratory tone and multi-stakeholder support signal recognition of information access as a fundamental democratic right
Public-access site structure reinforces the principle that information should be freely accessible
Content frames open government data as essential to democratic participation and informed citizenship. States evidence-based policymaking enables citizens to understand 'federally-funded practices, policies and programs that deliver the best outcomes.' Transparent data enables voters to hold government accountable.
FW Ratio: 57%
Observable Facts
Article frames legislation as enabling 'informed democratic participation': 'Each federal agency has an evaluation officer, an evaluation policy and evidence-building plans'
Michele Jolin quote: 'We can maximize the impact of public investments' through evidence-building, implying citizen understanding
Gavin Baker quote: 'will enable libraries to provide businesses, researchers and students with valuable data' — multiple stakeholder classes
The legislation codifies 'presumption that all government information should be open' — supporting informed electoral/civic choice
Inferences
Data transparency is positioned as prerequisite to democratic accountability and participation
The emphasis on 'evidence-based' policy suggests citizens should be able to evaluate government performance
Public comment and sharing structures on site reinforce participatory engagement
Implicitly supports equal dignity by advocating for universal access to government information, enabling informed participation regardless of socioeconomic status
FW Ratio: 67%
Observable Facts
Article celebrates 'evidence-based policymaking' that serves 'the American people' broadly
Legislative language discussed applies to all citizens: 'all government information should be open data by default'
Inferences
Equal information access supports the principle that all persons possess inherent dignity and right to participate equally
Content implies that open government data contributes to a favorable social and institutional order. References 'historic moment' and government functioning 'more efficiently, more effectively' through transparency and evidence use
FW Ratio: 67%
Observable Facts
Article describes this as 'historic moment: Congress has passed a law to make open data part of the US Code'
Michele Jolin: 'toward a more efficient, more effective government that uses evidence and data to improve results'
Inferences
Transparency is framed as contributing to well-functioning democratic institutions
Content acknowledges privacy considerations as a boundary on open data; legislation includes carve-outs 'for data that does not concern monetary policy' and recognizes privacy/security limits
FW Ratio: 67%
Observable Facts
Article explicitly states legislation applies only 'where such publication doesn't harm privacy or security'
Text notes: 'carve out in Title I for data that does not concern monetary policy'
Inferences
The inclusion of privacy safeguards shows the legislation balances information access with privacy protection
Weak alignment: content mentions researchers and students as beneficiaries of open data, implying educational value, but education itself is not the focus
FW Ratio: 50%
Observable Facts
Gavin Baker quote mentions 'will enable libraries to provide businesses, researchers and students with valuable data'
Inferences
Educational institutions and researchers are identified as beneficiaries, suggesting data access supports learning
Site structure enables information access through public content, public comments, social sharing, and subscription mechanisms; no paywalls or information barriers
'historic win for open government in the United States of America'; celebration of American leadership and bipartisan achievement
bandwagon
Multiple celebratory tweets and quotes from advocates (Data Coalition, Results for America, open data community leaders) all expressing support and excitement
appeal to authority
Quotes attributed to government officials and recognized advocacy leaders (Michele Jolin, Daniel Castro, Sarah Joy Hays, named Congressional sponsors)