Investigative Data Warehouse

The Investigative Data Warehouse (IDW) is a searchable database operated by the FBI. It was created in 2004. Much of the nature and scope of the database is classified. The database centralizes multiple federal and state databases, including criminal records from various law enforcement agencies, the U.S. Department of the Treasury's Financial Crimes Enforcement Network (FinCEN), and public records databases. According to Michael Morehart's testimony before the House Committee on Financial Services in 2006, the "IDW is a centralized, web-enabled, closed system repository for intelligence and investigative data. This system, maintained by the FBI, allows appropriately trained and authorized personnel throughout the country to query for information of relevance to investigative and intelligence matters."[1]

Overview

The database grew rapidly in its first years. In 2004, according to a government solicitation for bids to manage the project, it was approximately 10 TB in size. In 2005, according to one FBI official, the IDW contained approximately 100 million documents. In 2006 it contained more than 560 million documents and was accessible by more than 12,000 individuals. According to the FBI's website, as of August 22, 2007, the database contained 700 million records from 53 databases and was accessible by 13,000 individuals around the world.

As of 2007, the FBI is the subject of a lawsuit brought by the Electronic Frontier Foundation (EFF) over the lack of a public notice describing the database and the criteria for including personal information, as required by the Privacy Act of 1974. The lawsuit follows two Freedom of Information Act (FOIA) requests filed by the EFF in 2006.

The IDW was built in part by Chiliad Corporation,[2][3] the FBI Office of the Chief Technology Officer,[4] and others. Companies listed in the FOIA files include Northrop Grumman[5] and others.

Purpose

Investigative Data Warehouse–Secret (IDW-S) "provides data and data processing/analysis services to FBI agents and analysts as they perform counter-terrorism, counter-intelligence, and law enforcement missions". The core subsystem supports the Counter-Terrorism Division (CTD), the Special Event Unit, and via DOCLAB-S, the Joint Intelligence Committee Investigation (JICI) and IntelPlus.[6]

According to a 2005 email, "IDW will also be used for criminal and other authorized non-CT investigations as it evolves" (CT stands for counter-terrorism).[7]

Subsystems


Within the system, there were subsystems named IDW-S Core, SPT, and DOCLAB-S.[8]

The special projects team (SPT):

allows for the rapid import of new specialized data sources. These data sources are not made available to the general IDW users but instead are provided to a small group of users who have a demonstrated "need-to-know". The SPT System is similar in function to the IDW-S system, with the main difference is a different set of data sources. The SPT System allows its users to access not only the standard IDW Data Store but the specialized SPT Data Store.[9]
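
The access rule described in this passage can be sketched in a few lines of Python. This is only an illustration of the stated rule (general users see the standard data store, SPT users see both); the store names, the `User` type, and the `spt_need_to_know` flag are hypothetical and do not come from the FOIA documents.

```python
from dataclasses import dataclass

# Hypothetical store names. The source only says that a standard IDW Data
# Store and a specialized SPT Data Store exist.
STANDARD_STORE = "idw_data_store"
SPT_STORE = "spt_data_store"


@dataclass(frozen=True)
class User:
    name: str
    spt_need_to_know: bool = False  # assumed flag, not taken from the source


def accessible_stores(user: User) -> set[str]:
    """Return the data stores a user may query under the described rule."""
    stores = {STANDARD_STORE}      # every IDW user gets the standard store
    if user.spt_need_to_know:
        stores.add(SPT_STORE)      # SPT users additionally get the SPT store
    return stores


general = User("general_analyst")
spt = User("spt_analyst", spt_need_to_know=True)
print(sorted(accessible_stores(general)))
print(sorted(accessible_stores(spt)))
```

How the real system decides who has a "need-to-know" is not described in the released documents, so the boolean flag here is a stand-in for that decision.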

Privacy

According to internal emails, the FBI performed several Privacy Impact Assessments (PIAs) of the IDW system. The Bureau worked with lawyers from its National Security Law Branch (NSLB) to try to ensure that the system complied with various laws on information sharing and secrecy[10] (for example, Rule 6(e) of the Federal Rules of Criminal Procedure, regarding the secrecy of grand jury material[11]).

The Information Sharing Policy Group (ISPG) formed a Discretionary Access Control Team (DACT) to work on "approval of data sets" and "access control requirements" for IDW and DataMart, and to respond to other Intelligence Community agencies requesting access.[12]

The EFF FOIA IDW website states "Despite the vast amount of personal information contained in the IDW, the FBI has never published a Privacy Act notice describing the system or explaining the ways in which the records might be used." [13]

A 2005 email from someone in the Office of the General Counsel (OGC) mentioned "preliminary staff musings that maybe we should limit FBI PIA requirements to non-NS systems" (NS stands for National Security).[14] An email from 2006 said that 'national security systems are exempt from E-Gov',[15] apparently referring to the E-Government Act of 2002, which has a section that deals with privacy.

Data sources

The IDW used many data sources. The FOIA documents from EFF are heavily redacted, but some of the sources are as follows:

  • FBI Automated Case Support system (ACS), subset of the Electronic Case File (ECF) system[16]
  • Joint Intelligence Committee Investigation documents (JICI),[17] with OCR text [18]
  • "Open Source News" (public websites, such as the Washington Post and others)[19]
  • Secure Automated Messaging Network (SAMNet)[17]
  • Violent Gang and Terrorist Organization File (VGTOF)
  • DARPA TIDES program ('open source news' that has been organized and collected)
  • IntelPlus Filerooms, with OCR text[18]
  • FBI National Crime Information Center (NCIC)[16]
  • FBI Records Management Division (RMD), Document Laboratory (DocLab), FBIHQ
  • MiTAP [20] (collects data from public sources, websites, etc.)
  • SPT-Specific data sources (partial list, FOIA files have large parts redacted):
    • Unified Name Index (UNI) extracts
    • Financial Center (FinCen),[21] including Bank Secrecy Act data [22]
    • "Various Sources", including the Transportation Security Administration[23]
    • FBI Counterterrorism Division (CTD)
    • Telephone numbers / addresses from ACS
    • Case data from ACS
    • Terrorist Watch List (TWL)[24]
    • "Other NJTTF data"
    • DoS ... Lost/Stolen Passport data
    • No Fly List, from TSA
    • Selectee list, from TSA
    • ACS/ECF with some case types excluded
    • CIA non-TS/non-SCI Technical Discussions (TDs) and Intelligence Information Reports (IIRs) from 1978 to May 2004[25]

There was also talk of linking the FTTTF "Data Mart" with IDW.[26]

The data in IDW is classified at the 'Secret' level or lower. Data with a higher classification is not allowed in the system and can be removed.[27]
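
As a rough illustration of a classification ceiling of this kind, the sketch below separates records at or below 'Secret' from those above it. The record layout, the `enforce_ceiling` helper, and the sample identifiers are hypothetical. Only the ordering of the standard U.S. classification levels and the 'Secret' ceiling are taken as given, and nothing here reflects how the FBI actually implements the check.

```python
from enum import IntEnum


class Classification(IntEnum):
    # Standard U.S. classification levels, lowest to highest.
    UNCLASSIFIED = 0
    CONFIDENTIAL = 1
    SECRET = 2
    TOP_SECRET = 3


CEILING = Classification.SECRET  # highest level the text says IDW may hold


def enforce_ceiling(records):
    """Split (record_id, level) pairs into those kept and those removed."""
    kept = [r for r in records if r[1] <= CEILING]
    removed = [r for r in records if r[1] > CEILING]
    return kept, removed


# Hypothetical sample records for illustration only.
sample = [
    ("doc-a", Classification.UNCLASSIFIED),
    ("doc-b", Classification.SECRET),
    ("doc-c", Classification.TOP_SECRET),
]
kept, removed = enforce_ceiling(sample)
print([record_id for record_id, _ in kept])
print([record_id for record_id, _ in removed])
```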

References

Sources consulted
  • EFF. "FOIA: DOJ's Investigative Data Warehouse". Retrieved 2009-03-18.
  • EFF (October 17, 2006). "EFF Sues for Information on Huge FBI Database of Personal Information: 'Investigative Data Warehouse' Includes Hundreds of Millions of Entries". Retrieved 2009-03-17.
  • FBI (various dates). "EFF Freedom of Information Act (FOIA) files, 2008 April 8, idw02" (PDF). Electronic Frontier Foundation. Retrieved 2009-03-17. (Contains various emails from inside the FBI regarding the IDW)
  • FBI (various dates). "EFF Freedom of Information Act (FOIA) Files, 2008 Apr 8, idw01" (PDF). Electronic Frontier Foundation. Retrieved 2009-03-18. (Contains various emails from inside the FBI regarding the IDW)
  • FBI (various dates). "EFF Freedom of Information Act (FOIA) files, 2008 June 9, idw04" (PDF). Electronic Frontier Foundation. Retrieved 2009-03-18. (Contains various emails from inside the FBI regarding the IDW)
  • FBI (Sep 6, 2006). "By the Numbers: FBI Transformation Since 2001". Retrieved 2009-03-17.
  • FBI Information Resources Division (IRD) (2003-12-03). "Investigative Data Warehouse-SECRET (IDW-S), System Security Plan" (PDF). Electronic Frontier Foundation. p. 58. Retrieved 2009-03-17.
  • FBI Information Resources Division, Data and Information Management Section (2005-01-24). "Investigative Data Warehouse - Secret (IDW-S), System Security Plan" (PDF). Electronic Frontier Foundation. p. 13. Retrieved 2009-03-18.
  • FBI Information Resources Division, Data Management Section (2004-12-01). "Investigative Data Warehouse, Privileged Users Guide" (PDF). Electronic Frontier Foundation. p. 9. Retrieved 2009-03-18.
  • FBI Office of the Program Management Executive (2004-11-29). "Security Concept of Operations (S-CONOPS), Investigative Data Warehouse (IDW) Program" (PDF). Electronic Frontier Foundation. p. 50.
  • Gross, Grant (17 October 2006). "EFF files lawsuit to gain information on FBI database". Network World (reprinting IDG News Service story). Archived from the original on 24 January 2007. Retrieved 3 October 2007.
  • Morehart, Michael F.A. (May 26, 2005). "Statement of Michael F.A. Morehart, Section Chief, Terrorist Financing Operations Section, Counterterrorism Division, Federal Bureau of Investigation, Before the House Committee on Financial Services". Retrieved 2009-03-17.
  • Nakashima, Ellen (30 August 2006). "FBI Shows Off Counterterrorism Database". The Washington Post.
Endnotes
  1. Morehart 2005, op. cit.
  2. "Chiliad Case Study" (PDF). Archived from the original (PDF) on 2012-05-24. Retrieved 2009-03-18.
  3. David Gardner (2006-08-30). "FBI Shows off Counterterrorism Database". Information Week. Retrieved 2009-03-18.
  4. EFF FOIA Files, 2008 Apr 8, idw01, page 28 of linked pdf
  5. EFF FOIA files, 2008 Apr 8 idw01, page 27 of linked pdf
  6. FBI, IDW-S System Security Plan, 2005 Jan 24
  7. EFF FOIA files, 2008 Apr 8 idw02, pg 13 of linked PDF
  8. FBI, IDW-S System Security Plan, 2005 Jan 24. The FOIA documents do not make clear the difference between IDW-S and IDW, and thus whether Core, SPT, and DOCLAB-S fall under IDW or IDW-S.
  9. FBI, S-CONOPS IDW, 2004 Nov 29 page 52 of linked pdf
  10. EFF FOIA Files, 2008 April 8 idw02. Most of this FOIA release is emails within the FBI concerning PIAs
  11. EFF FOIA Files, 2008 April 8 idw02, page 73 of linked pdf. For Rule 6(e), see https://www.law.cornell.edu/rules/frcrmp/Rule6.htm (Cornell)
  12. EFF FOIA Files, 2008 April 8 idw02 pg 74, 75 of linked pdf
  13. EFF website, FOIA: DOJ's Investigative Data Warehouse
  14. EFF FOIA Files, 2008 April 8 idw02, page 10 of linked pdf. This particular email also mentions the VCF system (which was later scrapped), saying that PIAs for VCF could 'entail substantial costs'
  15. EFF FOIA Files, 2008 Jun 9 idw04, page 35 of linked pdf
  16. FBI, IDW Privileged Users Guide, 2004 Dec 1
  17. FBI, IDW-S System Security Plan, 2003 Dec 3
  18. FBI IDW Status Update, 2005 Sep 21
  19. FBI IDW Status Update, 2005 Sep 21. 'Open Source News' is, in other documents, referred to alongside MiTAP and/or DARPA TIDES.
  20. Note: Some FBI documents list DARPA TIDES, some list MiTAP, some simply say "Open Source News". They are related projects, if not perhaps the same thing.
  21. Financial Crimes Enforcement Network
  22. EFF FOIA files 2008 Apr 8 idw02, pg 8/9 of linked pdf
  23. FBI S-CONOPS IDW 2004 Nov 29 page 53 of linked pdf
  24. EFF FOIA Files, 2008 Apr 8, idw02 page 83 of linked PDF
  25. EFF FOIA Files, 2008 Apr 8, idw01, page 33 of linked pdf
  26. EFF FOIA Files, 2008 Apr 8, idw02. Page 37 of linked pdf
  27. EFF FOIA files, 2008 Apr 8, idw01, page 43 of linked pdf
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.