General Discussion
WIRED: Yes, Donald Trump, the FBI Can Vet 650,000 Emails in Eight Days
https://www.wired.com/2016/11/yes-donald-trump-fbi-can-vet-650000-emails-eight-days/
and from the WSJ editorial page:
http://www.wsj.com/articles/the-political-mr-comey-1478476250
Response to kpete (Original post)
X_Digger
(18,585 posts)
If you've cataloged the IDs of the existing emails and dropped them into a db, this becomes a simple 'where unique id not in (select id from table)'.
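To make the "not in" idea concrete, here is a minimal sketch using Python's built-in sqlite3 module. The table names (`known`, `incoming`), the function name, and the sample IDs are all made up for illustration; nothing here reflects how the FBI actually stores or compares mail.

```python
import sqlite3

def find_new_ids(known_ids, incoming_ids):
    """Return incoming IDs that are not already cataloged, sorted."""
    conn = sqlite3.connect(":memory:")  # throwaway in-memory database
    conn.execute("CREATE TABLE known (id TEXT PRIMARY KEY)")
    conn.execute("CREATE TABLE incoming (id TEXT)")
    conn.executemany(
        "INSERT OR IGNORE INTO known VALUES (?)", [(i,) for i in known_ids]
    )
    conn.executemany(
        "INSERT INTO incoming VALUES (?)", [(i,) for i in incoming_ids]
    )
    # The anti-join from the post: keep only IDs absent from the catalog.
    rows = conn.execute(
        "SELECT DISTINCT id FROM incoming "
        "WHERE id NOT IN (SELECT id FROM known) ORDER BY id"
    ).fetchall()
    conn.close()
    return [row[0] for row in rows]

# Hypothetical message IDs, for illustration only.
new_ids = find_new_ids(["<a@x>", "<b@x>"], ["<a@x>", "<c@x>", "<c@x>"])
```

With 650,000 rows and a primary key on the catalog table, a query like this should finish in seconds on ordinary hardware.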
backscatter712
(26,355 posts)
First, screen out the ones that didn't involve Clinton or her staffers. That means searching the To, From, Cc, and Bcc headers, which probably eliminated 99% of them right there.
Next, screen out the duplicates of what has already been investigated. That probably screens out 99+% of what was left.
And the FBI also has software similar to what universities use to flag papers for plagiarism - searching for snippets of text from a database of classified material. They ran that too.
There was probably all of three emails left after screening the irrelevant crap out, and they probably had nothing but baby pictures or appointments.
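The header screen and the plagiarism-style snippet search described above could look roughly like this in Python. This is only a sketch under assumptions: the function names, the five-word shingle size, and the sample addresses are invented, and word shingling is a stand-in technique, not a claim about what software the FBI runs.

```python
from email import message_from_string
from email.utils import getaddresses

def involves(raw_email, addresses_of_interest):
    """True if any To/From/Cc/Bcc address is in the (lowercase) set."""
    msg = message_from_string(raw_email)
    fields = []
    for name in ("To", "From", "Cc", "Bcc"):
        fields.extend(msg.get_all(name, []))
    found = {addr.lower() for _, addr in getaddresses(fields)}
    return bool(found & addresses_of_interest)

def shingles(text, size=5):
    """Set of overlapping runs of `size` consecutive lowercase words."""
    words = text.lower().split()
    return {" ".join(words[i:i + size]) for i in range(len(words) - size + 1)}

def flag_snippets(body, reference_shingles, size=5):
    """Shingles of `body` that also appear in the reference material."""
    return shingles(body, size) & reference_shingles

# Made-up example data.
raw = "From: alice@example.com\nTo: bob@example.org\nSubject: hi\n\nhello there"
relevant = involves(raw, {"bob@example.org"})
reference = shingles("the quick brown fox jumps over the lazy dog", size=3)
hits = flag_snippets("I saw the quick brown fox today", reference, size=3)
```

Anything that passes the header screen and produces a non-empty `hits` set would be what gets handed to a human reviewer.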
And the Drumpfucks were all thinking that this was The One! The set of emails that, This Time, would really ensure that their hated Hillary would finally get her turn on the chopping block!
Their tears of disappointment are sweet nectar to me.
yardwork
(61,588 posts)
In other words, Comey probably already knew there was nothing there when he wrote his letter "reopening" the case.
This was a blatant attempt to influence the election, probably designed to protect Republican control of Congress.
ProfessorGAC
(64,960 posts)
You and I are almost always on the same page, but that WSJ piece is a hit job on Clinton and the Obama administration. Yeah, it takes Comey to task too, but it does an awful lot of smearing that isn't directed at Comey at all.
TrogL
(32,822 posts)
Take a file of the 650,000 emails and a file of the emails you already have off Clinton's server.
Extract the headers off the emails you have and take a checksum of the contents. If you've got the RAM, this might fit in an array, otherwise put it in a lookup file.
For each of the 650,000 emails, extract the header, take a checksum of the contents, then see if it matches something in the lookup table. If it does, write it to a "matches" file. Otherwise examine the header and see if it's government business, then write that to a "govt" file.
Then rummage through the government file looking for anything interesting.
With the duplicates gone, I'd expect a nearly empty file.
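For anyone who wants to see the checksum approach spelled out, here is a rough Python sketch. Several things are assumptions of mine: SHA-256 as the checksum, whitespace normalization before hashing, in-memory lists instead of "matches" and "govt" files, and the toy `looks_governmental` test, which just looks for ".gov" in the address headers.

```python
import hashlib
from email import message_from_string

def body_checksum(raw_email):
    """SHA-256 of the message body with runs of whitespace collapsed."""
    msg = message_from_string(raw_email)
    if msg.is_multipart():
        # Concatenate the leaf parts of a multipart message.
        body = "".join(
            part.as_string() for part in msg.walk() if not part.is_multipart()
        )
    else:
        body = msg.get_payload()
    normalized = " ".join(body.split())
    return hashlib.sha256(normalized.encode("utf-8", errors="replace")).hexdigest()

def looks_governmental(msg):
    """Toy stand-in for the 'is it government business' header check."""
    headers = " ".join(
        str(msg.get(name, "")) for name in ("To", "From", "Cc", "Bcc")
    )
    return ".gov" in headers.lower()

def triage(known_emails, new_emails, is_government=looks_governmental):
    """Split new emails into duplicates of known mail and government mail."""
    lookup = {body_checksum(raw) for raw in known_emails}  # fits in RAM here
    matches, govt = [], []
    for raw in new_emails:
        if body_checksum(raw) in lookup:
            matches.append(raw)
        elif is_government(message_from_string(raw)):
            govt.append(raw)
    return matches, govt
```

A set of 650,000 hex digests is well under a few hundred megabytes, so the lookup table fits in memory on a normal machine. The `govt` list is the part a person would then rummage through.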