jonny saunders @json_dirs, Twitter Profile

jonny saunders @json_dirs

4 years ago

More fun publisher surveillance: Elsevier embeds a hash in the PDF metadata that is *unique for each time a PDF is downloaded*, this is a diff between metadata from two of the same paper. Combined with access timestamps, they can uniquely identify the source of any shared PDFs.

73 3K 7K 0 2K

Download Image

jonny saunders @json_dirs

4 years ago

You can see for yourself using exiftool. To remove all of the top-level metadata, you can use exiftool and qpdf: exiftool -all:all= <path.pdf> -o <output1.pdf> qpdf --linearize <output1.pdf> <output2.pdf> To remove *all* metadata, you can use dangerzone or mat2

11 121 819 0 144

Sebastian S. Cocioba🪄🌷 @ATinyGreenCell

4 years ago

@json_dirs @naturepoker1 You're doing the lord's work, Jonny.

1 0 15 0 0

Stef @[email protected] Christensen @Wikisteff

4 years ago

@json_dirs It's entertaining that the big E thinks that someone who can write sci hub won't be able to scrub metadata

0 1 7 0 0

Kristov Atlas @kristovatlas

4 years ago

@json_dirs What are the odds that people inserting nonces into PDF metadata are also inserting invisible image watermarks that survive conversion?

1 0 2 0 0

pbs @secretmeowth

3 years ago

@json_dirs @wydna00 PDFs are incredible

0 0 0 0 0

Michael Sobrepera @mjsobrep

4 years ago

@json_dirs There is a @zotero pdf cleaning plug-in begging to be made here.

2 0 56 0 2

Petar @pvtodorov

4 years ago

@json_dirs Lol I'm going to go through my Zotero library, extract all of these, and add then to my fat GDPR request for Elsevier

2 1 34 0 0

jon @jon_roelofs

4 years ago

@json_dirs I wonder how smart their DMCA takedown logic is. If you construct a new pdf with different content but that same hash, will their system still issue the paperwork?

3 2 24 0 0

Dodge This Security @shotgunner101

4 years ago

@json_dirs Reminds me of a writeup I seen once of how social media sites like Facebook track who downloaded and shared and image, who uploaded it, etc. There was a special identifier Facebook adds to the processing of the image to track each of those actions and link it all back.

1 2 20 0 1

David S Chang @dschan02

4 years ago

@json_dirs The academic publishing system is actively harming scientific and medical progress. It's pure evil. Prestige journals need to go, publish-or-perish needs to go.

0 2 14 0 0

Todd Carpenter tac_NISO at social.niso.org @TAC_NISO

4 years ago

@json_dirs This is a cross-industry system developed by @STMAssoc to address changes in EU Copyright law to actually facilitate sharing. It was discussed in a @scholarlykitchn post last May, which describes how it works and why. It is not for surveillance scholarlykitchen.sspnet.org/2021/05/17/stm…

3 0 16 0 4

Violeta Calleja Solanas @VioletaCalleja

4 years ago

@json_dirs Any idea if Elsevier checks that in the PDFs you upload to your Mendeley library?

1 1 11 0 1

John Muccigrosso [email protected] @jdmuccigrosso

4 years ago

@json_dirs OK, got a bug in my ear over this & wrote a little python script that doesn't require linearizing & maybe increasing the file size as it removes the metadata. Option to keep author/title. Also updated the applescript droplet to use this instead of pdf. github.com/Jmuccigr/scrip…

1 1 11 0 7