David Rosenthal discusses the last 25 years of digital preservation efforts in regards to academic journals. It's a long-standing problem and discontinued journals continue to disappear from the Internet. Paper, microfilm, and microfiche are slow to degrade and are decentralized and distributed. Digital media are quick to disappear and the digital publications are usually only in a single physical place leading to single point of failure. It takes continuous, unbroken effort and money to keep digital publications accessible even if only one person or institution wishes to retain acccess. He goes into the last few decades of academic publishing and how we got here and then brings up 4 points abuot preservation, especially in regards to Open Access publishing.
Lesson 1: libraries won't pay enough to preserve even subscription content, let alone open-access content.
[...] Lesson 2: No-one, not even librarians, knows where most of the at-risk open-access journals are.
[...] Lesson 3: The production preservation pipeline must be completely automated.
[...] Lesson 4: Don't make the best be the enemy of the good. I.e. get as much as possible with the available funds, don't expect to get everything.
He posits that focus should be on the preservation of the individual articles, not the journals as units.
Previously:
(2020) Internet Archive Files Answer and Affirmative Defenses to Publisher Copyright Infringement Lawsuit
(2018) Vint Cerf: Internet is Losing its Memory
(2014) The Importance of Information Preservation
(Score: 2) by canopic jug on Monday September 21 2020, @04:12AM (2 children)
On top of that, PDF is a scripting language not a presentation or layout language. We'd be much better off standardizing on TeX.
Money is not free speech. Elections should not be auctions.
(Score: 0) by Anonymous Coward on Monday September 21 2020, @06:34AM (1 child)
Isn't TeX still scripting since it's Turing complete?
(Score: 0) by Anonymous Coward on Monday September 21 2020, @09:16AM
Postscript (an interpreted programming language) is also Turing Complete,
https://en.wikipedia.org/wiki/PostScript#The_language [wikipedia.org]
...but not intended to be human-written. Originally files were all ascii, but later raster images were allowed too.
Fun tip -- if you have .ps source files, they can be displayed directly by lightweight ebook reader SumatraPDF.