#internetarchive

frenchhope@diaspora-fr.org
odhage@diasp.nl

Internet Archive
IA Files Notice of Appeal in Publishers’ Lawsuit Against Libraries

Earlier this month, the Internet Archive submitted its notice of appeal in the case of Hachette v. Internet Archive. In March, when the decision was first handed down, we expressed our belief that the lower court had made factual and legal errors—a blow to all libraries and the communities we serve.

Through the Controlled Digital Lending (CDL) program, the Internet Archive and other libraries make and lend out digital scans of print books in our collections, subject to strict technical controls. Each book loaned via CDL has already been bought and paid for, so authors and publishers have already been fully compensated for those books. Nonetheless, publishers Hachette, HarperCollins, Wiley, and Penguin Random House sued the Archive in 2020, claiming incorrectly that CDL violates their copyrights. Our Controlled Digital Lending (CDL) program preserves traditional library lending in the digital world, ensuring that communities can utilize critical resources, that cultural artifacts can be shared, and that students everywhere can access books. We’re committed to defending the rights of libraries to serve their patrons online.

Thus our fight continues. We understand that this will be a demanding process, but it is essential to combat this attack to safeguard the existence of library collections in the digital era. Brewster Kahle, the Founder and Digital Librarian of the Internet Archive, conveyed a statement regarding the appeal in a blog post on our website.
#InternetArchive
#freedomfight
#library

aktionfsa@diasp.eu

22.08.2023 Musikindustrie klagt gegen Internet Archive

Copyright auch auf mehr als 70 Jahre alte Musik

Im Internet Archive lassen sich seit den 1990er-Jahren ältere Versionen von Websites aufrufen. Damit leistet das Archiv einen wichtigen Dienst für die Forschung aber auch für Journalisten, die sich für Veränderungen im Netz interessieren. Mit ihrer Arbeit stößt das Archiv aber auch auf Gegenwehr. Erst kürzlich gab es eine Einigung mit 4 Großverlagen, die gegen den Verleih von digitalen Büchern in der Corona-Zeit vorgegangen waren.

Nun klagen mehrere große Plattenlabels, der Vorwurf lautet: Das Online-Archiv hätte sich mit der Veröffentlichung der Aufnahmen alter Schellakplatten massiver Verletzungen des Urheberrechts schuldig gemacht. Während bei den Musikverlagen ähnliche Aufnahmen digital nachbearbeitet wurden, hat das Archiv sie mit allem Rauschen, Knistern und Knacken veröffentlicht.

Die Musikverlage sehen sich bei 3.000 Musiktiteln von derzeit mehr als 400.000 Aufnahmen verletzt und erklären einen erlittenen Schaden von bis zu 412 Millionen US-Dollar. Die Klage wegen mehr als 70 Jahre alter Musiktitel ist nur möglich geworden, nachdem mit dem „Music Modernization Act“ die Laufzeit des urheberrechtlichen Schutzes für Aufnahmen verlängert worden war. Über ein ähnliches Vorgehen auf Drängen des Disney Verlags bei Büchern hatten wir schon vor einiger Zeit berichten müssen (Copyright-Verlängerung auf 144 Jahre in den USA ).

Vergleicht man die Summe von 412 Millionen Dollar mit den Zahlen der Aufrufe der beanstandeten Titel von meist unter 10 pro Monat, so sieht man, dass die Aufnahmen fast ausschließlich zu Forschungszwecken genutzt werden, während Musikliebhaber bei den qualitativ bearbeiteten Kopien der Musikindistrie fünfig werden. Der von der Industrie vorgegeben Schaden wird auch in 1000 Jahren nicht eintreten.

Mehr dazu bei https://netzpolitik.org/2023/wegen-100-jahre-alter-schallplatten-musikindustrie-verklagt-internet-archive/
Kategorie[21]: Unsere Themen in der Presse Short-Link dieser Seite: a-fsa.de/d/3vN
Link zu dieser Seite: https://www.aktion-freiheitstattangst.org/de/articles/8499-20230822-musikindustrie-klagt-gegen-internet-archive.htm
Link im Tor-Netzwerk: http://a6pdp5vmmw4zm5tifrc3qo2pyz7mvnk4zzimpesnckvzinubzmioddad.onion/de/articles/8499-20230822-musikindustrie-klagt-gegen-internet-archive.html
Tags: #InternetArchive #Copyright #Klage #Musikindistrie #Buchverlage #Informationsfreiheit #Meinungsfreiheit #Pressefreiheit #Internetsperren #Netzneutralität #OpenSource #ACTA #TTIP #IPRED #SOPA #CETA #Urheberrecht

beaubobobonobo@diaspora.psyco.fr

The Internet Archive kept many books and media lost to time alive for us. So many pieces of lost media, including vintage films, books someone uploaded from their personal collection, political news that got scrubbed off the world, to name a few.

But alas, this is the gardening company deciding to burn down the seed vaults because nobody will buy their packaged seeds.

By the way, if they believed they could get away with it, they would definitely try to outlaw physical libraries as well.

#InternetArchive #bookpublishers #CopyrightInfringement #DigitalLending #ebooks #libraries
(https://arstechnica.com/tech-policy/2023/03/publishers-beat-internet-archive-as-judge-rules-e-book-lending-violates-copyright/)

dredmorbius@diaspora.glasswings.com

Internet Archive's Wayback Machine APIs

Say, for the sake of argument, that you've^†^ spent a few weeks trying to ensure that a set of URLs were archived at the Internet Archive's Wayback Machine, and you're aware that not all of those URLs were in fact archived, but you^†^ aren't sure just which ones were or were not.

The question might occur to you^†^, "Is there some way of testing whether or not a particular URL has or has not been successfully archived? Preferably in an automated manner?"

And the answer to that question would be YES!!! Yes there is!!!

What you^†^ are looking for are the Internet Archive Wayback Machine APIs, and specifically:

Wayback Availability JSON API

This simple API for Wayback is a test to see if a given url is archived and currenlty accessible in the Wayback Machine. This API is useful for providing a 404 or other error handler which checks Wayback to see if it has an archived copy ready to display.

Quoting from the Archive:


The API can be used as follows:

http://archive.org/wayback/available?url=example.com

which might return:

{
    "archived_snapshots": {
        "closest": {
            "available": true,
            "url": "http://web.archive.org/web/20130919044612/http://example.com/",
            "timestamp": "20130919044612",
            "status": "200"
        }
    }
}

if the url is available. When available, the url is the link to the archived snapshot in the Wayback Machine At this time, archived_snapshots just returns a single closest snapshot, but additional snapshots may be added in the future.

If the url is not available (not archived or currently not accessible), the response will be:

{"archived_snapshots":{}}

https://archive.org/help/wayback_api.php

It's also possible to query for a specific timestamp, though not AFAICT for saves within a date range.

You^†^ are now running that check on a set of 1300 or so URLs you'd^†^ hoped to have saved in the past two months or so.


See Also: Data Migration Tips and Questions.


Notes:

†: And my "you" I of course me "me".


#DataMigration #InternetArchive #APIs #Joindiaspora #JoindiasporaCom #Pluspora #Archival

dredmorbius@diaspora.glasswings.com

Multiple failures of archive requests of my Joindiaspora content to Internet Archive

I'm seeing failures in archiving my posts to the Internet Archive from Joindiaspora

I'm using generated indices of my Joindiaspora posts to ensure that those posts are federated to Glasswings, as well as archived at the Internet Archive (which tends to be generally discoverable) and Archive.Today.

I'm submitting those requests manually for now.

I've seen two successive instances of failures to archive what are public posts, with the Internet Archive apparently unable to find the content by the URL.

IA's response when using the https://web.archive.org/save/<URL> submission is:

Not Found
The requested URL was not found on the server. If you entered the URL manually please check your spelling and try again.

See screenshot.

The posts in question:

(This has just begun occurring, there may be more to come, I'm keeping tabs on failures as they occur.)

Archival at Archive.Today has been succeeding, see:

Whether this represents an issue on the part of Archive.org, or actiions by Joindiaspora's administrator(s), who've ... been increasingly personally hostile to me in recent weeks, is not clear.

Again, this is my own content, and manual requests.

#Joindiaspora #InternetArchive #WaybackMachine

dredmorbius@diaspora.glasswings.com

Observation on archival sites: Archive.Today vs. Internet Archive

Some of my followers may have noted I've been archiving a number of older posts from my previous account of late....

In doing this, I've noticed a few things about Archive.Today (a/k/a Archive.Is) and the Internet Archive's Wayback Machine.

It turns out that Archive.Today is really convenient to invoke with DDG set as my default search engine as I simply highlight the navigation bar for a page, prepend the "!ais" bang search to the head of the URL (followed by a space) and hit return.

Archive.Today helpfully offers links for other potential archive sites, including the Internet Archive, so I don't have to independently call up that URL.

Archive.Today responds very quickly. There's a practically instant response that the page is or is not archived, and if not, the "save" form also pops up nearly instantly.

By contrast, the Internet Archive takes a few seconds to respond whether or not the page is archived, and a few further seconds when requesting a page be saved.

(Both sites have a two-stage submission. The Internet Archive does have a submission URL which should work in one fell swoop, though it occasionally breaks and error-detection is ... difficult.)

Archive.Today's processing queue ranges from 0 to 10k or so slots.

The Internet Archive is currently reporting ~10 hours to process archival requests.

AT does include comments on Diaspora* posts. IA does not.

My manual workflow has evolved to:

  • Pull up page, reload in Diaspora* (otherwise cookies may not be current, forcing a log-out / log-in cycle, also annoying).
  • Mark the post "tagged" to indicate it's been archived. I typically also "like" it to set a sharper visual indicator.
  • Prepend '!ais ' to the navigation bar and hit <enter>.
  • Open "Search in Internet Archive" in a new tab, then select that tab to get IA working on finding the post.
  • Switch back to the Archive.Today tab and select save, then confirm. At that point the request is processing.
  • Switch back to the Internet Archive tab, wait for the page to fully load, request archive, wait for that page to load, confirm, and wait for the request to return.
  • Even after this stage, the IA request may still fail. Detecting this is ... difficult.

I may also save content from the original (JoindiasporaCom) address, though mostly I'm working through Glasswings. I have run an automated submission of all my posts from the take-out JSON archive, and will run that another time or so before final shutdown. That will at least preserve post content online, but not the comments threads :-(

Hopefully this information may be useful to others.

#Archival #WebArchival #ArchiveIs #ArchiveToday #InternetArchive #WaybackMachine