• Questions and discussion related to the archive belong here
  • Threads can be posted without images
  • Keep it legal and safe for work
  • No off-topic threads pls

Images have different hashes on 4chan vs archive

No.1313 ViewReplyReportDelete
For whatever reason, all of the images on the archive now seem to have a slightly different filesize and a slightly different image hash from what the actual images on 4chan do/did

You can download an image off of a current /v/ thread (say this the OP image here: https://boards.4channel.org/v/thread/640320202 / https://is2.4chan.org/v/1686513022851627.png / hash: ceb6T3DlA6WH9lxxp6ImLA) and it will have a different filesize, a different hash (and as a result no results will show up here on arch.b4k.co if you search by hash) vs the same image arch.b4k.co: (https://arch.b4k.co/v/thread/640320202/ / https://arch-img.b4k.co/v/1686513022851.png / hash: 95IXBfWOf-YJavsGwZnkew)

This even seems to be retroactive: I can't search a file I have on my computer i've posted to /v/ in the past via the file hash search now, because now the hashes of those images is different from what I have saved locally, even though the file should be the same and I could get results via searching that file by hash before (and if I search by filename instead, it works fine)

tl;dr:

arch.b4k.co seems to have changed the all of the images in the archives which now breaks searching by hash using the original 4chan uploads.

What's going on here? Can this be reversed? Do images on the archive have extra lossy compression now?
2 posts omitted

March 19 - April 05 archives disappeared?

No.1685 ViewReplyReportDelete
It seems like archives between this time period have completely vanished
1 post omitted

No.1680 ViewReplyReportDelete
Is it possible to read threads with no deleted posts?
God damn, I'm sick of that gore and scat poster in the Fire Emblem threads.

No.1670 ViewReplyReportDelete
So it looks like every once in awhile, they change the way images are stripped/optimized, and that means a whole new set of MD5s to worry about.
It'll be, you saved an image awhile back, drag it to the MD5 window to search for it, and it'll only show you old results, or sometimes no results at all.
The only way to get the new MD5 would be to post it and re-save it, which is stupid.
This really messes with image search, and there ought to be a way to deal with this.

No.1662 ViewReplyReportDelete
Is searching dead or is it coming back at a later time?
3 posts omitted

No.1675 ViewReplyReportDelete
is it possible to search for part of a longer string?
eg. Hello in HelloWorld / HelloX

No.1567 ViewReplyReportDelete
Is there going to be an uptick in storage costs because of the mass of AI pics that now get posted on a regular basis?
5 posts omitted

No.1607 ViewReplyReportDelete
Is there a way to search for a post containing exclusively the word(s) youre searching for.

No.1628 ViewReplyReportDelete
Sorry if it's a dumb question, I don't 100% know how archiving works, but is there any way the /vg/ archive could be updated to include all the posts from archived.moe? That one has basically every single post all the way up to the first one archived (rather than only up to mid-2019 like this one), but searching is disabled. Is it possible to include all the posts from that in this one and make them searchable? Would that take up way too much storage?

Did the images just get purged?

No.1611 ViewReplyReportDelete
I'm collecting images from dalle threads to archive them and intend to upload them after theyve been sorted properly for a public archive at some point.

It seems now that all images are now 404, many threads that Ive archived are now reporting every image is 404.

Did they get purged or is something else happening?

Example thread from nov , everything 404

https://arch.b4k.co/v/thread/659083396

But very recent threads, such as this from mid-december on seem ok

https://arch.b4k.co/v/thread/660722446

No.1617 ViewReplyReportDelete
Can you search exclusively for posts with the name field filled (i.e. any name EXCEPT FOR Anonymous)?

No.1604 ViewReplyReportDelete
i haven't checked this archive in ages, what happened to everything before 2019?
trying to find some old /vp/ posts

No.1562 ViewReplyReportDelete
There's been a recurring bug where using certain parameters, such as wildcard asterisks (*) or pipes (|) while searching can result in the search to load indefinitely. What gives?

No.1583 ViewReplyReportDelete
Searches on /v/ archive isn't working

Error! The search backend returned an error.
>Error! The search backend returned an error.