Request for uploaders

pentel

A file name "xxxxxx_SaNet.st" will likely get deleted faster than "SaNet.st_xxxxxx.rar" depending on how search is performed.

For example the Sonic movie, those files in this format "SaNet.st_xxxxxx.rar" is still available while the Sonic movie files in this format "xxxxx" are deleted (other sites that has the Sonic movie).

pentel

SamuRa1

Files in Sanet don't get deleted faster than other sites is because the files are prefix with something else other than the actual title. It can still get discovered and deleted depending on the search pattern but it just not make it easy to find those files. To illustrate, you will find the Sonic movie files in Sanet are still not deleted but if you go to other sites where the Sonic movie name has not been altered or prefix with something else is deleted. Sanet is now probably the best site to get hold of the Sonic 2 movie.

speedfiend

pentel Thank you!

speedfiend
Anyone worried about the bots sent out to remove certain files from the internet? I've noticed a lot of slight misspellings of certain filenames to thwart them and adding "sanet" to the front might accomplish the same thing.

habiru

pentel That is a damned good point. heh...

tecnico82

so many good suggestions in this topic!
Just don't forget that cataloging the product of human knowledge is like living in the event horizon...that is, starting with a simple query like this "[Login to see the link]" you will soon find yourself unable to remember why you started cataloging in the first place. And while this is happening, the information continues to grow and the time that you are living(marked by the pace of your searches) is already passed. And there is the black hole that is so much talked about.

MIZANing

tecnico82
A limitless task, and life is short, sometimes I feel like I am lost in a maze

EbeneZ

MediaFanatic Yes, eBook-Tools.

Would have been easier if you had given the name of the tool in your previous post 😃

I will try your solution with calibre on a spare machine with a subset of books but the problem i see with the workflow you suggest is that i have first to let calibre ingest (and copy) all my books then delete all the unwanted and that's not a trivial task considered the size and struct of my library

I tried your tips for Windows explorer but the title column is empty for all the books i have downloaded this week so no use for me but cool tips anyway. For TC, it's enough for me and the interface is light, i found D opus overweight since the Amiga version, will try XYplorer (thanks) i have also a few others like multi commander and one with 4 views but the name escapes me

Back to my research i found that in fact ebook-meta (from calibre) may be enough but slow ; the ideal would be an isbn database in mysql 😀

Thank you for your patience

umbecono

EbeneZ Back to my research i found that in fact ebook-meta (from calibre) may be enough

[Login to see the link] adds metadata to the file content. You supply the metadata; ebook-meta doesn't look it up.
[Login to see the link] retrieves metadata from the web. It doesn't add metadata to the file content or the filename. nb: ebook-tools uses this to retrieve metadata.

the ideal would be an isbn database in mysql

[Login to see the link]
[Login to see the link]

I tried your tips for Windows explorer but the title column is empty for all the books i have downloaded this week

Not surprising. That column is data that was passed to the Windows shell by some provider program and saved by the shell itself. Windows doesn't do it natively, not for most file types, and ecumenical Calibre doesn't waste energy pandering to this or that particular shell, so if you want that column you'll probably have to write a shell extension yourself.

Uthred

I installed Advanced Renamer and it does really well the job. Thank you to remember me to install this great utility.

MediaFanatic

habiru ... Except it's not accurate. It's a logical assumption, it's just not true in this instance.

Obviously the discussion of DMCA avoidance cannot be had on a public forum that is Google-indexed; however, the short version is that it is a very time-consuming and costly process that is far more nuanced and complex than adding words to the filename.

Back on topic...

Innovative Renaming Options:

If we're to return to the topic of renaming (moving on from calibre, which I've covered in-depth) -- there are some alternatives that I feel are easier for beginners that would like to do "multi-step" renaming (eg: replace space-alt characters, proper-case title, remove sanet/other terms, etc -- as a single process).

One of those is is "Master Renamer" and it has one of the most powerful, yet extremely easy renaming capabilities:

There are also batch-preset "drop" tools that do similar and I suspect those would be easier/quicker for many SANET-style users. One of the most popular is a tool known as "dropit":

This "dropit" tool is similar to Den4B mentioned above. However, like most of these "file automation" tools, the rules can be more robust.

For example, move any files with video-extensions to a "videos" directory; move any files with PDF/ePub extensions to a "eBooks" directory. Plus all the renaming workflows/presets you like. Perhaps add decompression as another step.

In short, these "dropit" style tools offer more features and sit perpetually ready to fix-up your downloads instantly.

Finally .. if you want a free/easy renamer on Windows -- MS makes a very nice product that very few people know about. It's part of the MS "PowerToys". This is not the most powerful tool; however, it is one of the most responsive and quick to use:

...

From memory -- I believe all of the above solutions are free.

thebuzzard

I downloaded 4 books today without noticing they were all named "sanet.st," not "sanet.st,pdf" or "sanet.st.epub," which would have allowed me, on a Mac, to hit the <spacebar> to see an actual preview of the book cover in order to rename it. Instead, the "preview" of "sanet.st" is a blank white box. It was necessary to trial-by-error relabel them as .pdf or .epub. Now I see I have a download of a file 1.5 gb file - suggesting it is not a book - simply named "sanet st" and another file named "nsanet.st" Had I taken the suggestion to rename these files through scripting JDownloader as I downloaded, I would have files named ".st" Now, I could be wrong, but I can't imagine that any of this was the intention of this thread.

I would note, btw, that macOS has a great renaming app called "Renamer.app" which is extremely comprehensive and detailed, and after quickly previewing a file which is simply an ISBN+extension by the macOS, renaming it as I wish with Renamer.app.

umbecono

thebuzzard It was necessary to trial-by-error relabel them as .pdf or .epub.

I don't think so. Try running [Login to see the link] from a command prompt:
file sanet.st
(or whatever you renamed it).

Had I taken the suggestion to rename these files through scripting JDownloader as I downloaded, I would have files named ".st"

Are you sure? The regex matches ()()(sanet)(.s) and (n)()(sanet)(.s), respectively. Wouldn't you have files named sanet and nsanet?

macOS has a great renaming app called "Renamer.app" which is extremely comprehensive and detailed

Ah, if only it could have given you the file type too. (macOS uses filename extensions for this? Who knew?)

thebuzzard

umbecono Honestly, this is a non-issue to me as the the Renamer.app can batch remove anything by name, position in the file name (e.g. starting at the 6th character, remove 7 characters - sanet.st - without naming them), dates, ISBN, etc. As I mentioned, of the 8 or 9 files I downloaded, 5 files happened to be named the same. One large 2-part file was a music app that I renamed by adding ".part1.rar" & ".part2.rar." Unfortunately, the combined .dmg (which I found by using a hex-reader) was corrupted, even after downloading & renaming it twice. The books were either pdf or epub. I also frequently rename a file (e.g. when the file name is an ISBN) by cutting & pasting the name of the book from here to JDownloader before downloading. And so it goes...

MediaFanatic

.
(1) Windows Explorer & Metadata (eBook Title, etc)

Several years back, MS rewrote file-explorer (Vista timeframe). At that point they allowed new property-handlers where any proprietary format's metadata could be exposed.

At the same time, they began including a fair number of standardized file-format property handlers, for certain types of audio/video, images, etc. Around this same time they wrote the search-index, creating "ifilters" that allow access to a proprietary file's full content.

In the case of ePub's, they're just .ZIP files with embedded HTML, essentially. Therefore, I had assumed that ePub's were being handled intrinsically within Windows; however, it appears that in my case, SumatraPDF is handling my ePub metadata. Installing SumatraPDF should address ePubs.

Adobe and PDF-XChange, two of the more capable PDF tools, will install property handlers for your PDF's

At this point, you should be able to see more eBook metadata (Titles in this case) directly in any modern File Explorer, or in the case of TC, you may need that shell-tools addin, I'm not certain.

began supporting metadata of several common files. Around that same time, they
.

.
(2) Calibre Workaround -v- Scripted ISBN scraper

A simple Python+BS script would allow easy scraping of OpenLibrary without downloading the entire DB; however, a DB call is obviously more reliable if you don't mind the hassle of continually updating every month, re-importing, etc.

A Python+BS scrape is very easy because it's a simple HTTP call w/ISBN concatenated to the URL:
https://openlibrary.org/books/ISBN-GOES-HERE

However...

There is no need for this as the Calibre workaround is a lot less work -- and the eBook-Tools is the pre-existing alternative.

I don't understand the suggestion that the Calibre workaround is complex. Yes there is overhead; however, it's one single command to copy-out all files it process. It's one single command to delete all files it processed. That's either two CLI commands or a half-dozen clicks in a GUI.

Between book for myself, my wife, and my family -- I have about 15,000 eBooks. Both solutions work fine in my case; I simply process new eBooks in a batch, when I get around to it. That might be 500 books at a time.
.

.
(3) Bad Filename Issue

I believe [Login to see the link]'s point is that the file extension was incorrectly replaced?

Taking the last period "." in a filename, the characters to the right are assumed to be the file-extension. Because [Login to see the link]'s file was renamed "filename.sanet.st", the extension appeared as "st" rather than "pdf".

As a result, no amount of RegEx (or renaming) would have helped in his case, because the error would be on the uploader's side, where the file-extension was {accidentally?} removed.

[Login to see the link] makes an excellent suggestion of using the linux command "file". When that's not an option, it's crucial to have a quick/simple hex editor on-hand. Also, in Windows, there is a open source version of the linux file command ( [Login to see the link] ).

Don't forget to notify the uploader on the actual SA upload blog page, using the "comments" section. These comments get forwarded to the uploaders and they will often fix the issue. While that's reactive and won't help you; when enough people do this, it ultimately yields a karmatic result that helps all of us.
.

.
(4) Automating Processes to Eliminate These Issues

Don't forget about the benefit to using an automation solution to run pre-scripted workflows; I've provided this above. Just drag-in files when you download and have that run preconfigured scripts based on rules you've setup.

Also ... Don't forget about using jDownloader to rename files upon downloading, saving you from the rename issue (although not in [Login to see the link]'s case). I've explained this somewhere above, as well.

MediaFanatic

Fortunately because you use jDownloader, you don't need to cut-and-paste.

You can setup the jDownloader Packagizer Rules to rename using a field on the web page itself, such as the title you're copying manually.

If you prefer, you can go so far as to only do this when it has an ePub/Mobi/PDF (et al) ... extension.

EbeneZ

MediaFanatic You can setup the jDownloader Packagizer Rules to rename using a field on the web page itself, such as the title you're copying manually.

Is it possible here on sanet? i have just started to read the docs on packagizer

MediaFanatic

EbeneZ -

Yes, you can automate jDownloader to save time when downloading from SoftArchive.

The first step is configuring a JSON formatted string to be entered under the "Link Crawler Rules"
( Settings > Advanced Settings > Search "Link Crawler Rules" )

This is important because SA will not expose the download links unless you are logged-in. Therefore, you will need to establish a regex URL-matching string, then pass all SAnet cookies ( id, sa_remember, AdskeeperStorage, PHPSESSID ).

This will allow you to right-click any SA page and choose "Download w/jDownloader" (assuming you have the extension installed). jDownloader can then access the SA page as an authenticated user, with access to scrape the download links.

You'll also include a regex expression to recognize the download links. If you skip this step, it will grab all sorts of things that are unrelated to the download links. This is not as bad as it sounds because each page will attempt to download the same misc items and if you have proper jDownloader filters setup, they will all get ignored.

For a walkthrough:
Search for "DeepDecyrpt" or "LinkCrawler Rules" under jDownloader and there are example JSON strings they have published.

MediaFanatic

I have located a tool that will assist anyone that is unable to use Metadata fields in File Explorer.

It's a free/open source tool that will allow you to see your associations and repair or add new associations where they are missing.

File Meta Association Manager
[Login to see the link]

After you make the proper associations, you'll be able to see eBook metadata in your File Explorer (when the Metadata exists). This includes Title, Author, Book Cover Thumbnail, etc.

You can use this metadata like any other column, to sort-by, rename, etc.

Grim1

I have a problem. The new saved file name <jd:orgfilename:1><jd:orgfilename:3> turns out to be the text we want omitted i.e. sanet.

When plugging the regex code into an online tester ([Login to see the link]) the regex correctly identifies the sanet text. Could there be an error in the way I am handling the new saved file name?

Also I would prefer the regex be case insensitive as there are a range of choices uploaders apply including SaNet.ST but have not figured out how to do that. If you can provide guidance that would be great and I imagine it would simplify the code since the various sanet permutations wouldn't need explicate inclusion.

MediaFanatic

Grim1 - I'm sorry I had just typed that up quickly for the post. In addition, when I write regex for example purposes, I typically use logical ORs, because I think they are simpler for people to read. As a result, there are time I'll mix-up the capture groups.

I've created a new version that I've tested for you. This version is also case-INsensitive as you requested:
(?i)(.*?)([ ._-]{0,3})(sanet[ ._-]?(st|ws|cd|la|lc|ws|is)?)[ ._-]{0,3}(.*)

Then use replacement filename:
<jd:orgfilename:1><jd:orgfilename:5>

This regex is taking the 1st (.*?) and 5th (.*) capture group, excluding all others.

(I was forced to put that in code brackets, to be able to preserve the codes)

I've added new logic so that no matter what symbol proceeds the "sanet", follows "sanet", even if multiple symbols (eg " - ") -- and different toplevel domains (or no top-level) -- in all of these circumstances it will still catch/remove those.

« Previous Page Next Page »