I previously suggested adding a feature for Intelligent Renaming Using AI.
Here, similarly, I'd like to recommend AI Based De-Duplication.
Often times files like videos can be the same but in a different file size or codec that provides no traditional way to identify a duplicate. The filename usually leaves a clue, but only for a human.
For example, take this set of files:
- Law and Order S01E01 - Pilot 720p.mp4
- LawAndOrder season 1 episode 1 1080p.avi
- Law andOrder - s1 EP1 pilot 2160p.mpg
- S01E01.Law.And.Order.1999.Pilot.1080p.h265.mkv
You as a human can tell at a glance that these are all the same thing, but good luck trying to systemically remove duplicates with any kind of traditional check.
This is where AI can come in. With new options in the Find Duplicates dialog box, you could store and use instructions that link out to a chatgpt in much the same manner I referenced in my post about Intelligent Renames (linked above.)
Directory Opus 14 could reach out to Chatgpt with a set of filenames and use pre-crafted instructions to find duplicates. Then ChatGPT would return a list of the duplicates. Directory opus would take care of the mechanics of presenting to the user the duplicates and removal but ChatGPT would do the intelligent analysis to find the dups in a situation where md5 hashing, or filesizes are of no use.
This is where I'm suggesting a new feature:
It would need to store and pass instructions like this one for TV Series: "I'm going to give you a list of tv shows and I want you to analyze the names and return to me only the sets of files that appear to be duplicate episodes. You should return the full-path and filename and you should include a unique identifier number to group the duplicates together. You should always return them in order of grouped file identifier."
In Practice, the user would be able to
- build up a list of directories they want to check for dups on the left.
- pick "AI Duplicate Check" and choose your favorite stored instructions. Then hit FIND. (providing a default would be good.)
- Directory Opus would reach out to ChatGPT via their API and hand it the list of names with the instructions.
- ChatGPT would return only the list of duplicates with a group identifier so DOPUS would no how to present the results.
- User confirms that they are indeed dups.
- User deletes the dups as desired using the traditional interface.
Noting like this exists in any file manager I'm aware of today. I hope you will consider it. Could be a major feature for the next release.