
View Poll Results: Is this tutorial useful?
Yes 7 58.33%
No, don't care about duplicates 4 33.33%
No, hard to follow. 1 8.33%
Multiple Choice Poll. Voters: 12.

Old 08-10-2013, 07:19 PM   #1
Artsey
Member
 
Join Date: Aug 2011
Posts: 65
Thanks: 5,435
Thanked 363 Times in 64 Posts
Deleting Duplicates

Large collections of pictures frequently have a lot of duplicates of varying quality. In some of my folders I have found as many as 15 copies of the same image. Before deleting duplicates, my Jayne Mansfield folder had 10,000 images. After the delete I had 5,000. When you browse a folder with a lot of duplicates you see the same image repeatedly, and some images you seldom see at all because you finish browsing before you get that far into a large collection.

When duplicates have the same file size or similar names, listing the folder as thumbnails arranged by size or name places them next to each other, making them easy to spot. Deleting duplicates with varying sizes and names can be an almost impossibly time-consuming task. These are some techniques to make the task faster and more reliable.
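The "arrange by size" trick can also be done outside a thumbnail viewer. Here is a minimal sketch, assuming plain Python with only the standard library. The function name group_by_size is made up for illustration and is not part of ACDSee or any tool mentioned in this thread. It only lists files that share an exact byte size, so it gives you candidates to look at, not confirmed duplicates.

```python
from collections import defaultdict
from pathlib import Path


def group_by_size(folder):
    """Return {size_in_bytes: [paths]} for every size shared by 2+ files.

    Files with the same size are only *candidates*: two different images
    can happen to have the same number of bytes.
    """
    by_size = defaultdict(list)
    for path in Path(folder).iterdir():
        if path.is_file():
            by_size[path.stat().st_size].append(path)
    # Keep only the sizes that occur more than once, with names sorted
    # so the candidates sit next to each other like in a sorted listing.
    return {size: sorted(paths) for size, paths in by_size.items() if len(paths) > 1}
```

Calling group_by_size on a folder gives you a short list of same-size groups to check by eye. It will not catch resized or recompressed copies, which is what the manual sorting below is for.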

The first thing is to sort the collection by broad, easy-to-spot criteria like bikinis, one piece, costume etc. If you have more than one or two hundred images under one criterion you should divide it further, because it takes too long to scan through much more than that to find the duplicates, and you get fatigued and miss them. You might divide the bikini collection into images with water in them and images with grass. You will probably put the bikini images back into the bikini folder after deleting the duplicates, but wait until you have worked the whole folder. You should make up more criteria, and if you invent some that work for you, share them here. I found that some criteria don't work so well. You wouldn't use something like bracelets because they are too hard to see in thumbnails. You might think that sorting by how the picture is framed, such as bust, medium or full length, might work. It doesn't, because you may file an image as "bust" one time and as "medium" the next time you see it, and then you still have a duplicate. Or someone may have cropped an image to make it a bust while you have a better-quality full version filed as medium. Also, you wouldn't want to sort into color and black and white, because you may have a great color image and not want to keep a black and white version of it.

So here's the technique using bikinis.

Go through your base folder and move all bikini images into a temporary folder. I put all bra shots in too, as it's too hard to see the difference between a bikini top and a bra in a thumbnail. Just clickity click right through the folder as quick as you can spot the bikinis. If you still have over a couple of hundred bikini images in the temporary folder, it's too many to start deleting. For Raquel Welch I had 1200. Move all the images with water in them to another folder. I call that one "sort". This got me down to 500 to 600 bikini images with water and without. Still too many. I sorted into yet another folder I call "delete". Here I put images that are very specific, like a specific bikini or photo shoot, or standing, reclining, sitting or kneeling. Now you may find that you have few enough images, fewer than 150, to begin to delete.

As you do this preliminary sorting the duplicates will become more and more visible. Resist deleting duplicates while there are over 150 images in the folder. They are visible in a quick scan, but you still have to spot them individually amongst a lot of other images, and that slows the technique down.
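If you would rather not count thumbnails by eye, the "under 150" rule can be checked with a few lines of Python. This is only a sketch: the names count_images and ready_to_delete are made up for this example, and the list of image extensions is an assumption you should adjust to your own collection.

```python
from pathlib import Path

# Assumed set of image extensions; change it to match your files.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".gif", ".bmp"}


def count_images(folder):
    """Count the image files directly inside folder (subfolders are ignored)."""
    return sum(
        1
        for path in Path(folder).iterdir()
        if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES
    )


def ready_to_delete(folder, limit=150):
    """True when the folder holds fewer than `limit` images.

    The default of 150 is the rule of thumb from this tutorial, not a
    hard requirement of any software.
    """
    return count_images(folder) < limit
```

If ready_to_delete returns False for your "delete" folder, sort it by one more criterion before you start picking out duplicates.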

To delete:

I use ACDSee for all this. You may have to adjust your technique to fit your software.

Scan through the final delete folder and, while holding the control key, click on the duplicates you spot. Select the ones you have a lot of first to get them out of the way, so later you can quickly spot the duplicates you have fewer of. Then view the selected files as a slideshow that you can advance image by image. Here you decide which ones you want to delete. I have found that I still keep some duplicates, because I may have a good-quality image that is watermarked or cropped and a lower-quality image that is clean or full. I'll also keep artistically altered duplicates like colorized versions.

I'll mention here that you should check the file size, because sometimes two copies look identical but one is more JPEG-compressed than the other, or one has a larger file size but is still identical, with no JPEG compression artifacts. In these cases simply zooming in with zoom lock and switching from one to the other may let you see which is the better image. Sometimes you might want to copy the images into an image editor to compare.

After deleting the duplicates, move the images you want to keep into the final bikini folder. This gets them out of the way so you can find more duplicates. Then go back to the delete folder and look for more duplicates.

Don't do the preliminary sorting into broad criteria all at once. Do one broad criterion, then sort that into a sub criterion, then a sub sub criterion, then delete and work your way back up the criteria chain to the base folder. For example, sort into bikinis-water-standing-delete. Then do bikinis-water-sitting-delete. Then do bikinis-water-reclining-delete. After you have deleted all the duplicates among the bikinis-with-water images, sort for bikinis-grass-standing-delete, then bikinis-grass-sitting-delete.

Well, that's all for now. If anyone thinks of additions, sorting criteria or some area where this essay could be improved or made clearer, please post.

Some additional criteria I use:

costumes
street clothes
indoor
outdoor
single
group
color of clothes

Also, when posting sub criteria, mention the main criteria and/or sub sub criteria, formatted e.g.

swimsuit-water-standing

or

Main criteria-subcriteria-sub sub criteria


Old 08-10-2013, 08:58 PM   #2
hos
no remorse.
 
Join Date: Nov 2008
Location: relocated to lurkerland.
Posts: 8,940
Thanks: 28,006
Thanked 66,617 Times in 8,189 Posts

no wonder that you have so much work to do before you have the slightest chance of finding dupes.

the first thing i would do in your situation is organize the images properly. having every image of the same model in a single folder isn't really helpful for keeping track of a collection. sadly this mess often comes from unorganized guys who share their random images in single posts / files. my recommendation, if you are a collector: do not download such posts entirely, maybe just pick the single images you really want. but for a model-based collection it's a total pita.

keep images together which belong together. from the same TV show, same gallery, same mag story etc. keep them together in a folder, create a shortcut to 1 of these images and store only the shortcut in the Model XXX gallery folder. then it's much easier to keep track of which shootings you already have - entirely or not.
__________________
Always the same pack, sunshiny dandering around like a bitch in heat to the presenters..

Last edited by hos; 08-10-2013 at 09:10 PM..
Old 08-10-2013, 10:39 PM   #3
Artsey

Quote:
Originally Posted by hos View Post
the first thing i would do in your situation is organizing the images properly.
keep images together which belong together. from same TV show, same gallery, same mag story etc. keep them together in a folder,. then it's much easier to stay on track with shootings you already have - entirely or not.
That works well for professional models with several dozen images of any given shoot and several dozen shoots in total. There are many filing methods, and different ones work better for different purposes, such as finding a particular image or simply browsing.

But for Jayne Mansfield and my other favorite movie stars there are too many different "shoots" with too few images in each. It makes a mess of dozens, if not hundreds, of folders. That is worse than combining shoots of a few images into criteria folders of a couple hundred images. With general movie stars, I only have a few images of most of them, and for most I wouldn't recognize the name or know what is in the folder. It is more efficient to combine similar stars. For instance I combine Yvonne De Carlo with Elvira, Tura Satana and Elizabeth Taylor. I combine Barbara Nichols with Diana Dors.

Also there's the problem of naming a multiplicity of folders and then remembering the name when looking for that folder, or what's in it. For a distinctive costume it's easy. Like Raquel Welch's stars and stripes Myra Breckinridge, 1,000,000 BC or Magic Christian costumes. I put all 233 1,000,000 BC costume images into one folder, and the remaining 356 costume images into a single costumes folder. It's not so easy for street clothes, glamor wear, or all the different bikinis and swimsuits.

While filing methods and priorities are related to sorting and finding duplicates, this thread is focused only on finding duplicates on your own hard drive. It is not at all about finding duplicates on VEF.

Last edited by Artsey; 08-10-2013 at 11:00 PM..
Old 08-10-2013, 11:49 PM   #4
hos

thanks for giving me another reason to not collect celebs, lol

(no pun intended)
Old 08-11-2013, 09:14 AM   #5
G-Type
Former Staff
 
Join Date: Dec 2007
Location: 36 Quai des Orfèvres
Posts: 9,407
Thanks: 94,188
Thanked 183,244 Times in 9,272 Posts

There is perhaps a solution available.

I usually upload my collections with an uploader tool named "Irada". Before you cry out that this appears to be off-topic... yes, at first sight perhaps.

Along with this (stand-alone) tool comes a smart sub-function which lets you scan folders for dupes. It detects them by computing the MD5 checksums and lists all pics that share an identical one, which is the true proof of a real dupe.

Unfortunately, it's a tool being developed in Germany, and so far the creators have not published their program in English as well.

But anyway, here is the link (as I have said, no installing required and no registry-entries, it can also be started from a USB-stick etc.)

Code:
bihe.berlios.de/page/?loc=irada
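
For anyone who can't use a German-only program, the MD5 idea described above is easy to sketch in Python with just the standard library. To be clear, this is not how Irada itself is written (its code isn't shown in this thread). The function names md5_of_file and find_exact_duplicates are made up for this example. It finds files whose bytes are identical, whatever their names, and nothing more.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the MD5 hex digest of a file, read in chunks to save memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_exact_duplicates(folder):
    """Return lists of paths under folder (subfolders included) with equal MD5."""
    by_hash = defaultdict(list)
    for path in sorted(Path(folder).rglob("*")):
        if path.is_file():
            by_hash[md5_of_file(path)].append(path)
    # Only checksums that occur more than once are duplicate groups.
    return [paths for paths in by_hash.values() if len(paths) > 1]
```

Each list returned by find_exact_duplicates is one group of byte-identical files. A resized or recompressed copy has a different checksum, so it will not show up here.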
__________________
Avec le temps va tout s´en va,
on oublie le visage et l´on oublie la voix.
Le coeur quand ça bat plus c´est pas la peine d´aller,
chercher plus loin faut laisser faire et c´est très bien.
Old 08-11-2013, 10:16 AM   #6
palo5
Super Moderator
 
Join Date: Sep 2009
Posts: 10,208
Thanks: 365,880
Thanked 119,340 Times in 10,333 Posts

Quote:
Originally Posted by GeorgeMcSoon View Post
Along with this (stand-alone) tool comes a smart sub-function which allows to scan folders for dupes. It detects them by scanning the MD5-checksums and lists all pics having the identical one which is the true proof of a real dupe
I thought there were many duplicate finders available. I use one called 'Easy Duplicate Finder' which is fast and lets you set convenient parameters, including min/max file sizes. It must have saved me hundreds of GB in wasted space over the years

The trouble is, it only detects exact matches - it doesn't matter if filenames are different, but the files themselves must be exactly the same. So if you have 2 pics/movies/files that are the same, but different sizes, it won't find them (unless there's some option I've overlooked)
Old 08-11-2013, 01:15 PM   #7
karloschaos
Senior Member
 
Join Date: Nov 2005
Posts: 450
Thanks: 3,963
Thanked 4,496 Times in 438 Posts

I use this. It's free & very easy to use. It will detect the same image with a different file size, resolution & file name, and you can vary the level of similarity right down to at least 80%, beyond which even a blind man could tell the difference. You can tell it to look within just 1 folder or across several sub-folders within a main one. For a massive folder just leave it running in the background. You can set it to auto-delete or to let you check each dup before deleting. Saves a hell of a lot of time for me.

http://www.prismaticsoftware.com/dup...pdetector.html
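
The thread doesn't say how this program measures similarity, so the following is a different, well-known technique swapped in purely as an illustration: an "average hash". It is a sketch with several assumptions. Each image is assumed to be already decoded and shrunk to a tiny grayscale grid (rows of values from 0 to 255), both grids are assumed to be the same size, and the function names are made up. Loading and shrinking real image files would need an image library, which is left out here.

```python
def average_hash(pixels):
    """Turn a small grayscale grid into an integer with one bit per pixel.

    A bit is 1 when the pixel is brighter than the grid's mean brightness.
    """
    flat = [value for row in pixels for value in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for value in flat:
        bits = (bits << 1) | (1 if value > mean else 0)
    return bits


def hamming_distance(hash_a, hash_b):
    """Count the bit positions where the two hashes differ."""
    return bin(hash_a ^ hash_b).count("1")


def similarity(pixels_a, pixels_b):
    """Fraction of matching hash bits, from 0.0 (none) to 1.0 (all)."""
    total_bits = len(pixels_a) * len(pixels_a[0])
    distance = hamming_distance(average_hash(pixels_a), average_hash(pixels_b))
    return 1 - distance / total_bits
```

With this kind of hash a slightly brighter or recompressed copy tends to keep most of its bits, so a threshold such as 0.8 plays a role similar to the 80% setting mentioned above. The two numbers are not guaranteed to mean the same thing.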
Old 08-29-2013, 03:52 PM   #8
Artsey

Just an update with some statistics on duplicates. It's turning out that, on average, 1/3 to 1/2 of my files are duplicates. I have just under half a million image files. It's still a large task, but it is manageable now and I am making progress. And I get to see many files I haven't seen in years. I probably would never have seen them again, and wouldn't have recognized them if I saw them on the internet, so I might have downloaded them again and got some more dupes.
Old 12-13-2013, 08:15 AM   #9
deepsepia
Vintage Member
 
Join Date: Jul 2007
Location: Upper left corner
Posts: 2,843
Thanks: 11,994
Thanked 24,058 Times in 2,819 Posts

For folks on the Mac, Gemini is a very good duplicate finder.
Old 12-13-2013, 09:24 AM   #10
Dimitri Yar
Former Staff
 
Join Date: Nov 2013
Location: Terra
Posts: 4,893
Thanks: 12,438
Thanked 108,446 Times in 4,603 Posts

I use Anti Twin, which is freeware for private use. That program is very good in my opinion. For picture collections I use The!Checker to avoid dupes. It is also effective. Hopefully it helps.
__________________
Life is but a dream within a dream !