<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>duplicates &#8211; Alexander Shyrokov&#039;s blog</title>
	<atom:link href="http://sjcomp.com/blog/?feed=rss2&#038;tag=duplicates" rel="self" type="application/rss+xml" />
	<link>http://sjcomp.com/blog</link>
	<description>This&#039;s only my opinion, so keep it that way</description>
	<lastBuildDate>Mon, 21 Oct 2019 13:42:29 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.9.16</generator>
	<item>
		<title>Find duplicate files using OpenOffice.Calc</title>
		<link>http://sjcomp.com/blog/?p=334</link>
		<pubDate>Wed, 14 Oct 2009 02:35:18 +0000</pubDate>
		<dc:creator><![CDATA[Alexander Shyrokov]]></dc:creator>
				<category><![CDATA[Coding]]></category>
		<category><![CDATA[duplicates]]></category>
		<category><![CDATA[file managment]]></category>

		<guid isPermaLink="false">http://sjcomp.com/blog/?p=334</guid>
		<description><![CDATA[I had a set of directories in which some files were duplicated across several of them. A search for software to remove duplicate files did not turn up any free utilities for this task, so I decided to use OpenOffice.Calc to do the job. First I got the list of all [&#8230;]]]></description>
				<content:encoded><![CDATA[<p>I had a set of directories in which some files were duplicated across several of them. A search for software to remove duplicate files did not turn up any free utilities for this task, so I decided to use <a href="http://www.openoffice.org/product/calc.html">OpenOffice.Calc</a> to do the job. </p>
<ul>
<li>First I got the list of all the files:
<pre code>
dir /b/s >list.csv
</pre>
</li>
<li>Then I opened list.csv in <a href="http://www.openoffice.org/product/calc.html">OpenOffice.Calc</a>, which put the full paths in column A, and used this formula in column B to extract only the file name (the part after the last backslash):
<pre code>
=RIGHT(A1;LEN(A1)-SEARCH("\\[^\\]*$";A1;1))
</pre>
</li>
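<li>After that, I sorted the sheet by the file names in column B, so that identical names ended up in adjacent rows, and added a formula, starting from row 2, that flagged a name equal to the one in the row above: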
<pre code>
=IF(B2=B1;"DUP";"")
</pre>
</li>
<li>Searching for the string DUP (in values, not in formulas) quickly showed me where the duplicates were.</li>
</ul>
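<p>The steps above could also be scripted. Here is a minimal Python sketch, assuming the lines produced by dir /b/s are available as a list of strings. The function name find_duplicate_names and the sample paths are made up for illustration, and comparing names without regard to case is an assumption. Like the spreadsheet, it compares file names only, not file contents.</p>

```python
import ntpath
from collections import defaultdict


def find_duplicate_names(paths):
    """Group full paths by file name and keep only names seen more than once."""
    groups = defaultdict(list)
    for path in paths:
        path = path.strip()
        if not path:
            continue  # skip blank lines in the listing
        # ntpath splits on backslashes regardless of the OS running the script
        name = ntpath.basename(path).lower()  # assumption: ignore case in names
        groups[name].append(path)
    # Sorted by name, mirroring the "sort, then compare neighbours" step
    return {name: found for name, found in sorted(groups.items()) if len(found) > 1}


if __name__ == "__main__":
    # Hypothetical sample of what "dir /b/s" might print
    listing = [
        r"C:\photos\2008\cat.jpg",
        r"C:\photos\2009\cat.jpg",
        r"C:\photos\2009\dog.jpg",
    ]
    for name, found in find_duplicate_names(listing).items():
        print("DUP", name, found)
```

<p>Passing it the lines of list.csv instead of the sample list should report each repeated name together with every path where it occurs.</p>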
<p>I know this was not as easy as running a dedicated program, but it was easier for me than finding free software for the job.</p>
]]></content:encoded>
			</item>
	</channel>
</rss>
