Managing Files…
Most (if not all) people who use computers know what a File is, which is, in short, a piece of data with name, size, type, etc. A file can be a Word document, a text document, an executable binary, a compressed archive, etc.
Having a lots of files means you own a lot of information. However, it soon becomes problem when the number of files you own grows (uncontrollably). As a student, as well as a computer hobbyist, you may have already guessed that I have this kind of problem as well. What I am aware of is that I produce (and collect) lots of files which have accumulated for several years now. Currently, I have several thousands (or tens of thousands) files, totaling to about more than 200gigabytes in size. That does not include files that have been archived into CDs or DVDs. So, I guess now it’s time for me to think about how to organize those files.
Generally, I think the problems with having too many files are :
- Storage size : what if you run out of space? (is not a problem if you have unlimited storage)
- My current solution : add more drives, archive to DVDs
- Organization : how to organize them in a logical way? (is not a problem if you have an AI agent that knows about all your files and can find and order them for you automatically)
- My current solution : per-category separation, shallow hierarchies (2-3 levels), but still needs a lot of work to maintain (otherwise there will be many "unsorted" files).
- Storage reliability : how big is the risk of losing these files? (is not a problem if your hardware never fails and the filesystem never crashes)
- My current solution : RAID-1 (mirrored storage) - basically store the same data in two identical drives (just in case one fails)
However, the types of files also matters. I think files can be categorized as :
- Other people’s work - easily re-obtainable (Cat 1)
- This kind of file can be easily downloaded from the internet, or something you can retrieve from other medias
- Generally I don’t care to much about the storage of this kind of files, since losing it is not much of a problem. Lately, I have been deleting small downloaded files, because redownloading them when needed is much simpler than organizing the files.
- Other people’s work - not easily re-obtainable (Cat 2)
- This include big files, or files acquired through some occasional events or procedures, or files that is no more available elsewhere.
- Still have to be stored well
- My work (Cat 3)
- I generally save everything I made, directly and indirectly. This include codes, history, logs, reports, documents, homeworks, and of course, photos.
- Losing any of this kind of file means losing it forever, unless manually remade.
Currently out of these 200gigabytes I mentioned, I guess around 100 to 150 gigs are of Cat 3 while the the rest of them are of Cat 2 and Cat 1. So, that means later when I run out of storage, I should delete these Cat 1 files first (or should I do it now? hmm…).
Organizing files have always been needing a lot of work and time. Several years ago, I categorized my files in a deep-hierarchy structure. But that approach require quite a lot of effort everytime I store a file, which caused a lot of new files stays in the unsorted area (because it takes time to organize them to the structure). Lately, I have changed into a more flat-hierarchy kind of structure, and it is quite successful in making me less hesitating to organize my files upon creation. However, these legacy "unsorted" files are still needing a lot of work to be organized (I am thinking of encapsulating all of them into a compressed archive and not trying to organize them anymore, to reduce the backlogging). I had spent several hours (or even days) lately to organize all of those files, but still, there are a lot of work to be done.
I guess I should consider using the "label and search" kind of strategy (with Google Desktop apps maybe?) and forgot about organizing the files altogether (if you can still find the file when needed, why bother categorizing?…)
So, what do you think? Do you have any strategy on organizing your files?
-Kurniady