Unstructured Myth??
I see various figures thrown about as to how much unstructured data is produced, stored etc by people; often it is this data which people believe can be deduped,ILMed, migrated, compressed etc. This is often expressed as a percentage of the total data stored by a company, is it 10%, 15, 20%, 30% + but how much is there?
I suspect very few people really know; for example, nearly all of my unstructured data sits on my laptop hard-drive; a great deal of it is my offline mail folders, I'm a fairly heavy mail-user and probably have 2-3 gigs of PST file generated over three years. Documents, another couple of gigs; so I suspect all-in-all I have less than about 10 gigs of unstructured data and I suspect that I'm way above average in our company. Even if I was average, I'd only be really be looking at 150 terabytes of data and to be honest, that is less than 5% of our total storage estate.
Logs for various systems; should be compressed by default and we know logs generally compress really well. But we've had cron-jobs etc for years which rotate logs, compress them etc. So we need to keep them for longer and longer periods of time but even so we should be able to get that sort of stuff onto the right tier of disk straight away.
So what other unstructured data might people be storing; okay, I work for a media company and we have wodges of the stuff but that's slightly unusual.
I wonder how much unstructured stuff there really is? Any thoughts? And of the unstructured stuff which users are storing, how much really belongs on the corporate storage in first place?
This is purely the anecdotal musings of a storage manager but what are the real figures?