[nylug-talk] Ghetto SAN ideas for Linux, was: SMB 10gigE (for Linux?)

jh jhlists at hirschman.net
Sat Sep 1 13:30:56 EDT 2007


Alex Pilosov wrote:
>
> Sure. 10GE is the new GE. You can buy ghetto 10G switches for ~2-3k$.
> 
Upon further reflection, I'm thinking that a SAN with a shared file 
system is a better option at this point. Prices for DIY SAN look pretty 
inexpensive, especially compared to 10G switches.

The problem being solved, since Alex asked: we deal with data in the 
following hierarchy:

- Volumes - typically DVD sized or higher, say, 2-200 gigs, contains 
documents, typically in the range of thousands to hundreds of thousands.
- Documents, which contain pages
- Pages, which are our most atomic data type.

On the front end of things ("ingestion") the volume is our atomic unit. 
We need to bring in the entire thing into our system from a single client.

On the back end of things ("production") we have a dozen nodes each 
working on individual pages and documents. They're fine on GigE, as the 
actual processing is the bottleneck there.

The issue: if we cannot ingest faster than the combined number of 
workers can do their thing, our production rates stall - we get the 
equivalent of "wait states". And GigE is our bottleneck.

So, it seems to me that a better/cheaper/more mature solution is 
something using 4Gb FC on a SAN, plus a shared file system. That way our 
ingestion box can just drop the stuff into a shared file system, while 
the workers can continue to access it via a gateway server sharing it 
via NFS.

So, my questions:

* Generally, how is compatibility between different vendors these days?
* What 4Gb FC HBAs are best supported under Linux?
* Suggestions for a cheap 4Gb switch?
* Has anyone used something like this Promise Vtrak SATA-to-FC raid box? 
  Opinions? http://tinyurl.com/2h3bcy
* Is anyone using a shared file system - either Linux only (i.e., GFS), 
or something else? Especially under Debian?

And lastly: is there a book or online resource that someone can 
recommend to get more up to speed on SANs and the like?

Thanks!

jh



More information about the nylug-talk mailing list