[nylug-talk] Re :Re: extfs journalling percentage (Chris Knadle)
Chris Knadle
Chris.Knadle at coredump.us
Wed May 28 19:41:22 EDT 2008
On Tuesday 27 May 2008, Michael Bubb wrote:
> > From: Chris Knadle <Chris.Knadle at coredump.us>
...
> >> If I understand correctly the actual journal is not that big. Most of
> >> this 5% is reserved so that a superuser has access to an otherwise
> >> full partition. Do you need 5% for this? (ie if you have a 250G
> >> parttion do you need 13G reserved).
...
> > I'm not 100% positive, but I don't think the ext3 journal counts in
> > that 5% reserved for root-only usage. Mainly I think that this is space
> > set aside for root for having a place to do emergency filesystem repair,
> > like using debugfs or similar utilities.
...
> Thank you Chris.
> Now I am intruiged as to how to use this admin space in an actual
> crash. Guess I will try and google more on the purpose for this.
I've only had to rescue an ext3 partition once. It happened because of an
overloaded circuit breaker due to not monitoring how much power we were
running through a circuit, repeatedly bringing down half an entire server
room, including the local mail server -- which corrupted the /var partition
that was using ext3. In this case any attempt at mounting the /var partition
on the mail server would cause the kernel to hang hard without even a kernel
panic as to why. We assumed that the disk had been in the middle of a write
operation (logs) when the power dropped and thus did something unexpected to
the disk. We were forced to use debugfs to glean what information from the
unmounted /var partition that we could, onto another partition. Hitting the
trouble spot even with debugfs would hang the kernel hard, so we had to be
careful to avoid that spot... once we "found out" where that was... by trial
and error.
We then re-made the ext3 filesystem on the /var partition, mounted it, and
copied the data that we had acquired back.
I believe doing the above required booting into a Red Hat 7.2 rescue
envionment, which by default did not have device names made for the disk that
had the problem, so we had to use mknod to make those before doing the work.
So on every "trial and error" miss we had to re-make these devices as well as
run an fsck on the ext3 partitions that had been mounted when the kernel hung
during the rescue work.
These days with udev the missing devices are likely no longer a problem.
Now, I don't think we used the 5% "admin" space on any of the ext3
parititions in the above case, however if all of the various filesystems were
FULL, then that admin space would have been all we had to work with to rescue
the /var partition. So I can understand why it's needed even if I haven't
actually had to use it. ;-)
-- Chris
--
Chris Knadle
Chris.Knadle at coredump.us
More information about the nylug-talk
mailing list