[nylug-workshop] Log analyzer (Re: Regular meetings of the Python workshop)
Peter C. Norton
spacey-nylug-workshop at lenin.net
Tue Apr 10 16:28:01 EDT 2007
On Tue, Apr 10, 2007 at 03:37:50AM -0400, Yusuke Shinyama wrote:
> Hi,
>
> I put up a newer version of logweeder.
> http://www.unixuser.org/~euske/python/logweeder/
>
> This version is significantly improved in its speed.
> Now it can process thousands of lines per second, and
> it can also load a previously constructed pattern file for further speedup.
>
> The major drawback is that since the algorithm is now a greedy
> way, it might not return optimal results. For example, if the log
> file is like as follows, line 1 and line 100000 are recognized as
> different clusters:
>
> line 1: ypbind[1234]: aaaa
> line 2: ypbind[1234]: bbbb
>
> (cluster for "ypbind[1234]: ..." is formed)
>
> (then ypbind restarted for some reason)
>
> line 100000: ypbind[7890]: aaaa ...
> line 100001: ypbind[7890]: bbbb ...
>
> (another cluster for "ypbind[7890]: ..." is formed)
>
> I haven't yet incorporated Peter's patch because the program
> structure is rather different, althoug I mostly incorporated the
> general idea. (and personally I'm not fond of using configParser.)
Oh, well, I'm not appreciated, fine :)
Seriously, It seems like the problem with this is that there's no
extra weight being given to the logging tag, which could probably be
fixed, I think.
-Peter
--
The 5 year plan:
In five years we'll make up another plan.
Or just re-use this one.
More information about the nylug-workshop
mailing list