thread/socket behvior
Simon Marlow
marlowsd at gmail.com
Fri Oct 10 09:23:31 EDT 2008
Jeff Polakow wrote:
> Don Stewart <dons at galois.com> wrote on 10/09/2008 02:56:02 PM:
>
> > jeff.polakow:
> > > We have a server that accepts messages over a socket, spawning
> threads to
> > > process them. Processing these messages may cause other, outgoing
> > > connections, to be spawned. Under sufficient load, the main
> server loop
> > > (i.e. the call to accept, followed by a forkIO), becomes
> nonresponsive.
> > >
> > > A smaller distilled testcase reveals that when sufficient socket
> activity
> > > is occurring, an incoming connection may not be responded to
> until other
> > > connections have been cleared out of the way, despite the fact
> that these
> > > other connections are being handled by separate threads. One
> issue that
> > > we've been trying to figure out is where this behavior arises
> from-- the
> > > GHC rts, the Network library, the underlying C libraries.
> > >
> > > Have other GHC users doing applications with large amounts of
> > socket usage
> > > observed similar behavior and managed to trace back where it
> originates
> > > from? Are there any particular architectural solutions that
> people have
> > > found to work well for these situations?
> >
> > Hey Jeff,
> >
> > Can you say which GHC you used, and whether you used the threaded
> > runtime or non-threaded runtime?
> >
> Oops, forgot about that...
>
> We used both ghc-6.8.3 and ghc-6.10.rc1 and we used the threaded
> runtime. We are running on a 64 bit linux machine using openSUSE 10.
The scheduler doesn't have a concept of priorities, so the accepting thread
will get the same share of the CPU as the other threads. Another issue is
that the accepting thread has to be woken up by the IO manager thread when
a new connection is available, so we might have to wait for the IO manager
thread to run too. But I wouldn't expect to see overly long delays. Maybe
you could try network-alt which does its own IO multiplexing.
If you have multiple cores, you might want to try fixing the thread
affinity - e.g. put all the worker threads on one core, and the accepting
thread on the other core. You can do this using GHC.Conc.forkOnIO, with
the +RTS -qm -qw options.
Other than that, I'm not sure what to try right now. We're hoping to get
some better profiling for parallel/concurrent programs in the future, but
it's not ready yet.
Cheers,
Simon
More information about the Glasgow-haskell-users
mailing list