SMP kernel hanging at when testing 8254 interrupt delivery

David P. Reese Jr. daver at gomerbud.com
Tue Jul 29 20:56:48 PDT 2003


On Tue, Jul 29, 2003 at 07:59:51PM -0700, Matthew Dillon wrote:
> :A UP kernel boots just fine, however when I try to boot an SMP kernel the
> :boot process hangs in a loop in sys/i386/isa/clock.c.
> :
> :$DragonFly: src/sys/i386/isa/clock.c,v 1.4 2003/07/06 21:23:49 dillon Exp $
> :...
> :#ifdef APIC_IO
> :        if (apic_8254_trial) {
> :                
> :                printf("APIC_IO: Testing 8254 interrupt delivery\n");
> :                while (read_intr_count(8) < 6)      <-- hangs here
> :                        ;       /* nothing */
> :
> :What is most bothersome is that the SMP kernel hoses my CMOS checksum,
> :hinting that this is a result of a larger problem.
> :
> :I discovered this problem with a snapshot from just a couple of days ago.
> :I'll go ahead and try a chronological binary search with the source tree
> :to see where the problem was introduced.  This seems to be a DragonFly
> :issue because an SMP FreeBSD 4.8 kernel boots just fine.
> :
> :Has anyone else running DragonFly on SMP hardware encountered this problem?
> 
>     Yes, Jeffrey Hsu reported the same problem, but hadn't tried to backtrace
>     it.  I sure would appreciate whatever work you are able to do to home
>     in on the problem.

But are there reports of SMP hardware that doesnt have the CMOS problem?

>     Please try commenting out the #define CHECK_POINTS line in
>     i386/i386/mp_machdep.c on line 189.  Maybe that is causing the CMOS
>     issues.

With CHECK_POINTS turned off the kernel hangs at same place and CMOS still
gets hosed.

>     I am going to turn it off anyway, right now, because I did not mean
>     to leave it on.
> 
> :If I am reading the source correctly, the routines in
> :
> :   sys/i386/isa/icu_vector.s
> :
> :increment the interrupt counts in the array intr_countp[vec] , so the above
> :loop waits for 6 interrupts from the rtc.    Is there anything I can examine
> :in the debugger to shed some light on the problem?
> :
> :-- 
> :
> :   David P. Reese Jr.                                      daver at xxxxxxxxxxxx
> 
>     Hmm.  All I can think of is to spatter printf()'s around.  First see
>     if removing CHECK_POINTS solves the CMOS checksum problem, though, because
>     if it does it will make debugging a lot easier.
> 
> 					-Matt
> 					Matthew Dillon 
> 					<dillon at xxxxxxxxxxxxx>

I'm out of town for two days.  When I get back I'll be able to fool with
this a bit more and try to track down the code that borks the CMOS.  

-- 

   David P. Reese Jr.                                      daver at xxxxxxxxxxxx
   --------------------------------------------------------------------------
   It can be argued that returning a NULL pointer when asked to allocate
   zero bytes is a silly response to a silly question.
                                         -- FreeBSD manual page for malloc(3)





More information about the Bugs mailing list