From xen-devel-bounces@lists.xen.org Wed Jan 08 14:24:39 2014 Received: (at maildrop) by bugs.xenproject.org; 8 Jan 2014 14:24:39 +0000 Received: from lists.xen.org ([50.57.142.19]) by bugs.xenproject.org with esmtp (Exim 4.80) (envelope-from ) id 1W0u3j-0008Fr-7q for xen-devel-maildrop-Eithu9ie@bugs.xenproject.org; Wed, 08 Jan 2014 14:24:39 +0000 Received: from localhost ([127.0.0.1] helo=lists.xen.org) by lists.xen.org with esmtp (Exim 4.72) (envelope-from ) id 1W0tzA-0001MH-O6; Wed, 08 Jan 2014 14:19:56 +0000 Received: from mail6.bemta5.messagelabs.com ([195.245.231.135]) by lists.xen.org with esmtp (Exim 4.72) (envelope-from ) id 1W0tz9-0001LF-0X for xen-devel@lists.xen.org; Wed, 08 Jan 2014 14:19:55 +0000 Received: from [85.158.139.211:61203] by server-10.bemta-5.messagelabs.com id 33/0D-01405-A8E5DC25; Wed, 08 Jan 2014 14:19:54 +0000 X-Env-Sender: david.vrabel@citrix.com X-Msg-Ref: server-14.tower-206.messagelabs.com!1389190792!8530161!1 X-Originating-IP: [66.165.176.63] X-SpamReason: No, hits=0.0 required=7.0 tests=sa_preprocessor: VHJ1c3RlZCBJUDogNjYuMTY1LjE3Ni42MyA9PiAzMDYwNDg=\n X-StarScan-Received: X-StarScan-Version: 6.9.16; banners=-,-,- X-VirusChecked: Checked Received: (qmail 20784 invoked from network); 8 Jan 2014 14:19:53 -0000 Received: from smtp02.citrix.com (HELO SMTP02.CITRIX.COM) (66.165.176.63) by server-14.tower-206.messagelabs.com with RC4-SHA encrypted SMTP; 8 Jan 2014 14:19:53 -0000 X-IronPort-AV: E=Sophos;i="4.95,624,1384300800"; d="scan'208";a="88724344" Received: from accessns.citrite.net (HELO FTLPEX01CL01.citrite.net) ([10.9.154.239]) by FTLPIPO02.CITRIX.COM with ESMTP; 08 Jan 2014 14:19:39 +0000 Received: from [10.80.2.76] (10.80.2.76) by FTLPEX01CL01.citrite.net (10.13.107.78) with Microsoft SMTP Server id 14.2.342.4; Wed, 8 Jan 2014 09:19:38 -0500 Message-ID: <52CD5E79.9000008@citrix.com> Date: Wed, 8 Jan 2014 14:19:37 +0000 From: David Vrabel User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.16) Gecko/20121215 Iceowl/1.0b1 Icedove/3.0.11 MIME-Version: 1.0 To: Ian Jackson References: <21196.19900.136146.867552@mariner.uk.xensource.com> In-Reply-To: <21196.19900.136146.867552@mariner.uk.xensource.com> X-Originating-IP: [10.80.2.76] X-DLP: MIA2 Cc: Boris Ostrovsky , xen-devel@lists.xen.org, Ian Campbell Subject: Re: [Xen-devel] 3.4.70+ kernel WARNING spew dysfunction on failed migration X-BeenThere: xen-devel@lists.xen.org X-Mailman-Version: 2.1.13 Precedence: list List-Id: Xen developer discussion List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org On 07/01/14 18:55, Ian Jackson wrote: > I did the following test: > > mv /etc/xen/scripts/block /etc/xen/scripts/block.aside > xl migrate debian.guest.osstest localhost > > xl did what appears to be the right thing: it did most of the > migration, failed to run the block scripts at the end of the > migration, and destroyed the destination domain and instead resumed > the source guest. > > However, the source guest immediately went mad spewing WARNINGs and > was after that no longer contactable via the network and not > apparently responsive on the console. See below. > > This is with: > > [ 0.000000] Linux version 3.4.70+ (osstest@rice-weevil) (gcc > version 4.4.5 (Debian 4.4.5-8) ) #1 SMP Wed Dec 4 03:14:51 GMT 2013 > > For reasons I don't understand it doesn't seem to print the actual > kernel git hash in dmesg, but I think it was that from flight 22264, > i.e. 234d96ee0f3b8e49501d068a2a3165aa4db60903. It's i386, on a > 64-bit Xen. > > Thanks, > Ian. > > debian login: [ 124.595658] PM: freeze of devices complete after 2.980 msecs > [ 124.595991] PM: late freeze of devices complete after 0.013 msecs > [ 124.600919] PM: noirq freeze of devices complete after 4.884 msecs > [ 124.601105] Grant tables using version 2 layout. > [ 124.601105] ------------[ cut here ]------------ > [ 124.601105] kernel BUG at drivers/xen/events.c:1582! > [ 124.601105] invalid opcode: 0000 [#1] SMP > [ 124.601105] Modules linked in: [last unloaded: scsi_wait_scan] > [ 124.601105] > [ 124.601105] Pid: 6, comm: migration/0 Not tainted 3.4.70+ #1 > [ 124.601105] EIP: 0061:[] EFLAGS: 00010082 CPU: 0 > [ 124.601105] EIP is at xen_irq_resume+0x215/0x370 We shouldn't be calling xen_irq_resume() when resuming the source VM. The EVTCHNOP_bind_irq is failing because the VIRQ is still bound. This would suggest that the suspend hypercall has not correctly returned the cancelled state. Could this be because of the tools issue mentioned by Ian C? David _______________________________________________ Xen-devel mailing list Xen-devel@lists.xen.org http://lists.xen.org/xen-devel