From xen-devel-bounces@lists.xen.org Wed Jan 08 14:29:26 2014 Received: (at maildrop) by bugs.xenproject.org; 8 Jan 2014 14:29:26 +0000 Received: from lists.xen.org ([50.57.142.19]) by bugs.xenproject.org with esmtp (Exim 4.80) (envelope-from ) id 1W0u8M-0008JD-7c for xen-devel-maildrop-Eithu9ie@bugs.xenproject.org; Wed, 08 Jan 2014 14:29:26 +0000 Received: from localhost ([127.0.0.1] helo=lists.xen.org) by lists.xen.org with esmtp (Exim 4.72) (envelope-from ) id 1W0u43-00025V-RH; Wed, 08 Jan 2014 14:24:59 +0000 Received: from mail6.bemta5.messagelabs.com ([195.245.231.135]) by lists.xen.org with esmtp (Exim 4.72) (envelope-from ) id 1W0u42-000252-HE for xen-devel@lists.xen.org; Wed, 08 Jan 2014 14:24:58 +0000 Received: from [85.158.139.211:34557] by server-14.bemta-5.messagelabs.com id F2/F7-24200-9BF5DC25; Wed, 08 Jan 2014 14:24:57 +0000 X-Env-Sender: Ian.Campbell@citrix.com X-Msg-Ref: server-10.tower-206.messagelabs.com!1389191094!8567277!1 X-Originating-IP: [66.165.176.63] X-SpamReason: No, hits=0.0 required=7.0 tests=sa_preprocessor: VHJ1c3RlZCBJUDogNjYuMTY1LjE3Ni42MyA9PiAzMDYwNDg=\n X-StarScan-Received: X-StarScan-Version: 6.9.16; banners=-,-,- X-VirusChecked: Checked Received: (qmail 23856 invoked from network); 8 Jan 2014 14:24:56 -0000 Received: from smtp02.citrix.com (HELO SMTP02.CITRIX.COM) (66.165.176.63) by server-10.tower-206.messagelabs.com with RC4-SHA encrypted SMTP; 8 Jan 2014 14:24:56 -0000 X-IronPort-AV: E=Sophos;i="4.95,624,1384300800"; d="scan'208";a="88726237" Received: from accessns.citrite.net (HELO FTLPEX01CL03.citrite.net) ([10.9.154.239]) by FTLPIPO02.CITRIX.COM with ESMTP; 08 Jan 2014 14:24:30 +0000 Received: from [10.80.2.80] (10.80.2.80) by FTLPEX01CL03.citrite.net (10.13.107.80) with Microsoft SMTP Server id 14.2.342.4; Wed, 8 Jan 2014 09:24:29 -0500 Message-ID: <1389191068.4883.86.camel@kazak.uk.xensource.com> From: Ian Campbell To: David Vrabel Date: Wed, 8 Jan 2014 14:24:28 +0000 In-Reply-To: <52CD5E79.9000008@citrix.com> References: <21196.19900.136146.867552@mariner.uk.xensource.com> <52CD5E79.9000008@citrix.com> Organization: Citrix Systems, Inc. X-Mailer: Evolution 3.4.4-3 MIME-Version: 1.0 X-Originating-IP: [10.80.2.80] X-DLP: MIA1 Cc: Boris Ostrovsky , xen-devel@lists.xen.org, Ian Jackson Subject: Re: [Xen-devel] 3.4.70+ kernel WARNING spew dysfunction on failed migration X-BeenThere: xen-devel@lists.xen.org X-Mailman-Version: 2.1.13 Precedence: list List-Id: Xen developer discussion List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org On Wed, 2014-01-08 at 14:19 +0000, David Vrabel wrote: > On 07/01/14 18:55, Ian Jackson wrote: > > I did the following test: > > > > mv /etc/xen/scripts/block /etc/xen/scripts/block.aside > > xl migrate debian.guest.osstest localhost > > > > xl did what appears to be the right thing: it did most of the > > migration, failed to run the block scripts at the end of the > > migration, and destroyed the destination domain and instead resumed > > the source guest. > > > > However, the source guest immediately went mad spewing WARNINGs and > > was after that no longer contactable via the network and not > > apparently responsive on the console. See below. > > > > This is with: > > > > [ 0.000000] Linux version 3.4.70+ (osstest@rice-weevil) (gcc > > version 4.4.5 (Debian 4.4.5-8) ) #1 SMP Wed Dec 4 03:14:51 GMT 2013 > > > > For reasons I don't understand it doesn't seem to print the actual > > kernel git hash in dmesg, but I think it was that from flight 22264, > > i.e. 234d96ee0f3b8e49501d068a2a3165aa4db60903. It's i386, on a > > 64-bit Xen. > > > > Thanks, > > Ian. > > > > debian login: [ 124.595658] PM: freeze of devices complete after 2.980 msecs > > [ 124.595991] PM: late freeze of devices complete after 0.013 msecs > > [ 124.600919] PM: noirq freeze of devices complete after 4.884 msecs > > [ 124.601105] Grant tables using version 2 layout. > > [ 124.601105] ------------[ cut here ]------------ > > [ 124.601105] kernel BUG at drivers/xen/events.c:1582! > > [ 124.601105] invalid opcode: 0000 [#1] SMP > > [ 124.601105] Modules linked in: [last unloaded: scsi_wait_scan] > > [ 124.601105] > > [ 124.601105] Pid: 6, comm: migration/0 Not tainted 3.4.70+ #1 > > [ 124.601105] EIP: 0061:[] EFLAGS: 00010082 CPU: 0 > > [ 124.601105] EIP is at xen_irq_resume+0x215/0x370 > > We shouldn't be calling xen_irq_resume() when resuming the source VM. > The EVTCHNOP_bind_irq is failing because the VIRQ is still bound. > > This would suggest that the suspend hypercall has not correctly returned > the cancelled state. > > Could this be because of the tools issue mentioned by Ian C? I'm fairly confident that it is, yes. (well "this" is actually, toolstack failed to implement the old style resume but told the guest it had, but not returning cancel...) Ian. _______________________________________________ Xen-devel mailing list Xen-devel@lists.xen.org http://lists.xen.org/xen-devel