[7249] in Release_7.7_team
RE: 3 machines in macgregor need re-installs [help.mit.edu #1408773]
daemon@ATHENA.MIT.EDU (Stuart Peloquin)
Tue Feb  1 13:31:39 2011
From: Stuart Peloquin <peloquin@MIT.EDU>
To: Jonathon Weiss <jweiss@mit.edu>
CC: Jonathon Weiss <jweiss@mit.edu>, Jonathan D Reed <jdreed@mit.edu>,
   "release-team@mit.edu" <release-team@mit.edu>
Date: Tue, 1 Feb 2011 13:31:28 -0500
Message-ID: <F5347749EB2E3D458E1354F21527052C117BE792BC@EXPO19.exchange.mit.edu>
In-Reply-To: <201101271821.p0RIL9sq029831@outgoing.mit.edu>
Content-Language: en-US
Content-Type: text/plain; charset="us-ascii"
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Latest update:
These machines, despite the fix put in place for W61 DHCP, will still not boot over PXE.  This is true for the PXE built into the BIOS and when using a USB key with gpxe.  I tried doing a packet capture to understand what might be going on, but it looks like the switch port shut down when I tried to connect more than one machine (to capture the DHCP stream with Wireshark).  I will ask Network about the DHCP issue again.
In the meantime, I've asked hotline to pick up the machines and bring them to N42 for us to reinstall there.
-Stuart
--
Stuart Peloquin
Extended Infrastructure Support Team Lead
MIT Computing Help Desk
Information Services and Technology
peloquin@mit.edu
617 324 6557
-----Original Message-----
From: Jonathon Weiss [mailto:jweiss@MIT.EDU] 
Sent: Thursday, January 27, 2011 1:21 PM
To: Stuart Peloquin
Cc: Jonathon Weiss; Jonathan D Reed; release-team@mit.edu
Subject: Re: 3 machines in macgregor need re-installs [help.mit.edu #1408773]
At the very least quickstation-macgregor still reports as running jaunty.  Other problems are currnetly a little difficult to differentiate from yesterday's update disaster.
	Jonathon
Stuart Peloquin <peloquin@MIT.EDU> wrote:
> Hi Jonathan,
> 
> Yes.  I believe all but Macgregor-5 should be returned to service.  I visited the other day to bring Macgregor-5 online, but PXE still doesn't work on the machine.  I will be visiting again with a hub and wireshark to figure out what's going on.
> 
> -Stuart
> 
> -----Original Message-----
> From: Jonathon Weiss [mailto:jweiss@MIT.EDU]=20
> Sent: Thursday, January 27, 2011 12:38 PM
> To: Stuart Peloquin
> Cc: Jonathon Weiss; Jonathan D Reed; release-team@mit.edu
> Subject: Re: 3 machines in macgregor need re-installs [help.mit.edu 
> #1408773]
> 
> 
> Hi Stuart,
> 
> As Reg Day approaches, do we have an update on this?
> 
> --=20
> 
> 	Jonathon
> 
> 
> 
> > Hi all,
> >=20
> > There was  a problem with DHCP in the building that the network team identified and reported as fixed late in December.  I haven't had the opportunity to visit W61 to test PXE and reinstall any machines nor have the RCCs.   Will make it a priority as student consultants return to campus.
> >=20
> > -Stuart
> >=20
> > -----Original Message-----
> > From: Jonathon Weiss [mailto:jweiss@MIT.EDU]=3D20
> > Sent: Monday, January 03, 2011 3:51 PM
> > To: Jonathan D Reed
> > Cc: Jonathon Weiss; Stuart Peloquin; release-team@mit.edu
> > Subject: Re: 3 machines in macgregor need re-installs 
> >[help.mit.edu=20  #1408773]
> >=20
> >=20
> > Any progress on this?
> >=20
> > 	Jonathon
> >=20
> >=20
> >=20
> > Jonathan Reed <jdreed@MIT.EDU> wrote:
> >=20
> > > After chatting with Stuart, we've determined there are some 
> > >network=20
> > >=3D3D=3D20  problems in MacGregor preventing the PXE installer 
> > >from=20 working. =3D20  Stuart =3D3D is going to follow up with the 
> > >RCCs to get=20 some more=3D20  debugging =3D3D information and then we'll escalate to NIST.
> > >=3D20
> > > -Jon
> > >=3D20
> > > On Dec 6, 2010, at 11:50 AM, Jonathon Weiss wrote:
> > >=3D20
> > > >=3D3D20
> > > > Thomas,
> > > >=3D3D20
> > > > Unfortunately, running the previous version of ubuntu can no=20
> > > >longer=3D20 be  considered "running fine" as it has been 
> > > >desupported=20 by the=3D20 vendor, =3D3D
> > > and
> > > > all cluster workstations should have automatically updated to 
> > > >the=20
> > > >=3D20 current version.  That said, I believe we are waiting 
> > > >from=20 someone on =3D20 release-team to investigate the issue of 
> > > >why you are=20 unable to =3D20 re-install these machines, and 
> > > >there isn't anything=20 to be done until =3D20 that happens.
> > > >=3D3D20
> > > > Thanks for keeping this issue active,
> > > >=3D3D20
> > > > 	Jonathon
> > > >=3D3D20
> > > > 	Jonathon Weiss <jweiss@mit.edu>
> > > > 	MIT/IS&T/OIS  Server Operations
> > > >=3D3D20
> > > >=3D3D20
> > > >> Hi Jonatahon,
> > > >>=3D3D20
> > > >> Here is the lateast.
> > > >> There are two machines;MacGregor-5 and 
> > > >>MacGregor-Quickstation=20
> > > >>that=3D20 =3D3D
> > > are running fine with the previous Athena version of ubuntu.
> > > >> The MacGregor-2 is still getting the error message I 
> > > >> mentioned=20 =3D3D
> > > earlier.
> > > >>=3D3D20
> > > >> Best,
> > > >> Thomas
> > > >> Thomas J. Smith
> > > >> IT Deployment & Maintenance Services =20
> > > >>Installation/Spaces/Athena=3D20 Clusters/IT DMS 
> > > >>(acis-team@mit.edu) =20 Massachusetts Institute of=3D20 
> > > >>Technology
> > > >> Tel: 617-253-7433
> > > >> Email: tjsmith@mit.edu
> > > >>=3D3D20
> > > >>=3D3D20
> > > >>=3D3D20
> > > >>=3D3D20
> > > >>=3D3D20
> > > >> On Mon Nov 22 15:36:55 2010, tjsmith wrote:
> > > >>> Hi Jonathon,
> > > >>>=3D3D20
> > > >>> We tried all three machines.
> > > >>>=3D3D20
> > > >>> When using the reinstall cd we have we got the following 
> > > >>>error=20
> > > >>>=3D20 message.
> > > >>>=3D3D20
> > > >>> Loading gpxe.krn....ready
> > > >>>=3D3D20
> > > >>>=3D3D20
> > > >>> Without the reinstall cd,and choosing the onboard network=20
> > > >>>devices=3D20 we  got this.
> > > >>>=3D3D20
> > > >>> PXE-MOF:Exiting broadcom PXE ROM
> > > >>>=3D3D20
> > > >>> Best,
> > > >>> Thomas J. Smith
> > > >>> IT Deployment & Maintenance Services =20
> > > >>>Installation/Spaces/Athena=3D20 Clusters/IT DMS 
> > > >>>(acis-team@mit.edu) =20 Massachusetts Institute of=3D20 
> > > >>>Technology
> > > >>> Tel: 617-253-7433
> > > >>> Email: tjsmith@mit.edu
> > > >>>=3D3D20
> > > >>>=3D3D20
> > > >>>=3D3D20
> > > >>>=3D3D20
> > > >>>=3D3D20
> > > >>>=3D3D20
> > > >>> On Fri Nov 19 16:05:05 2010, jweiss wrote:
> > > >>>>=3D3D20
> > > >>>> Please re-install the following 3 machines in macgregor.  
> > > >>>>I=20 know
> > > >>> there
> > > >>>> have been problems with installs here in the past.  If 
> > > >>>> there=20 are
> > > >>> still
> > > >>>> please send a detailed copy of the errors (including a 
> > > >>>> photo=20
> > > >>>> of=3D20 the screen if helpful) to release-team.  Please note 
> > > >>>> that=20 these =3D3D
> > > machines
> > > >>>> do need re-installs, even if they currently look like they're
> > > >>> working.
> > > >>>>=3D3D20
> > > >>>> macgregor-2
> > > >>>> macgregor-5
> > > >>>> quickstation-macgregor
> > > >>>>=3D3D20
> > > >>>>=3D3D20
> > > >>>> 	Jonathon
> > > >>>=3D3D20
> > > >>>=3D3D20
> > > >>=3D3D20
> > > >>=3D3D20
> > > >>=3D3D20
> > >=3D20