14:02:16 #startmeeting oVirt Infra 14:02:16 Meeting started Mon Oct 21 14:02:16 2013 UTC. The chair is knesenko. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:02:16 Useful Commands: #action #agreed #help #info #idea #link #topic. 14:02:27 #chair obasan orc_orc dcaro_ eedri_ 14:02:27 Current chairs: dcaro_ eedri_ knesenko obasan orc_orc 14:02:34 o7 14:03:14 #chair Rydekull 14:03:14 Current chairs: Rydekull dcaro_ eedri_ knesenko obasan orc_orc 14:03:27 #topic Hosting 14:03:27 http://www.ovirt.org/Infrastructure_team_meetings should be updated more often btw 14:03:43 ewoud: agree 14:03:51 so lets start 14:03:55 small update 14:04:03 regarding reackspace servers 14:04:19 Itamar replied to my migration email ... did you saw it ? 14:04:35 there are some issues with gluster sotrage DCs ... migration libvirt etc. 14:05:15 are there :/ 14:05:15 how much of a blocker is it going to be for us? 14:05:17 * clarkee avoids it 14:05:23 so currently we can run slaves without a migration feature , right ? 14:05:30 no migration 14:05:33 +- 14:05:39 for now 14:06:00 it's only live migration, which we don't have now either 14:06:00 but you will be able to poweroff a VM and restart it on another host 14:06:05 ugly, but will work 14:06:11 right 14:06:16 and snapshots as well 14:06:19 as I have not attempted a migration, I do not hold an opinion 14:06:25 we don't use snapshots for jenkins slaves 14:07:00 that's all I think 14:07:11 so what's the planning now? 14:07:21 in terms of time 14:07:43 somehow rackspace03 was installed in a different LAN 14:08:02 so I am waiting for Red Hat guys to approve moving rackspace03 to 01,02 network 14:08:10 so it will work faster for us 14:08:24 after that I will continue on a migration 14:08:45 ok 14:09:02 ok 14:09:15 other issues on hosting ? 14:10:04 #chair dcaro 14:10:04 Current chairs: Rydekull dcaro dcaro_ eedri_ knesenko obasan orc_orc 14:10:10 don't think so 14:10:13 knesenko: perhaps slow pings to jenkins that I mentioned last Friday? 14:10:45 dcaro: where do we run our jenkins ? 14:10:51 orc_orc: I haven't caught that, but I should mention that our (very limited) icinga instance also runs at alterway so that might explain it 14:10:56 dcaro: its a vm or phys. host ? 14:11:13 jenkins is a physical machine now 14:11:14 ewoud: http://gallery.herrold.com/nagios-jenkins.ovirt.org-slow.png 14:11:27 my nagios consistently is taking over 100 mSec 14:11:39 orc_orc: is that all the time or just once? 14:11:45 knesenko: I uploaded a first version of the patch for the net cleanup 14:11:49 ewoud: http://gallery.herrold.com/nagios-jenkins.ovirt.org-slow.png 14:11:53 i'll test it later 14:11:53 pretty regularly 14:11:57 apuimedo: put on review please 14:12:02 done 14:12:02 orc_orc: and where is your nagios physically located because I get a consistent 18 ms here 14:12:07 * eedri here 14:12:08 apuimedo: thanks ! 14:12:12 #chair eedri 14:12:12 Current chairs: Rydekull dcaro dcaro_ eedri eedri_ knesenko obasan orc_orc 14:12:21 knesenko, topic? 14:12:25 hosting 14:12:27 in a multi-homed DC used ly local govt, between Chicago and atlanta 14:12:48 orc_orc: jenkins is located in .fr so just the distance can explain that latency 14:13:09 ewoud: I will dial up the acceptible latency 14:13:24 That's quite exactly an acceptable latency for that distance 14:13:42 knesenko: did you fix the python-nose yum issue? 14:13:51 apuimedo: yes ... should be ok now 14:14:01 apuimedo: please let me know if there are some issues 14:14:06 ok guys next ? 14:14:07 apuimedo, which should ensure latest python-nose with puppet then 14:14:16 apuimedo, since it will fail on other slaves 14:14:19 eedri: latest comes from pi[p 14:14:36 knesenko, i know, i assume we can use pip upgrade --latest? 14:14:41 eedri: knesenko: well, on el6 what I usually do is to have python-nose installed from yum 14:14:43 #topic Foreman puppet 14:14:44 knesenko, with puppet exec {} or similar 14:14:46 and overwrite it with pip 14:14:55 (so I get the upstream nose version) 14:15:06 eedri: I think you can use provider => pip on package 14:15:09 I think ewoud and dcaro can help us how to upgrade nose with pip via puppet 14:15:19 probably 14:15:24 ewoud: dcaro news ? 14:15:39 the alternative would be to rebuild upstream src rpm for el6, I guess 14:16:02 knesenko: orc_orc kindly reminded us that our documentation is lacking, so I hope to fix that 14:16:32 ewoud: which documentation ? 14:16:34 knesenko: other than that we should get some testing infrastructure going because again a patch introduces a duplicate package 14:16:37 ewoud: I am working thru documentation in the wiki, so placing it there would caseu me to 'tweak' it ;) 14:16:38 knesenko: exactly ;) 14:16:45 ewoud: haha 14:16:45 cause* 14:16:47 ewoud: +1 14:17:11 btw I updated some Infra info 14:17:12 http://www.ovirt.org/Community 14:17:19 added link to the infra page 14:17:30 orc_orc: if you've written something, please ping me on IRC or per mail and I'll gladly review it 14:17:36 also added some new content here - http://www.ovirt.org/Infrastructure 14:17:41 ewoud: will do 14:18:05 ewoud: added the mapping by your request - http://www.ovirt.org/Infrastructure_oVirt_Instances 14:18:22 ewoud: need to map alterway servers as well 14:18:33 ewoud: dcaro what avout rk10 ? 14:19:05 knesenko: about the instances, I'd prefer it if we could somehow get that live from foreman eventually 14:19:45 ewoud: hm ... 14:19:46 knesenko: r10k could use a review, and a finishing touch, but mostly a review 14:19:52 sorry, I'm back 14:20:30 ewoud, +1 14:20:39 knesenko: https://foreman.ovirt.org/hosts should in theory contain all the hosts we have 14:20:40 ewoud, creating inventory from foreman 14:21:15 if we upgrade to foreman 1.3, we can link back VMs to the compute resources 14:21:36 so you get an easy way to see if it's physical or virtual 14:21:43 and if virtual, also a console + power mgmt 14:22:03 ewoud: +1 14:22:48 k 14:22:52 but placing it all in foreman, means it's less open and people like orc_orc will have a harder time than needed 14:23:05 foreman++ 14:23:24 so maybe we can limit access with read only accounts at first? 14:23:39 ohadlevy: no suprise you're a fan :) 14:23:50 I"d like to have something similar to https://apps.fedoraproject.org/ 14:23:55 ohadlevy: xd 14:24:02 I have a ticket on it ... and its 99% ready 14:24:09 I will clone any needed infra backend -- simply having a separate private puppet repo with keying is enough for me to be able to replicate the rest 14:24:31 ewoud: maybe we can use the api to generate a little html page with the data 14:24:38 dcaro: I was thinking the same thing 14:24:40 ... or puppet pulling keying fro a private git instance ... 14:24:40 orc_orc: Error: ".." is not a valid command. 14:24:46 ... or puppet pulling keying fro a private git instance ... 14:25:26 orc_orc: since not all is in puppet yet, it may not be complete 14:25:32 * ewoud will brb 14:25:50 ewoud: * nod * 14:27:07 dcaro: can you review r10k patch ? 14:27:21 knesenko, we also have other puppet related tasks on trac 14:27:22 back 14:27:39 dcaro: http://gerrit.ovirt.org/19141 that is 14:28:03 orc_orc: btw, currently we're lacking in our monitoring so any help to improve that is welcome 14:28:42 ewoud: I can expose my nagios if wanted ... presently I just ahve it emailing me 14:28:57 I run this for the LSB effort anyways 14:29:49 obasan: we have monitoring.ovirt.org right ? 14:29:51 orc_orc: could be helpful to get us started on monitoring.ovirt.org 14:29:59 knesenko, indeed. 14:30:02 knesenko: yes, but it's only monitoring a small part of our infra 14:30:02 also, in setting up the nagios, I did portmapping of the targets to see whta to wathc, and was somewhat surprised at listening ports 14:30:14 what* watch* 14:30:16 orc_orc: sync with obasan and see how can you improve it 14:30:23 knesenko: will do 14:30:25 I'd like to use puppet exported resources to build the nagios config 14:30:27 orc_orc, +1 14:30:42 #action orc_orc sync with obasan to improve monitoring.ovirt.org 14:30:56 something else on puppet foreman ? 14:31:32 I'm going to prepare a 1.3 upgrade of foreman.ovirt.org 14:31:39 ewoud: +1 14:31:49 ewoud, +1 14:31:57 but I think I'm going to upgrade $company foreman first to see how well it goes 14:32:02 ewoud, +1 14:32:05 got a bit more testing infra there 14:32:12 ewoud, we're still running 1.1 on $company :( 14:32:26 eedri: the upgrade is not hard at all 14:32:43 ewoud, we had some issues to upgrade to 1.2.1 on other teams 14:32:52 ewoud, so we're doing it carefullly (side by side) 14:33:45 I'm also preparing a blog series on $company blog on how we're managing foreman there, so when that's ready I'll send you a link as well 14:33:49 knesenko: I'll try+ 14:34:30 other that, I don't think there's anything new on puppet/foreman 14:34:42 ewoud, there is 14:34:51 ewoud, bare metal Power mgmt 14:35:22 eedri: that's possible, but I haven't used that yet 14:35:22 ewoud, as i understood, it's supported from newer 1.2.1 version via api 14:36:07 eedri: we could at least start by setting up ipmi for that 14:36:39 ewoud, yea, anyhow, we dont really need it, since most of our usage is vms 14:36:41 ewoud, eedri I think that one of the 1.3 features is a better upgrade path 14:38:01 obasan, +1 14:38:20 so anything else? 14:38:33 there was mention of running out of space on one unit 14:38:43 I run this, which RHEL has dropped long ago: ftp://ftp.owlriver.com/pub/mirror/ORC/diskcheck/ 14:38:50 which can be tuned to email alerts 14:39:25 #topic Jenkins 14:39:43 eedri: updates ? 14:40:33 knesenko, yes 14:40:42 knesenko, there are some new jobs 14:40:58 knesenko, running per patch on engine 3.2 & 3.3 - create + upgrade db 14:41:30 knesenko, we still facing issues with vdsm-python-copen conflict with vdsm-copen pkg 14:41:41 knesenko, maybe ybronhei or danken can elaborate on it 14:41:57 knesenko, afaik vdsm should not build vdsm-copen-python pkg anymore 14:42:39 knesenko: I can 14:42:58 eedri: did you uninstall vdsm-python-cpopen and install python-cpopen + vdsm? 14:43:17 knesenko: if you used the same slave for ovirt-3.3 and master you must remove vdsm-python-cpopen first (or python-cpopen if you switch from master to ovirt-3.3) 14:43:36 eedri: ^^ 14:43:39 ybronhei, the problem is that we run make rpm 14:43:48 eedri: knesenko: sorry about that, this until i'll update python-cpopen spec 14:43:50 ybronhei, and make rpm builds vdsm-python-copen and it shouldn't build it anymore 14:43:54 ybronhei: the act of removing vdsm-python-cpopen breaks another dependency 14:44:02 eedri: no.. thats not the problem 14:44:11 orc_orc: what do you mean? 14:44:22 ybronhei, or you need to update spec file to Obselete it 14:44:22 ybronhei: I posted about it last week -- looking 14:44:37 orc_orc: I recall you said it requires also remove of vdsm rpm 14:44:53 orc_orc: that's alright for now ... anyhow we have issues with the upgrade :P 14:45:17 eedri: I know, but its on python-cpopen spec file .. so it'll take a bit and it doesn't relate to the build 14:45:54 eedri: the upgrade issue strongly relates to the build, so this I want to fix first 14:45:55 ybronhei, oh, you build that pkg as well? 14:46:33 ybronhei, ok 14:46:47 ybronhei, so job should avoid installing vdsm-python-copen now 14:46:50 ybronhei, vdsm won't require it>? 14:49:06 unrelated, dcaro why do you have 3 accounts in gerrit? 14:49:19 ewoud: I do? 14:49:59 dcaro: if I try to add you as reviewer, I get 3 options, 2 @redhat.com and 1 @gmail.com 14:50:16 ewoud, he cloned himself 14:50:26 ewoud, so he can review 3 patches in parallel 14:50:30 ewoud: I have registered more than one email 14:50:32 heh 14:50:32 eedri: if it gives you more time, how do I do that? 14:50:34 xd 14:50:51 ewoud, you need a clonning machine 14:50:52 dcaro: but I can't select your @redhat.com 14:51:27 ewoud: that's strange 14:51:47 dcaro: I sent you email about that 14:53:07 knesenko: really? did not read it :/ I'll take a look 14:53:21 dcaro: k 14:53:29 anything else on jenkins ? 14:53:46 knesenko, we should verify all our jobs run on maste and 3.3 14:53:57 knesenko, so we won't miss regression like we had on 3.3.1 14:54:05 knesenko: found it 14:54:09 (the email) 14:54:16 knesenko, also, we plan to add new upgrade jobs from stable rpms to nightly rpms 14:54:25 eedri: but job will have too because it depends if you run ovirt-3.3 or master 14:54:28 knesenko: I created http://gerrit.ovirt.org/20366 to fix http://gerrit.ovirt.org/20319 14:54:45 ewoud: saw it ... dup pkg 14:54:57 ewoud: thanks 14:55:09 and http://gerrit.ovirt.org/20367 so we can automatically verify that it at least compiles 14:56:25 ewoud: +1 14:56:33 knesenko, also 14:56:40 knesenko, obasan upgraded jenkins to latest LTS 14:56:47 obasan, any issues with that? 14:56:55 obasan, any plugins were upadted? 14:58:30 eedri: seems like there are no issues 14:58:37 ok guys we are out of time 14:58:45 anything else before we finish ? 14:59:46 thank you all 14:59:51 #endmeeting