14:01:46 #startmeeting 14:01:46 Meeting started Thu Nov 17 14:01:46 2011 UTC. The chair is mburns. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:01:46 Useful Commands: #action #agreed #help #info #idea #link #topic. 14:02:20 ok, so who's here for the node meeting? 14:02:27 here 14:03:08 here 14:03:08 going to be a quiet meeting... 14:03:12 here 14:03:25 #info attendees: mburns mestery jboggs pmyers 14:03:41 ok, quick agenda 14:03:51 1. plugins design 14:03:55 2. stateless design 14:04:02 3. bug review 14:04:06 any other topics? 14:04:15 4. release schedule for sub-project and nomination of release manager 14:04:32 ack 14:05:11 5. questions and/or issues 14:05:24 #topic Plugins 14:05:47 #info http://ovirt.org/wiki/Node_plugins 14:05:56 has everyone read and is familiar with this page? 14:06:05 if not, take a sec to quickly read it :) 14:06:11 * mburns rereads to refresh memory 14:06:52 ok question 14:06:58 "Third party plugins need to be upgradable." 14:07:07 if the intent is to allow injection of plugins into an offline ISO 14:07:14 what does it mean for allowing plugins to be upgradeable? 14:07:35 and then follow up on that 14:07:37 "Third party plugins should be able to hot-add (e.g. not require reboot of the node)." 14:07:41 pmyers: Per our discussion yesterday, if we want to focus on stateless, than upgradeable means simply creating a new PXE image with the new 3rd party plugin 14:07:47 ack 14:07:50 stateful node installed with plugin v1 14:08:05 then new image with plugin v2 should not break stateful node 14:08:07 Is the plan to focus primarly on stateless? 14:08:08 that's my understanding 14:08:12 mburns: ack, let' 14:08:25 let's make that clearer by outlining use stories for how this function will be used 14:08:32 for both stateful (to start with) and eventually stateless nodes 14:08:36 any volunteers to update the wiki? 14:09:03 i'll see if I can make some things clearer 14:09:17 mestery: plan is to focus on getting to stateless 14:09:29 but right now we're not there, so initial plugin integration may be on stateful node 14:09:38 pmyers: OK, understood. 14:09:39 #action pmyers update wiki with use cases for how upgrades will work 14:09:40 however, we can treat the plugins themselves as nominally stateless 14:10:13 Regarding hot-add: I was thinking adding a plugin to a running node should not require a reboot. Makes sense? 14:10:15 mestery: I think this will get us equivalent functionality with what ESXi provides, right? 14:10:39 pmyers: Yes, this sounds very similar to ESXi. 14:10:57 #info hot-add would give us equivalent to ESXi functionality 14:11:13 wait 14:11:23 hot-add sounds like a v2 feature to me 14:11:29 (hot-added modules needs to reside in the ram, right?) 14:11:30 i thought esxi only allowed adding oplugins to off line images 14:11:44 i'm not sure that we SHOULD allow RUNNING nodes to have plugins added 14:11:53 No, you can add them to running images as well, although you have to put hte host in maintenance mode (no VMs running) 14:12:05 ok, that's something we may consider for the future 14:12:14 Yes, sounds good. 14:12:20 but the technical challenges associated with dealing with a stateless and read only root environment 14:12:23 may make that challenging 14:12:38 Does the node have the concept of a mode where no VMs can be run on it? (e.g. maintenance mode)? 14:12:40 are you sure it's ESXi and not plain ESX that allows this? 14:12:40 ok, so move hot-add to future feature? 14:12:54 mestery: not directly, but through the engine, yes 14:12:57 mestery: that's not defined by RHEVH itself, maint mode is defined by the mgmt system 14:13:01 i.e. oVirt Engine 14:13:12 pmyers: Yes, ESXi allows this, as well as ESX. 14:13:20 however, we could easily include logic that says only allow plugin installation if no VMs are running 14:13:34 the host need not know about maint mode, it just needs to know whether or not VMs are running 14:13:43 (two steps: overlayfs for the module for hotadding, and changed images for new nodes) 14:13:46 #info probably need to *maintenance mode* the node before hot-add 14:14:20 the challenge is dealing with service start and dynamic firewall rule changes during runtime 14:14:31 this is particularly challenging on a livecd based image 14:14:43 pmyers: it's doable though 14:14:51 pmyers: Yes, we would need APIs for 3rd party plugins to alter those during load, and start themselves during load. 14:14:52 yes, w/ hacks probably 14:14:53 and i have some ideas for how 14:14:57 ok 14:15:08 pmyers: not hacks really 14:15:11 k 14:15:18 but it goes into how we do dynamic firewalls and services 14:15:22 ok 14:15:57 but i really think hot-add should be future feature, rather than trying to solve it in first pass 14:16:03 agreed? 14:16:08 +1 14:16:15 we have a bz for 3rd party plugins 14:16:27 mburns: can you file new bug for separate rfe for hotplug 14:16:32 that way we'll keep these separate 14:16:41 #agreed hot-add will be future feature 14:16:52 #action mburns file rfe bz for hot-add 14:17:26 is there anyone who want to volunteer for ownership of this feature? 14:17:42 or at least to take first pass at designing? 14:17:53 heh, I would if I could go back to coding :( 14:18:02 managers-- 14:18:19 mburns, might as well be me 14:18:25 jboggs++ 14:18:25 jboggs: excellent, thanks 14:18:37 #action jboggs will come up with design for plugins 14:18:49 jboggs: basically continue adding to this wiki 14:18:59 w/ more concrete details and technical bits 14:19:04 and we can review on list 14:19:10 ack 14:19:20 ok we should go to next topic 14:19:21 any other plugin ideas? 14:19:26 lest meeting go for too long :) 14:19:26 3.... 14:19:30 2.... 14:19:33 1... 14:19:34 One thing 14:19:40 ooh, just under the wire 14:19:45 Persistent storage (e.g. plugin configuration data) 14:19:48 I've got a question, if I may 14:20:07 I imagine some plugins will need that, and whether it comes down from the mgmt host, or resides on the host, it would be good to have that capability 14:20:17 This would be for plugin configuration data 14:20:28 mestery: for config, yes, i agree 14:20:36 mestery: plugins will need to be aware of and utilize the persist functionality that we provide 14:20:42 so that they can persist their own config files 14:20:43 and that would be part of a stateless config bundle 14:20:52 yeah, what pmyers said 14:20:55 right and that persist cmd would also add those config files to the bundle 14:20:57 pmyers: mburns: Great! A way to store /etc files would be sufficient for the most part. 14:20:59 that would be sent to config server 14:21:03 no 14:21:06 not /etc 14:21:11 we need to not allow random edits to /etc 14:21:13 The equivalent maybe? 14:21:14 it's too complicated 14:21:15 yes 14:21:19 Yes, makes sense. cool 14:21:22 i.e. /opt/vendor/plugin/etc 14:21:26 Perfect! 14:21:28 :) 14:21:35 pmyers: we can allow (and may require) some stuff in /etc 14:21:37 the only changes that we'll make to /etc 14:21:40 but that's implementation detail 14:21:42 will be through controlled metadata 14:21:43 like 14:21:45 is there some field in the engine where you can see which version of the node is running on the host? If we're talking about plugins then it might be necessary to see somewhere which node build is running, so you know which node has to be restarted, to load the recent image 14:21:46 1. services that need to start 14:21:51 2. firewall ports opened 14:21:56 3. users/groups need to be added 14:22:01 4. selinux policy additions 14:22:11 xTs_w: good idea 14:22:16 we need a way to query the node 14:22:20 to get a list of installed plugins 14:22:21 and their versions 14:22:23 and state 14:22:27 xTs_w: that what you're after? 14:22:28 xTs_w: i don't know about the engine webadmin right now 14:22:28 ? 14:22:39 mburns: we just need to provide the data for vdsm to gather 14:22:42 but rhevm has an about box that has versions 14:22:44 #agreed With pmyers design for plugin config data not in /etc, and APIs for things in /etc 14:22:49 pmyers: sounds even better than my idea 14:22:53 :) 14:23:05 mburns: as long as we provide interface for vdsm to get at this data in a controlled way 14:23:16 pmyers: ack 14:23:21 it's then oVirt Engin and vdsm team's responsibility for how they visualize to users 14:23:48 could be as simple as /etc/plugin-registry file 14:23:53 that lists all plugins, vendors, versions 14:24:01 #info need to work with vdsm/engine teams to provide info on plugins, vendors, versions, etc 14:24:34 #info need RFE on vdsm/engine to publish those details in the UI 14:24:40 ok, let's move on 14:24:46 #topic stateless 14:24:55 #link http://ovirt.org/wiki/Node_Stateless 14:25:27 oh sorry one more thing 14:25:37 * mburns pauses 14:25:59 #info jboggs to check with RHEL baseos team on notion of 'stacks' since stacks will put things into /opt organized by vendor and we should probably consolidate around a single scheme for directory organization if we can 14:26:14 ack 14:26:30 #action jboggs to check with RHEL baseos team on notion of 'stacks' since stacks will put things into /opt organized by vendor and we should probably consolidate around a single scheme for directory organization if we can 14:26:43 jboggs: check with ddumas she'll have the right info/ppl to get details from 14:26:45 ok, Stateless... 14:27:23 any comments questions concerns with stateless? 14:27:27 definitely want to separate out TPM usage as a v2 add on to this feature 14:27:41 we can start with sneakernet delivered usb thumbdrives for the paranoid 14:27:53 the thumbdrive can contain the key or cert that we want to use 14:27:58 #action mburns move TPM to v2 14:28:35 i think the first thing we need to investigate and decide is 'what does the protocol look like between server and client' 14:28:41 pmyers: first should be key embedded in the image 14:28:49 mburns: that's for non paranoid case 14:28:51 but sure 14:28:56 so different levels 14:29:02 1. no key, no encryptio 14:29:02 and then we can expand functionality to allow for overriding 14:29:07 2. key embedded in ISO 14:29:11 3. key on thumbdrive 14:29:16 4. key in TPM 14:29:23 pmyers: strike #1 IMO 14:29:31 we can build with a default key in the image 14:29:36 that's hardcoded in src 14:29:38 that's worse than no key 14:29:46 never include pregenerated keys or passwords 14:29:49 it's a rule pretty much :) 14:29:57 it lures you into a false sense of security 14:30:04 better to be open and just not encrypt at all 14:30:05 ok 14:30:14 * mestery agrees with pmyers on not embedding keys or passwords. 14:30:15 simpler flow if there is a default key 14:30:27 yeah, but it's just a nono 14:30:28 but not too much more complex without 14:30:31 yep 14:30:47 basically if no key, then assume bundle is not encrypted 14:30:54 and skip the decrypt step 14:30:54 ok 14:31:10 so are we ok with 14:31:25 a bundle really just being an augeas script? 14:31:33 and then we get that augeas script and apply to the running node? 14:31:45 do we need things that can't be contained in an augeas script? 14:31:57 if so do we need to make the bundle a tarball that contains augeas scripts and 'something else' ? 14:32:04 pmyers: not sure we can say that at the moment 14:32:11 ok smth to mull over then 14:32:22 i think we should plan on augeas+something 14:32:29 ok 14:32:33 expandible 14:32:34 i like it 14:32:36 but minimize the something 14:32:46 so initial would be tarball containing augeas script(s) 14:32:50 for things that we can't do with augeas 14:32:51 the tarball is encrypted 14:33:13 and file rfes on augeas to be able to do what we need 14:33:18 ack 14:33:34 mestery: jboggs: ack? 14:33:39 ack 14:33:40 ack 14:33:52 ok transport/protocol 14:34:02 #agreed encrypted tarball with primarily augeas scripts 14:34:18 #agreed but support for "something more" 14:34:19 idea is that we define a protocol that other people can implement, but we create reference implmentation 14:34:29 that can be used for testing and small deployments 14:34:44 would matahari fit in here somehow still? 14:34:47 protocol should be very simple, basically get and put 14:34:53 jboggs: it could 14:35:08 that may make sense, we have other reasons for putting matahari on the node as well 14:35:33 but for people w/o broker setups 14:35:35 perhaps we need a fallback 14:35:42 that uses smth simple like a basic webserver 14:35:46 #idea matahari may make sense for transport protocol 14:35:47 someone mentioned webdav 14:35:52 that may be a good fallback 14:36:01 <--- ;) 14:36:03 so if matahari and bus/brokers present, use that, if not use webdav 14:36:12 #idea webdav as a fallback 14:36:56 ok what else? 14:37:03 pmyers: should this ref impl be part of ovirt-node? 14:37:07 yes 14:37:08 or separate git? 14:37:20 well maybe subpackage 14:37:24 ok, then as a ovirt-node-config-server 14:37:27 sub package 14:37:29 yes 14:37:33 (or similar) 14:37:39 we can fight about naming later 14:37:45 so that could be the QMF Console project 14:37:48 o-n-config-server 14:38:04 #info package as subpackage of ovirt-node 14:38:05 the webdav fallback would basically just be a 'here's how you configure your webdav server to allow this to work' 14:38:21 xTs_w: agree? ^^ 14:38:25 #info subpackage would be QMF Console 14:38:41 * mestery has to step out now for a bit. 14:38:45 #info webdav is fallback 14:38:51 pmyers: exactly :) 14:38:58 #info webdav is just documentation 14:39:14 mestery: thanks! 14:39:15 well documentation and then coding inside of ovirt-node to support that mechanism 14:39:22 future meetings should go shorter 14:39:24 as a client of the webdav server 14:39:54 ok next topic? 14:40:04 pmyers: i'm picturing ovirt-node-config-server being installed on another machine 14:40:10 ack 14:40:14 it would have to be 14:40:28 code to handle in ovirt-node would handle QMF first, then webdav 14:40:43 ok, let's skip bug scrub for now 14:40:47 well, it could depend on kernel cmdline args 14:40:55 you might override to say never use matahari only use webdav 14:41:04 pmyers: yes, but that's implementation detail 14:41:09 ack 14:41:09 ;-) 14:41:18 ok, next topic 14:41:26 +1 on skipping bug scrub 14:41:29 #topic Release Manager for ovirt-node 14:41:33 imo, you guys can just do that offline 14:41:36 close em at will :) 14:41:41 ack 14:41:47 +1 for mburns as release mgr 14:41:54 * mburns volunteers 14:42:48 apevec jboggs: ? 14:42:51 ack 14:43:06 #agreed mburns is ovirt-node release mgr 14:43:12 any objection from other community members? 14:43:35 as an aside, ovirt node team would love to get some new folks to help be maintainers 14:43:52 all that is required is to start submitting patches and show that you've got a good understanding of the code and concepts :) 14:43:52 pmyers: makes sense for me to take it for now at least 14:43:55 mburns: ack 14:44:04 just trying to solicit help :) 14:44:13 if someone is interested, i'm open to working with them to take over 14:44:27 at least as a backup 14:44:44 ok, moving on 14:44:45 #topic Release Schedule 14:44:56 i'm open to suggestions here :) 14:45:11 i think roughly monthly releases for now sounds good 14:45:21 same here 14:45:35 with flexibility to move to oVirt project release schedule 14:45:41 when that eventually gets decided 14:45:50 well, not move to, but synchronize with 14:46:07 +1 14:46:12 i.e. if oVirt releases every 3 months on the 15th, we'll make our monthly releases on the 15th 14:46:20 so we first need to do the official f16 based release 14:46:24 what will our date be for that 14:46:37 do we want it working with engine? 14:46:58 if yes, then we're blocked by a vdsm bug 14:47:28 i think EOM is a good target in general 14:47:33 yes that's a prereq 14:47:44 abaron_wfh: any movement on that vdsm bug? 14:47:56 it's last thing blocking us getting a working oVirt Node published 14:47:56 actually, lets say 15th of the month for releases 14:48:06 but f16 will be ASAP 14:48:11 +1 14:48:26 pmyers: i'm working with dougsland on vdsm bug 14:48:49 ok, root cause yet? 14:49:11 i'm pretty sure i know the problem, but i don't know enough about engine and vdsm internals to fix it 14:49:20 #agreed 15th of each month for releases 14:49:38 #agreed synchronize day of month with oVirt project 14:49:45 #agreed monthly releases 14:49:49 ok, dougsland is aware that this is high prio and is blocking upstream initial release? 14:49:55 pmyers: yes 14:50:02 k 14:50:23 pmyers: i'm completely at his disposal at this point for making sure he has ovirt-node builds 14:50:29 ok 14:50:40 mburns: do you need engine help? 14:50:50 mburns: let's make sure we are periodically updating the relevant bug with state so we know where this is at on a day to day basis 14:50:59 jumper45: possibly, but not sure yet 14:51:07 first step is to figure out the vdsm part 14:51:13 cctrieloff is particularly interested in getting this resolved so we can do our initial launch of release #1 14:51:17 mburns: let me know if you need anything 14:51:18 jumper45: i'll ping after the meeting 14:51:28 mburns: ok probably next topic 14:51:41 #action mburns to work with jumper45 and dougsland on blocking registration issue 14:51:51 #topic Questions and Comments 14:52:00 ok, open season now 14:52:06 nothing from me :) 14:52:14 * mburns ducks and covers 14:52:25 ok, going once.... 14:52:28 twice.... 14:52:33 gone... 14:52:42 #agreed bug scrub can be done offline 14:52:48 Thanks all 14:52:50 pmyers: agreed. 14:53:03 #endmeeting