***: gcw|mbpro has joined #arpnetworks
Webhostbudd has quit IRC (Quit: Leaving)
Ehtyar has quit IRC (Quit: Going!)
`ariel has quit IRC (Ping timeout: 245 seconds)
ariel has joined #arpnetworks
RandalSchwartz has joined #arpnetworks RandalSchwartz: My disk (on kvr05) is acting horribly slow.
is there a system status update that might be relevant?
up_the_irons? ***: himuraken has joined #arpnetworks
RandalSchwartz has quit IRC (Ping timeout: 246 seconds)
ryk has joined #arpnetworks
ryk has quit IRC (Changing host)
ryk has joined #arpnetworks
Webhostbudd has joined #arpnetworks
RandalSchwartz has joined #arpnetworks
RandalSchwartz has quit IRC (Changing host)
RandalSchwartz has joined #arpnetworks
k3asd` has quit IRC (Ping timeout: 246 seconds)
k3asd` has joined #arpnetworks
sako has joined #arpnetworks
hive-mind has quit IRC (Remote host closed the connection)
hive-mind has joined #arpnetworks
k3asd` has quit IRC (Ping timeout: 246 seconds)
k3asd` has joined #arpnetworks
k3asd has joined #arpnetworks
k3asd` has quit IRC (Ping timeout: 248 seconds)
nestea has quit IRC (Ping timeout: 260 seconds)
nestea has joined #arpnetworks
RandalSchwartz has quit IRC (Ping timeout: 246 seconds)
mtve has quit IRC (Ping timeout: 244 seconds)
mtve has joined #arpnetworks
RandalSchwartz has joined #arpnetworks
RandalSchwartz has quit IRC (Changing host)
RandalSchwartz has joined #arpnetworks RandalSchwartz: up_the_irons - any idea about my slow disk?
been crappy all morning
do I have a noisy neighbor again?
DAMN... this needs fixing.
1048576 bytes transferred in 22.494234 secs (46615 bytes/sec)
it took 22 seconds to write A MEGABYTE
up_the_irons ding ding ding up_the_irons: RandalSchwartz: yeah, i've been having intermittent problems with the host kvr05 RandalSchwartz: can you migrate me to a different box then?
it's interfering with business and tasks. jdoe: mine's fairly slow too.
52428800 bytes (52 MB) copied, 71.7589 s, 731 kB/s
:/ up_the_irons: RandalSchwartz: the first priority is to fix the issue with kvr05; i think it is a noisy neighbor, as you said ***: RandalSchwartz has quit IRC (Ping timeout: 246 seconds) up_the_irons: jdoe: are you able to reach your vps at all? (i assume perhaps so, b/c you could get those test results) jdoe: up_the_irons: yeah. up_the_irons: jdoe: and it is on kvr05? jdoe: not sure offhand.
how would I tell? up_the_irons: jdoe: it's the same as your vnc host (listed in portal under vm details) jdoe: you can tell how often I log in ;)
sec. up_the_irons: :) ariel: mm i'm on KVR05 and is sloww, my monitoring software is detecting the VM as down/up/down/up up_the_irons: i can't ssh into kvr05. serial console is responsive, however, it also won't let me in past me typing in my login name. i think the disk is completely locked up CaZe: kvr10 is doing fine. :D up_the_irons: ariel: although it is interesting that you can still get in, even though i can't get into the host. jdoe: up_the_irons: lol... I have at least two portal accounts, neither showing my vps.
nope, I'm on kvr06 up_the_irons: jdoe: roger jdoe: seems better now, 6-ish M/s. up_the_irons: jdoe: i'm not touching kvr06 ;)
but 6M/s is still pretty slow jdoe: I know, I'm just saying whatever it was, it's improved slightly. And no argument ;) up_the_irons: kvr05, i'm in!
jdoe: roger :)
now to find out who is hogging the disk jdoe: prolly that schwartz guy. -: jdoe NODS SAGELY. up_the_irons: randal actually _does_ use quite a bit of I/O, but not enough to take down the box.. ;) jdoe: I'm surprised any guest can make it *that* unresponsive. toddf: maybe someone is swapping? jpalmer: up_the_irons: sorry for the delay on centos. I had to board up house and horses for the storm. I'll be unboarding and such tonight and tomorrow.. then can get back to work on it. up_the_irons: jdoe: well, kvm/qemu doesn't really have too much in the way of I/O isolation. Like, each VM is just a Linux process and if it wants to really load up the disk, the scheduler isn't gonna stop it
jpalmer: oh problem at all, take your time. I'm actually going to make the 5.8 version :) twobithacker: I saw a presentation from Verio a few years back where they had talked about all the custom work they had done to give a level of control over disk I/O, memory and CPU usage to their VPS product
they claimed they were going to release that back to the FreeBSD project, but never did :/ jdoe: up_the_irons: linux does kinda suck for that. up_the_irons: I have come up with a new way of dealing with disk provisioning for Linux guests that will afford me a *much* easier time supporting more distros; therefore, I'm pumped to make CentOS 5.8 templates, then Ubuntu 12.04 after that, then who knows... Fedora, Gentoo, Arch, ...? you name it jdoe: LFS PLZ. up_the_irons: twobithacker: jdoe yeah, i've seen several proprietary solutions talked about for that stuff, but nothing out in the open (but i admit, i haven't looked into it in a while) toddf: as the list grows longer so does the amount of effort to maintain such a list. I know it won't effect our bottom line but still... ;-)
<sarcasm> doesn't ionice work? </sarcasm> jdoe: up_the_irons: yeah I dunno, it doesn't seem like a problem anyone is anxious to solve, despite the work that's gone into cfq and deadline. I dunno why. I can load the shit out of the disk on fbsd, and the machine is still responsive.
try that on linux and it craters. up_the_irons: toddf: well, that's the thing, up until now the effort to maintain such a list has been prohibitive; but, with my new disk provisioning strategy, no longer :) toddf: I'm not in on all the details, but with a new disk provisioning strategy, doesn't that still mean someone has to manually test install a distribution for it to work? up_the_irons: jdoe: i think it is a problem that people have accepted; kinda like DoS attacks, they suck but what can ya do? ;)
toddf: yes, but only once, then the template is copied for every VM down the line. new distro versions don't come out _that_ often; openbsd is actually the most frequent publisher, where I find myself having to update my templates every 6 months ;) toddf: up_the_irons: request multiple vps's across different kvr* hosts like randalschwartz, and make it a cluster that will lessen the effect of one kvr* with temporary io starvation ;_) jdoe: eh, ubuntu is on a 6 month cycle too isn't it? up_the_irons: toddf: wait wut?;)
jdoe: is it? i haven't even noticed... for ubuntu i tend to do only the LTS versions
if you want non-LTS, a ubuntu fan can just get an LTS and dist-upgrade, very simple process with like 2 commands toddf: up_the_irons: my 'request multiple vps' thing was in rsponse to your 'what can ya do?' and my response involves more business for you *grin* up_the_irons: toddf: :) jdoe: up_the_irons: if you're tracking every version, then yeah it's every 6 months, most of the time.
XX.04 and XX.10 up_the_irons: roger
time for kvr05 to admit failure and reboot
up 968 days, 13:33, 5 users, load average: 11.82, 12.28, 20.87
of *course* i can't reach 1000, ever...
the load was getting better, but now it is back to worse ***: beandog has joined #arpnetworks
beandog has quit IRC (Quit: Leaving) plett: up_the_irons: re your comments earlier about protecting one guest from another on KVM, cgroups might be what you're after jdoe: didn't know those worked for io... wonder how well they work. andol: jdoe: Well, if we all start doing really heavy i/o usage, perhaps up_the_irons will find out for us? :-) ***: Webhostbudd_ has joined #arpnetworks
Webhostbudd has quit IRC (Read error: Operation timed out)
heavysixer has joined #arpnetworks
ChanServ sets mode: +o heavysixer
jbum has joined #arpnetworks up_the_irons: plett: i'll check it out, tnx ***: k3asd has quit IRC (Ping timeout: 265 seconds)
RandalSchwartz has joined #arpnetworks
RandalSchwartz has quit IRC (Changing host)
RandalSchwartz has joined #arpnetworks RandalSchwartz: up_the_irons - there's no PST until november. :(
"at approximately 08/27/2012 14:30 PST" - not possible ***: arenlor has joined #arpnetworks RandalSchwartz: but at least my system is working again. :) arenlor: Do we know the cause?
Also, my VPS seems to be working fine. RandalSchwartz: it was just kvr05 acting up
900+ uptime days arenlor: RandalSchwartz: I'm on kvr05 ***: Ehtyar has joined #arpnetworks jdoe: arenlor: note past tense arenlor: jdoe: So the answer is no that we don't know what the cause was. ***: henderb_ has quit IRC (Quit: Changing server)
henderb has joined #arpnetworks up_the_irons: RandalSchwartz: bah, i always end up screwing the date in some way
RandalSchwartz: so, no further issues now? (i don't see any)
arenlor: cause is unknown, but i simply suspect after 960+ days of uptime, something just got kinked. I've analyzed enough logs and i'm writing the resolution as "Power cycling the server fixed the high I/O wait issue" and leaving it at that. I'll take 960+ days of uptime and move on... :) ***: sako has quit IRC (Ping timeout: 268 seconds) jdoe: up_the_irons: quitters never win and winners never quit... milki: i usually win more sleep if i quit early
:o RandalSchwartz: yeah - seems fine now ***: arenlor has quit IRC (Read error: Connection reset by peer)
arenlor has joined #arpnetworks up_the_irons: RandalSchwartz: cool ***: sako has joined #arpnetworks up_the_irons: sako: so looks like you didn't make it to la dev ops? ;) -: up_the_irons wanders off sako: stop spying on me
lol ***: jbum has quit IRC (Quit: jbum) pjs: up_the_irons, turns out we have a few friends in common, sako being one
and Lars ***: sako has quit IRC (Ping timeout: 244 seconds)
sako has joined #arpnetworks
himuraken has quit IRC (Ping timeout: 246 seconds)
himuraken has joined #arpnetworks
sako has quit IRC (Ping timeout: 260 seconds) up_the_irons: pjs: you know sako and lars? hah, nice
pjs: do u go to LADevOps at all?
perhaps pjs is there now... -: up_the_irons looks around up_the_irons: we had 75 nicks in here earlier, a record! ***: sako has joined #arpnetworks -: up_the_irons spies on sako ***: Webhostbudd has joined #arpnetworks
Webhostbudd_ has quit IRC (Ping timeout: 250 seconds) Webhostbudd: up_the_irons: still do =) up_the_irons: Webhostbudd: still do wut? :)
oh, 75 nicks! CaZe: Hmm.
How many VPSes? arenlor: CaZe: I have two, what about you? CaZe: Infinity. arenlor: Nice trick, gave an interger overflow on the billing platform I'm sure. ***: Webhostbudd has quit IRC (Ping timeout: 250 seconds)
Webhostbudd has joined #arpnetworks
Webhostbudd_ has joined #arpnetworks
Webhostbudd has quit IRC (Ping timeout: 250 seconds)
Aerosonic has joined #arpnetworks up_the_irons: holy crap, 76 nicks! Aerosonic: up_the_irons: Yeah, so I decided to check out ARP Networks. up_the_irons: Aerosonic: ah, your nick looked new
:)
although it sounds oddly familiar... Aerosonic: I used to be on Linode?
Maybe you know me from their old support chan ***: sako has quit IRC (Ping timeout: 256 seconds) up_the_irons: Aerosonic: hmm.. i was never on their old support chan; i used to be in the slicehost one... oh well, doesn't matter )
:) ***: sorressean has quit IRC (Read error: Operation timed out)
sorressean has joined #arpnetworks