Zoom improvements requested by music schools

felipc · on Jan 15, 2021

A long time ago in Internet time, Justin Frankel (creator of Winamp) created a tool for live music collaboration that approached this problem in a very different way. Basically what it does is that it _adds_ delay for everyone in a way that synchronizes the musical measures, such that you play a measure while listening to everyone else's previous measure.

I never tried it because I'm not a musician (just a longtime fan of Justin and Winamp), but I always found the concept very interesting. Apparently it is still alive: https://www.cockos.com/ninjam/

kfarr · on Jan 15, 2021

Bingo this is the answer. You can't "speed up" the speed of light, but you can have predetermined latency with a synchronized clock that lets you do lots of great things.

Other ideas (this is not new btw), you don't get to hear the other participants but you're all on the same sync'ed clock with a metronome and trust the general output will be ok. Final mix is synced to viewers (obviously on delay to achieve sync).

Areading314 · on Jan 15, 2021

Most studio recordings are done iteratively, where you record a rough "scratch" track but then one by one, record over each part so that the final recording is the sum of everyone playing their best. This combines the feeling of playing with a group asynchronously with producing a high-quality recording. Would be cool to see tools to make this easier although the current gen of DAWs is pretty good.

gonzo41 · on Jan 15, 2021

It's worth watching some interviews with artist like Jack White to get their take on the quantization of music like this. It's not always the best way to make a track. And then when you get o concert If the artists can't bring it on stage then it's a bit of a letdown.

Gravityloss · on Jan 16, 2021

I wonder if this is why older music often sounds so much more pleasant and relaxed. Not recorded with a click track?

perfmode · on Jan 15, 2021

would love a link to such an interview if you have a particular one in mind

AndrewUnmuted · on Jan 15, 2021

Latency is the most crucial issue for audio when doing live broadcasts, especially when the peers all need to be synchronized to a rhythm or pulse.

However, it doesn't stop there. Latency is also important for voice, not just music; excess latency is also the cause of the dreaded zoom fatigue [0].

We humans seem to have brains designed for particular cadences of conversation, and products like Zoom really work to disrupt & disengage these preferences we have, leading to poor communications outcomes.

[0] https://www.wsj.com/articles/why-does-zoom-exhaust-you-scien...

unreal6 · on Jan 15, 2021

I wonder if over time, we will learn to adapt to the new cadence as we continue to socialize over platforms like zoom. My hunch is that we will, it will just take time for our minds to adapt to the new medium.

AndrewUnmuted · on Jan 16, 2021

I'm not so sure.

We'll likely fix the latency issues inherent to the way we do live broadcasting on the web today before the human race has the time needed to adapt to the new cadence.

Our vocal communication was solidified into our genetic code over millions of years of iteration. I imagine that in a decade or less, the latency issue will be fixed for most of the connected world.

yholio · on Jan 15, 2021

I'm not sure i understand: if the cello is delayed so I can feel like I play my trumpet in sync, then the cello player can't also feel he'd in sync with the trumpet, he must be ahead.

So there is some sort of hierarchy of instruments or a dumb sync track pre-recorded.

ziaddotcom · on Jan 15, 2021

I'm not a musician, but I interpreted it like this:

Lets say the delay is 15 seconds, you hear the composite, including your part delayed, and you "think, cool, I played the right thing at the right time, I'll keep going with my same assumptions" or you hear "my part was way off, but the rest sounded decent enough, better do something different."

The part where everyone is just playing terribly isn't a concern or doesn't manifest because you've got a bunch of intermediate or better musicians playing.

throwaway2245 · on Jan 15, 2021

if you've ever tried to talk against a mic with a delay of about a second, you'll know this is easier said than done.

This literally is marketed as a speech jammer.

ziaddotcom · on Jan 15, 2021

You're right, but most speakerphones still aren't even properly full duplex, much less utopian zoom software for apex cyber virtual orchestras.

The amount of times I'm tripping up on my own words because the last thing I said is blaring out a friends/family mobile speakerphone and then back into the mic is disappointing on all levels, to say the least.

travbrack · on Jan 15, 2021

I think it works only if everyone is playing the same chord/scale. Kinda like someone playing with a loop pedal. You're right that it definitely wouldn't work with a piece of music.

steerablesafe · on Jan 15, 2021

It probably works for practicing a few measures on repeat.

gvido · on Jan 15, 2021

I've spent most of 2020 building something similar. Now I just wish I had a landing page ready so I can plug it here instead of over engineering the app itself, but oh well :)

Anyway, what I'm building is meant more for the repetitive kinds of electronic music, but I solved the problem by just making it work like a shared loop pedal that records up to 16 measures of audio.

Everyone works asynchronously and can add or remove audio on their own time, but loops get synced to other players as they get recorded.

Best part? You can do cool stuff with browsers nowadays (even record uncompressed audio). So it just needs a web browser.

weinzierl · on Jan 15, 2021

No comment that mentions Justin Frankel and Winamp is complete without mentioning Reaper. Software written with true passion - that's what Winamp was and that's what Reaper is today.

GrumpyNl · on Jan 15, 2021

We tackled this kind of problem with voice systems on the server side, record all streams simultaneously, sync and merge them and stream the recording. Downside, there will be a delay, but it is a fixed delay and its determined by the worst connection. ( this was around 1995 )

TonyTrapp · on Jan 15, 2021

Ninjam is so much fun - and it just works!

greensoup · on Jan 15, 2021

Do you still use it?

What servers are you on? I remember hours, days of playing on it, 10 years ago :) But didnt know it was still active!

TonyTrapp · on Jan 15, 2021

Hah, it's been a while, I haven't used it recently. I think we were using it on a self-hosted server just available to a couple of friends.

kgarten · on Jan 15, 2021

When I saw the headline, I thought they might provide some more insights or maybe a novel ideas on how to overcome the latency problems. Very disappointing public relationship piece.

I agree with others, latency will always be an issue over a distance.

Thought Leadership ... ha ha ... I'm impressed by their leadership, I didn't expect latency and timing matters in Music. Yet, I'm also a novice and ignorant :)

Bud · on Jan 15, 2021

Professional classical musician here.

The real problem here is latency, and I'm good friends with the folks who are working the hardest on that issue for real musicians wanting to collaborate with each other in real time. The most important work is being done with SoundJack.

https://www.ianhowellcountertenor.com/soundjack-comprehensiv...

silasdavis · on Jan 15, 2021

From the home page of sound jack site (https://www.ianhowellcountertenor.com/soundjack-real-time-on...):

> In practice, musicians ignore sound and watch a conductor when separated by such distances [of 100 feet].

It had never occurred to me that a conductor was a kind of clock at an approximately uniform distance from the performers.

saberdancer · on Jan 15, 2021

Uniform distance matters little for a conductor. Performers watch visual cues from him and speed of light is very fast (unlike sound).

If you are 100 meters away, you have delay of about a third of a second if you use sound, but for light it will be a fraction of a millisecond.

croon · on Jan 15, 2021

> Uniform distance matters little for a conductor. Performers watch visual cues from him and speed of light is very fast (unlike sound).

Doesn't matter for the performers, but it matters for the conductor.

sam_dal · on Jan 15, 2021

Conductor is the metronome.

swaggyBoatswain · on Jan 16, 2021

This, I've asked my musician friends too how teaching has been like through covid. All the platforms for recording they stated have at least a 0.5s delay, which makes playing together with a student very difficult. Usually the teacher sometimes has to prerecord a sequence instead and play a video, which makes teaching online not really convenient or fun for the teacher.

kgarten · on Jan 15, 2021

That's a great guide! Thanks for sharing. Much more what I would have expected from the parent link :)

reikonomusha · on Jan 15, 2021

I agree. At the end the conclusion is that it’s a “step in the right direction,” which was underwhelming as a piano student myself.

pishpash · on Jan 15, 2021

Latency isn't physically solvable though. You'll need another network not the internet.

smolder · on Jan 15, 2021

The established technique and one that many people have independently invented (myself included) is to chain the performers so one person hears only a metronome and each other person hears the previous members on the chain, forwarding along the audio in sync.

As I've said elsewhere in the thread zoom is the wrong tree to bark up since these tools exist.

hunter2_ · on Jan 15, 2021

As a musician, audio engineer, and software engineer, this just blew my mind. I can't believe I've never heard of or thought of this until now. But my primary focus has been non-audio software through the pandemic, so I haven't been actively trying to find solutions to this. Thank you.

What applications would you recommend? Can it also do video, if not each link in the chain, at least one camera for the conductor (the node providing the metronome audio)? I realize the conductor would hear nobody at all, until playing back a recording from the end of the chain. Playback of said recording could begin after the song, or one measure into the song if there are no tempo/time changes (NINJAM style).

pas · on Jan 15, 2021

There's a sea shanty called the Wellerman that recently blow up on TikTok, and listening to it I was amazed how in sync the whole thing was despite all the layers. (It uses the same chain technique, but not "live".)

hunter2_ · on Jan 15, 2021

Not-live overdubbing with perfect sync has been going on since the dawn of multi-track recording, although the popularity of including video from each layer (and DIY without a tracking engineer running the session) surged in popularity during the pandemic. With modern editing in a DAW, quantizing every phrase, word, or even syllable to the grid (or other layers) isn't difficult at all, just a bit tedious.

smolder · on Jan 15, 2021

Someone else in this thread linked to software like I described: https://github.com/gwillen/solstice-audio-test

I will try to search around and get you some more examples. I can't remember off the top of my head what else worked that way.

spacechild1 · on Jan 15, 2021

For which kind of music does this work in practice, though? The first musician in the chain would have a very lonely experience ;-)

Another technique is to mix all musicians on a server and send the mix back to all of them. Everyone hears the exact same signal, so in theory people could play perfectly in sync, but in practice hearing yourself delayed is very awkward and needs lots of training. We tried it but gave up quickly.

visarga · on Jan 15, 2021

The first musician could be the conductor and they already are accustomed to conduct a bit ahead of the sound. Or they could put the instrument with most notes.

spacechild1 · on Jan 15, 2021

Musicians want/need to hear each other. This is obvious for improvised music, but even when everything is written out, players have/want to listen to each other.

While this approach certainly works from a technical perspective, but I don't see many musical styles where this would yield a satisfying musical result. However, it might work for pieces which are specifically written for this constellation.

Bud · on Jan 15, 2021

Latency can be radically improved with SoundJack and tweaks to router settings, combined with audio equipment that introduces as little latency as possible. (i.e., don't use cheap USB mics.)

Do all of this, and you can collaborate really well with people in the same geographic area as you. Of course you can't get around the speed of light, but you can do a lot better than the default setup or Zoom will give you.

saurik · on Jan 15, 2021

Yeah: it is maybe worth noting explicitly for people that the speed of sound is sufficiently slower than the speed of light that two people laying instruments on opposite sides of a small stage can actually, if they have pickup mics and headphones using software carefully designed to minimize latency, be reasonably far apart over the Internet and have the same experience (such as between a bunch of musicians living in San Francisco, or all in the same region of Los Angeles).

Bud · on Jan 15, 2021

Even about two orders of magnitude further than that, based on the work done so far.

For instance, 22.5ms latency has been achieved at a distance of 900 miles (between the middle of Kentucky and Boston) with SoundJack and this setup:

Both port-forwarded, both using Fast Music Boxes, both on fiber internet connections. First user had an MXL770 XLR mic and an Audient EVO4 interface. Second user also had an XLR mic and an audio interface.

San Francisco is 7 miles wide.

pmiller2 · on Jan 15, 2021

I think you're underestimating the amount of latency that the human ear can detect. If the audio is as little as 10 ms out of sync, either way, someone will notice that.

4ec0755f5522 · on Jan 15, 2021

Depends on what sound you are measuring. A single drum hit where one channel is delayed by 10ms is very audible. Would a violin and cello player have issues playing together if one was delayed by 10ms though? Could anyone hear that in the audio itself, or even detect it in their performance at all? That seems a very different thing.

xavriley · on Jan 15, 2021

Sound travels at ~1ft per ms in air so 10ms equivalent to being 10ft away. Sure, you notice the lag at that distance but it’s still possible. 30ms (or 30ft) is the point at which I would give up on trying to play rhythmic music well. That said 30ms latency is a lot of wiggle room where the speed of light is involved

pishpash · on Jan 15, 2021

People watch each other to synchronize. Problem with network latency is the visual latency isn't any better than aural latency, unlike on a stage.

saurik · on Jan 15, 2021

And I think you are underestimating the speed of light ;P. The round-trip latency (twice my distance) from my apartment in Santa Barbara to servers I have in Nevada and Northern California is 20ms, making me the equivalent of like 11ft away from those places by sound. If I were communicating within San Francisco or Los Angeles I should trivially be able to hit 3ms round trip times.

pmiller2 · on Jan 16, 2021

In theory, maybe. In practice, you'll probably be limited to about 15ms round trip.

kilnr · on Jan 15, 2021

How does this impact large orchestras? Some of these musicians have got to be (way) over 10 m apart. Even 10 m at the speed of sound is almost 30 ms.

saberdancer · on Jan 15, 2021

That is the role of a conductor. He hears the piece as it should be heard and is active participant in "tuning" it. My understanding is that he can slow down/speed up various performers to get them in sync.

Performers need to learn to ignore the rest and play their instrument while looking at the conductor.

pishpash · on Jan 15, 2021

That's why they rely on light in addition to sound: watching the conductor, watching each other.

AmericanChopper · on Jan 15, 2021

Over surprisingly short distances you run into issues with the speed of light. I did live sound engineering when I was younger, and worked in some studio recordings, and the conventional wisdom of the field is that musicians will begin to get distracted by latency greater than 10ms (which you have to care about when sending foldback to a musician, even when all of your equipment is in the same room).

10ms only gets light 2/3rds of the straight line distance between LA and NY. After you start accounting latency in infrastructure components, round trips, and the fact that cables aren't laid in straight lines between two users, the idea of live musical collaboration over even short distances starts to seem implausible.

Bud · on Jan 15, 2021

It's actually not implausible; it's already happening. No, you can't do it between LA and NY, but you definitely can do it across distances of, say, several hundred miles quite well.

https://www.ianhowellcountertenor.com/soundjack-comprehensiv...

pmiller2 · on Jan 15, 2021

Well, I don't know about how fast electrons travel in copper, but I imagine it's well below c. In optical fiber, the speed of light is about 2/3 c. So, you should probably at least multiply your numbers by 2/3. In that case, you can only get 4/9 of the way between LA and NYC.

visarga · on Jan 15, 2021

> how fast electrons travel in copper

Signals are fast, electrons are slow - on the order of a quarter of a millimeter per second.

> the drift speed through a copper wire of cross-sectional area 3.00 x 10-6 m^2, with a current of 10 A will be approximately 2.5 x 10-4 m/s

https://www.physlink.com/education/askexperts/ae69.cfm

daniel-cussen · on Jan 15, 2021

Signal travels at roughly 2/3 c in copper as well.

bhj · on Jan 15, 2021

The threshold is more like 3ms in my experience. When tracking vocals with headphones (no acoustic latency) anything higher will start feeling weird. This is also why Thunderbolt is preferred over USB for recording; you can get in and out of the CPU with effects much faster.

saurik · on Jan 15, 2021

3ms by sound is only 3.36ft away.

AmericanChopper · on Jan 15, 2021

Distance from the source of sound is actually very important in live performances. There have been some empirical studies on it, but it’s still something people often get rather opinionated about. There is a reason though that a small ensemble can play to the beat of something like a drum, but a large ensemble like an orchestra can’t. There’s plenty of writing about how orchestras lag behind conductors and how different sections of orchestras have to take their timing queues differently depending on where they’re situated.

10ms is what I was always told was the generally accepted threshold after which a musician start to experience increased difficulty as a result of the latency. Which largely aligns with my own experience with audio latency. From there the difficulty just keeps increasing, and the performance quality keeps degrading until at some point the musicians and the audience gives up.

pmiller2 · on Jan 15, 2021

Thanks for writing this. I made the exact same point in another subthread, that 10ms is enough to mess with your brain. My only experience of it is when the audio and the visual out of sync by only about 10ms while watching something (I forget what). I can't watch anything like that for any real length of time, and I'm not even trying to play along.

saurik · on Jan 15, 2021

10ms by light is 1862.824 miles away (which is approximately the distance from Los Angeles to Chicago).

I honestly just don't think you are truly appreciating that light is almost a million times faster than sound.

AmericanChopper · on Jan 15, 2021

The refractive index of single model fiber is ~1.5. So 10ms of fiber transit is closer to 1250 miles. The fact that no two real world internet users are connected by a straight line of fiber optic cabling is going to bring that in even further. If you were relying on round trip latency for some reason, that would cut the distance in half.

And that’s just for an idealized communication network. A real world use case would have a lot more latency introduced by routing/switching, and all of the typical quality issues you’d expect from ISPs. I used to live within line of sight of the data centre that hosted an online video game that I played, and the lower bound of my ping was about 15ms, it was usually in the 20s. Those factors are why something like this is unlikely to ever provide a high quality experience over even relatively short distances. I’ve just always found it interesting how quickly you start to run into issues with the speed of light when you’re trying to optimize for network latency.

saurik · on Jan 15, 2021

I mean, the stated use case is for a music school. I actually live in a University town. We have tons of students all "working remotely" within a single mile radius of campus, and many of them are connected together by our campus intranet -- the University does wireless point-to-multipoint links to connect all of these random housing options (which include large buildings they simply bought in the community) -- which is something I think we could easily push to include more buildings. I bet we could easily pull off 2ms networking overhead.

(And no: you are wrong about round trip. If you care about round trip time then 3ms of round trip by sound is so close that the violin player is going to be elbowing the singer. Sound is fundamentally slow, and you need to just accept that.)

AmericanChopper · on Jan 15, 2021

A well architected local network is about the only type of network that a service like this could operate over, and still provide a decent experience.

Also, the reason you'd care about round trip time, is if you needed mixing or processing to be done to musician foldback (I never mentioned 3ms being an issue btw). The mixing and processing of sound is prior to being sent to foldback monitors is a completely standard process in just about all live and studio sound engineering. Latency is a very important consideration for sound engineers, and it's why in situations like stadium performances, the only sounds that matters to a musician on stage, is the sound coming out of the foldback monitors directly in front of them. I'm getting a rather strong impression that you don't know much about what actually goes into sound engineering.

saurik · on Jan 16, 2021

I guess I don't understand why you think "sound engineering" is relevant to the point of being required, as the question to me here is more about what happens at something like a "session"? I used to go every week to play with a large mix of random people at an Irish bar... we definitely didn't have monitors (or a sound engineer ;P). People have been playing music with each other in groups for a long time, and being within 11'3" of everyone seems unrealistic. I have been in drum circles larger than that (I am at best a percussionist, though I was dabbling with fiddle), and I would assume that to be the case where precision matters the most. The stage we were on (which is weird: most sessions I've seen--and I have seen more than I participated in as I got involved in this stuff due to dating a musician--are at dining tables) was definitely over ten feet wide.

FWIW, when I asked my professional musician ex about all of this (which was before I read this comment of yours here) she also mentioned monitors, but it was because she claimed a core consideration for her with respect to distance in a performance was volume of her instrument drowning out her playing companions (and when I pushed into that she said if the acoustics were bad enough they would use monitors). She also told me that people she knew were already using software to play with each other over the Internet. If nothing else, I feel like "proof by counter example" should win here vs. your statement that this is just somehow impossible? As others have pointed out in this thread, the SoundJack people exist and claim to do this (whether or not you believe they have users: my ex apparently knows of users, though I don't know if they are using the same software).

pmiller2 · on Jan 16, 2021

Right. I am saying that if the light from my TV reaches me essentially instantaneously, and the audio is on a 10ms delay, that messes with my brain.

MrStonedOne · on Jan 15, 2021

I'm not sure on the percentage of assuming latency is only coming from the network/internet layer.

klodolph · on Jan 15, 2021

It’s more or less workable for local (metropolitan) networks.

layoutIfNeeded · on Jan 15, 2021

You'll need faster-than-light communication actually.

cush · on Jan 15, 2021

Yeah seems to just be for async music lessons.

Moru · on Jan 14, 2021

"The biggest problem is Latency". Try turning off video and instead run Mumble with low latency settings. Should be way better than zoom at least. Why do we always have to create something new from scratch when we already have a solution that just needs a bit of polish?

noman-land · on Jan 14, 2021

I gotta mention JackTrip in this thread. You can stream full quality audio with near zero latency. If you're within a ~300 mile radius of your peer, it boasts <20ms latency, which is low enough to feel instant. I've tried it with a friend and it's pretty incredible.

https://news.stanford.edu/2020/09/18/jacktrip-software-allow...

luplex · on Jan 14, 2021

My dad uses https://www.soundjack.eu/ running on a Raspberry Pi to practice with his wind quartet. along with a separate video call so they can see each other.

Its also a peer2peer system, and not easy to set up correctly.

zinekeller · on Jan 15, 2021

Well, considering that ping times can be still abysmal due to your ISP's incompetence...

Some tests that focus on interconnection issues (ISP to another ISP):

Continuous

To Azure: https://azurespeedtest.azurewebsites.net/

To AWS: https://awsspeedtest.xvf.dk/

One-time

To Cloudflare: https://speed.cloudflare.com/

To Netflix (maybe misleading due to direct interconnectivity in some ISPs): https://fast.com/

jimmydorry · on Jan 15, 2021

Sorry, but 20ms is more than double the amount I can tolerate when playing my instrument.

Even just the difference between local loopback (instrument -> headphones) vs instrument -> USB -> PC -> USB -> headphones, feels like playing with someone continually dragging the tempo down.

I'm sure I could relearn and adapt to >10ms, but this is a far cry from a live experience of playing with others, and I am definitely not alone in my cohort (after trying several low latency remote meetup configurations).

Bud · on Jan 15, 2021

20ms is actually equivalent to playing/singing with someone around 20 feet away from you, on a stage. (Sound travels around 1.13 feet per millisecond.)

Many of us in classical music are doing this all the time. So it's definitely doable. Especially if you have a conductor! Indeed, that's one of the reasons conductors are useful.

dash2 · on Jan 15, 2021

But the conductor is exactly what solves the problem, and which one can't use when one is working via Zoom because of latency.

pmiller2 · on Jan 15, 2021

If you don't have a conductor, always, always, always follow the soloist or concert master, which ever one you have.

gorkish · on Jan 16, 2021

Listening to yourself on a 20ms delay is a far different thing than listing to someone else on the same delay. The latter is what you experience in every day life all the time as sound travels about 1ms/foot. The former causes a negative feedback loop that will cause problems as you try in vain to effect an impossible correction. https://en.wikipedia.org/wiki/Delayed_Auditory_Feedback

wilburm · on Jan 15, 2021

I’ve read that setup was non-trivial. How did you find it?

noman-land · on Jan 15, 2021

It is pretty painful to set up, that's for sure. You need to be comfortable with the command line so I haven't been able to experiment yet with non-computer friends. I'm going to try to create some sort of one-liner setup script for my friends to use but I haven't gotten there yet.

There are some good instructions here for different systems.

https://ccrma.stanford.edu/software/jacktrip/

ghostpepper · on Jan 15, 2021

For anyone who hasn't seen it yet - this adjacent project sells a complete unit utilizing this software. As far as I can tell it's just a raspberry pi with an ADC/DAC hat, in a metal enclosure, and some sort of cloud configuration interface. They also publish the details and the pi image if you want to build your own, and subscribe to their cloud plans.

https://www.jacktrip.org/studio.html

bit_logic · on Jan 14, 2021

I was thinking recently on how bad modern voice calls are. It's all highly compressed and basically unidirectional, forcing the participants to take turns talking.

The old analog POTS phone system wasn't like this. It had that feeling of almost being in the same room, no latency, true birectional conversation.

I was wondering if there's anything that tries to get close to this old POTS level of quality, at the cost of higher bandwidth and processing power.

Can some of the software mentioned in this thread achieve this?

bombcar · on Jan 15, 2021

Mumble can get very close if you tune it right.

To make it work well you Amy have to turn off lots of helpful processing that modern systems do.

ghostpepper · on Jan 18, 2021

Do you happen to know of any guides to doing this, or what specific settings/optimizations cause the most latency?

titzer · on Jan 15, 2021

> The old analog POTS phone system wasn't like this.

I mostly agree with this, but POTS starting going digital surprisingly early, before VoIP ever became a thing. And of course digitization back then was typically chopping a huge amount of the high frequencies, making things sound...well, like the phone.

sneak · on Jan 15, 2021

Yes, but it was using TDM near exclusively in the digital domain, and not packet switching like we do now, which lead to vastly different levels of service compared to now. The problem isn't digital, the problem is packet switching.

Most people making calls today have no conception of how good telephones used to be.

titzer · on Jan 15, 2021

Sure. Packet switching allowed oversubscribing bandwidth, whereas previous digital telephony actually researched bandwidth (with TDM among other techniques).

newsclues · on Jan 15, 2021

Compressed cellular audio vs good landlines is a world of difference... and the modern cellular sound still sucks

sneak · on Jan 15, 2021

The way to do it now is to make calls using an app via IP; FaceTime Audio is remarkably good even over 4G.

spoonjim · on Jan 15, 2021

Sadly for some moronic reason FaceTime can’t use hardwired Ethernet. I have a reliable 4ms ping and Apple sells Ethernet adapters for iPads but FaceTime won’t use them.

cush · on Jan 15, 2021

Yeah systems like zoom are designed for enterprise where hundreds of simultaneous calls are happening on the same network. They work very hard to make those packets as tiny as possible.

jjulius · on Jan 14, 2021

>Try turning off video...

I'm a musician and I'm incredibly confused by this point. Music instruction cannot possibly be effective without a visual component, regardless of whether it's a solo instruction or a group. An A/V method with a bit of latency is almost always going to be superior to audio-only.

underyx · on Jan 14, 2021

With one group of friends I’ve been doing weekly calls for months now with audio through Mumble for audio, and video through Discord where everyone stays muted.

johannes1234321 · on Jan 15, 2021

Are you doing instructions, including "your hand position isn't right" etc.? For that latency is tolerable, but different alten y for picture and audio is problematic.

underyx · on Jan 15, 2021

Sometimes, but only for drinking games and such.

centimeter · on Jan 15, 2021

As a non-musician, could you explain why?

jjulius · on Jan 15, 2021

How you place your hands, bend your wrists, stretch your fingers and generally hold your instrument can greatly impact how you play it. Depending on the instrument, a simple bend of a wrist can lead you to consistently play out of tune, or make it hard for you to hit a specific note.

Even just touching strings in different ways can produce wildly different results; think playing a harmonic note on a bass guitar, which is how you get such a low-sounding instrument to play a high-pitched note. How you blow air through your horn can greatly change the timbre and mood of your music.

There are a lot more examples, and they are all difficult to correct if you aren't able to see what the musican is doing.

Bud · on Jan 15, 2021

It's similarly essential for teaching voice.

spoonjim · on Jan 15, 2021

The exception here is at the MasterClass level. Fuck the world for autocorrecting that to camelcase.

Igelau · on Jan 15, 2021

Physical matters of technique. Depending on the instrument or lack thereof: grip, placement of fingers, posture, breathing, lip embouchure, and more. Heck, you could even have it put together wrong or be holding it upside down. A remote teacher cannot easily convey this things in a timely manner without seeing the student.

dialamac · on Jan 15, 2021

> Music instruction cannot possibly be effective without a visual component

I suppose blind people will never be successful with music.

jjulius · on Jan 15, 2021

Please re-read my post and note that, at the beginning of the sentence you quoted, I specifically targeted the instruction of music. More specifically, you cannot help teach a blind violinist how to prevent themselves from continuously playing their notes flat without being able to see how they are playing.

Blind people are perfectly capable of learning to play music. Blind people and non-blind people alike can learn on their own, or seek out instruction. It is the latter that we are discussing here.

spoonjim · on Jan 15, 2021

Could a blind person be a music teacher?

jjulius · on Jan 15, 2021

Totally, and I'm sure there have been many, but being able to touch and feel the placement of the musician's hands/fingers/etc. would be important. There are certainly things they could teach without that touch, just like non-blind people could teach a few things remotely sans video, but core areas of playing wouldn't get the right attention.

cush · on Jan 15, 2021

I think they were referring to remote music instruction....

reactordev · on Jan 14, 2021

Latency will always be an issue here. 200ms is enough to throw off the timing of an “all-at-once” note or syncing up sections.

I’ve been a hobbyist musician since leaving school 20 years ago. I play piano, guitar, bass, mandolin, cello, drums, etc. All of the typical video platforms fall short of having a low enough latency to achieve “remote orchestra”.

There was one product/service that I tried that got closer than anyone, JamKazam (https://jamkazam.com/) and there were a few audio only options back mid-2000s that had close to 20-50ms which was better.

nullc · on Jan 14, 2021

> Latency will always be an issue here.

No, it will not. :) You can get to 5ms above the network delay, even using compressed audio (opus), though some sound devices may be picky.

Network delay can be just a few milliseconds even across a city, assuming that no latency murdering devices (wifi or nics in interrupt mitigation mode) are on the path.

This means that you can have lower audio delay from a compressed audio conference crossing your state than is achieved from speed-of-sound delays from a performer sitting on the other side of a moderate sized room!

The future is already here -- it's just not very evenly distributed. (yet)

PaulDavisThe1st · on Jan 14, 2021

> Network delay can be just a few milliseconds even across a city

If you limit your collaboration radius in that way, then sure (though it still depends on the router technology the packets encounter along the way). But if you want to go beyond that, the numbers get bigger. For transcontinental stuff, much much longer.

But thanks for correcting the basic thrust.

nullc · on Jan 15, 2021

One way delay between San Francisco and New York city is 37ms across the public internet today.

This is the same one way delay as sound in air across 12 meters, which would be on the large size for an orchestra pit.

It does get larger as you go further, but live performance with people on the other side of a large continent is completely realistic and has been done. :)

Sure, expecting to go between NYC and someone in china behind the GFW is probably asking too much for a seamless experience. But many people mostly want to work with people in the same country as them ... and for that, with sufficient technology, latency need not be an issue.

PaulDavisThe1st · on Jan 15, 2021

> live performance with people on the other side of a large continent is completely realistic and has been done. :)

In case you're not aware, I'm the original author of JACK, around which JackTrip is built, which is the most likely and reliable tool for such a collaboration. I certainly know people who've done this, and I regret that I didn't realize back in February how useful it would have been to make something like JackTrip into a much easier-to-use tool for computer-naive folks.

37ms cross-country in the USA is optimistic, but certainly possible. US->Europe is not so great.

nullc · on Jan 15, 2021

In case you're not aware, I'm one of the authors of Opus. :P

When the pandemic hit I thought about putting out some pointers to easier to use low latency streaming resources... but... that would require overcoming pandemic lockdown funk. And honestly it's still kind of a technical rats nest to get everything working. It's not exactly musician friendly to need instructions like "next you need to make sure your nic doesn't enable interrupt mitigation when there are more than 100 packets per second...". :)

[I love Jack BTW.]

raphlinus · on Jan 15, 2021

Another example of HN being amazing. In the interest of playing too, I did some work on reducing audio latency in Android, but that all predated AAudio/Oboe, which takes that quite a bit further and makes at least the Pixel devices a plausible platform for low-latency audio collaboration apps as suggested in this thread. I for one would love to see that happen - I miss singing hymns on Sunday mornings.

PaulDavisThe1st · on Jan 15, 2021

Yeah, I'm really sad that I didn't get some serious effort together on this back at the start of the year. There's no lockdown funk here, just too busy working on Ardour.

One of the 2 netjack implementations uses Opus - great stuff.

Absolutely right that the good stuff (i.e. not zoom) requires way too much setup, and even then (as you note) you're not guaranteed reliable function because you don't control all the hops. Soundjack gets the user-side fairly good, but it's still not quite what I think would have really taken off during the pandemic - probably needs a mobile app for that.

spacechild1 · on Jan 15, 2021

I love libopus! I use it as one of the codecs for my low latency streaming library “AOO“ https://git.iem.at/cm/aoo

sonosaurus · on Jan 17, 2021

And I love AOO, I use it as the basis for SonoBus.... I’ve been meaning to contact you about it, and work on merging your latest into my fork (which has diverged a bit over the last few months).

And hey Paul, long time no chat!

https://sonobus.net

reactordev · on Jan 15, 2021

Thank you for JACK btw.

reactordev · on Jan 15, 2021

Latency in this case is cumulative. 1-5ms for DAW encoding of your analog signal, encoding it in whatever codec is used for transmission 1-?ms. Actual transmission 10-1000ms. Decoding the signal into audio channels for playback 1-5ms. Don’t forget this is duplex. You are transmitting as you are also receiving. In an ideal world we could get very close to zero-latency (enough where an orchestra could by in time step with each other).

It sounds do-able. It sounds like we should already be there. Looking around though you’ll see we aren’t because of technical issues described that are systemic of the infrastructure used.

PaulDavisThe1st · on Jan 15, 2021

JackTrip doesn't encode at all, it sends raw PCM over the wire. So that helps.

LeifCarrotson · on Jan 15, 2021

Network delay can be just a few milliseconds across a city, or you can discover, like my brother who teaches elementary music, that your kids have different ISPs and some are awful.

He's been teaching music remotely since August, and one of the first days he demonstrated why they have to be on mute on Zoom by having them all sing happy birthday to one of the students. It was utter chaos. I worked with him to try to find a solution, we tried various tools, but found nothing that worked across cable, DSL, WISP, and cellular connections.

Some tools could sync after the fact so the recording sounded good (if each musician keeps the same tempo), but the whole point is that the kids needed to be able to hear and follow each other, and just one student at 500ms latency totally ruins the tech.

So he teaches music to second graders by zoom, and they're all on mute unless he unmutes them one by one.

telesilla · on Jan 14, 2021

> Latency will always be an issue here

Yes, it will as soon as you are 1000km away. Light speed creates a lot of unresolvable latency. How can you play live between London and Sydney? You need a different musical understanding.

Retric · on Jan 14, 2021

1000km at the speed of light is 3.33 ms which on it’s own isn’t an issue. While fiber is slower, you can have low latency connections over that distance without issue.

The real problem is computer audio isn’t designed for low latency. Generally, as long as video and audio are reasonably synchronized nobody notices, but latency as rarely a consideration on it’s own.

telesilla · on Jan 14, 2021

This shows actual network latency between regions.

https://www.cloudping.co/grid

Even with the state of the art equipment, transcontinental music making will always exclude the kinds of music that most of us play while in the same room together.

Retric · on Jan 15, 2021

I was specifically mentioning 1,000km as the metric.

AWS latency isn’t great between regions. Looking on some other sites you can see London to NYC is 70ms, but that’s also 5,500km which is far behind the 1,000km mentioned. Ex: https://wondernetwork.com/pings/London

70ms /5.5 = 12.7 ms seconds round trip. The speed of sound is only 1,125 ft/s, and half that for round trips so 1,000km over fiber is the equivalent of ~7.14 feet apart in the same room plus whatever latency your personal computer and software adds.

toast0 · on Jan 15, 2021

My DSL line gives me a 20 ms roundtrip to the first hop. My only choices at this address are DSL, LTE (latency seems highly variable, and much more expensive than DSL), or paying at least $50k to get muni fiber installed. So latency will always be an issue here.

Well, I could probably get some form of POTS either analog or ISDN/T1

nullc · on Jan 15, 2021

DSL is pretty much the worst of the wireline services as far as latency goes. Even some of the wireless technologies are better.

If you use sonic, at least in some areas they have a setting where you can turn down the level of forward error correction and get lower latency in exchange for somewhat higher packet loss.

coldtea · on Jan 15, 2021

>Network delay can be just a few milliseconds even across a city

Sure. If you constraint it a little more, you can also just play to the persons in the next room directly (no streaming required).

For country level or worse global collaboration latency will always be a problem however...

lordnacho · on Jan 15, 2021

Jitter is an issue as well. If the packets get routed in different ways the variability in latency might annoy you. Ever had a video call where the person gets really slow and then speeds up again?

nullc · on Jan 15, 2021

For extremely low latency audio, you simply convert most jitter into loss (and the rest into added delay). How annoying the loss depends on the quality of the loss concealment.

If your conferencing is 'slowing down' and 'speeding up' then it's simply not a realtime communications medium.

aeyes · on Jan 15, 2021

> Network delay can be just a few milliseconds even across a city, assuming that no latency murdering devices (wifi or nics in interrupt mitigation mode) are on the path.

Even with fiber all ISPs I have ever used at home have latency murdering devices between my modem and their connection to the Internet. And I have to go through these hops even if I connect to an IP on the same IP.

Best I ever got was 12ms. Once you connect your laptop to the Internet in a data center you will instantly know what I mean.

kayodelycaon · on Jan 14, 2021

Using Wi-Fi in my office adds up to 150ms of latency. (20m cat5 Ethernet cable has < 1ms)

Jonnax · on Jan 14, 2021

Is that with bad signal? Sounds like something is wrong with your setup.

Got a Unifi AC LR and it only adds about 4 to t 6ms over Ethernet.

ipython · on Jan 15, 2021

High jitter also occurs when the signal is marginal and the client scans the WiFi frequencies for other base stations to associate to (on the assumption that maybe you moved and there is another base station with a stronger signal). Since it has to “tune” into the other channels to listen for beacons, there will be a noticeable jitter while it is not tuned to the “current” channel.

This is my understanding as I’m still learning. Fascinating this stuff works at all.

ikiris · on Jan 15, 2021

Not bad signal, but the basis of how wifi shares the spectrum is time distribution.

plorkyeran · on Jan 14, 2021

Mumble is in fact one of those audio-only options which can do 20-50ms latency, which is why the parent suggested it.

jamroom · on Jan 14, 2021

I use Jam Kazam weekly and with a non Wi-Fi connection I get really good results

reactordev · on Jan 15, 2021

I mentioned JamKazam in this thread above about latency. It’s the top of my list for this purpose but still has some issues when the connection quality isn’t good (ISP issues)

ilumanty · on Jan 14, 2021

Some friends of mine had a pretty low latency (sub 50ms) using Jamulus (https://github.com/corrados/jamulus), which is an open source conferencing tool specially made for musicians. Need to self-host though.

laurieg · on Jan 14, 2021

I second this. Mumble across different continents has less latency than Zoom in the same city.

smolder · on Jan 15, 2021

Mumble is probably not the right software given the purpose built ones that exist. Jamkazam is one. There are many. The apparent ignorance about existing tools in this story and comment section is driving me nuts.

banana_maker · on Jan 15, 2021

This is basically HN anytime anything related to audio engineering pops up. I'm sure nobody has thought of trying to address things like latency.

powersnail · on Jan 14, 2021

Visual cue is important in playing music together.

Unless everyone just jam to a click track instead of each other, it's very hard to play together without seeing the body language.

Pxtl · on Jan 15, 2021

I'm sure somebody could figure out a webapp that's nothing but a globally synchronized metronome.

powersnail · on Jan 15, 2021

That's pretty much a click track. It works, but it's a bit boring that way. You can't make phrasing choices as freely.

coldtea · on Jan 15, 2021

Because turning off video makes it useless for a music instruction tele-conference app -- which is all about the instructor being able to see how you sit, how you use your fingers, show you fingerings, point mistakes, and so on.

In other words, they don't want a real time music streaming app, they want a music teaching app. Duh!

ska · on Jan 15, 2021

One of their problems was they wanted video as well. They had already tried separate audio solutions, and found it lacking.

This makes sense if you are trying to teach, you need to see the techniques not just hear them.

jackson1442 · on Jan 14, 2021

Mumble requires running your own servers, and many universities already pay for Zoom licenses. Why not ask your vendor to add a feature to software you're already paying for?

upofadown · on Jan 14, 2021

Few people run their own Mumble server. Instead they rent one for anywhere from free to a few bucks a month.

Pxtl · on Jan 15, 2021

Mumble has excellent audio quality but it's marred by its lack of feedback cancellation. Mumble is basically unusable if anybody isn't using a headset to listen.

Which may be tied to its low latency... I assume there are algorithmic reasons why cancellation needs latency.

That and between the audio setup wizard and its unorthodox auth system, the onboarding process is brutal for non-technical users.

Sesse__ · on Jan 14, 2021

Try Sonobus or JamKazam (the former is FOSS). Actually designed for the “playing together” use case.

brundolf · on Jan 15, 2021

Engineers often discount the value of discoverability and accessibility. I would argue those are the main reasons Zoom became ubiquitous in the first place.

pishpash · on Jan 15, 2021

Because it's for speech. Read the article, it talks about echo cancellation and compression, none of which you want for music.

4f77616973 · on Jan 15, 2021

Riot (Matrix) too!

weinzierl · on Jan 15, 2021

We use OBS Studio in conjunction with the teachers preferred videotelephony software (currently Skype) for the piano lessons of our daughter.

I have defined four scenes in OBS which she can switch with A, S, Z and X, followed by a space bar press.

A is the webcam of the notebook, which sits right above the keyboard of our digital piano. This shows just the semi-portrait of the person playing.

Z shows the view from a second webcam mounted above the keyboard to display the keys and the fingers.

X is the view of just the virtual keyboard from a Synthesia window, where the pressed keys are highlighted.

S is all three previously mentioned scenes combined in one nicely arranged view.

Audio from the digital piano is also a source for scenes in OBS.

OBS provides a virtual camera that can be used like a normal camera in most, but not all conferencing software. For example, the preinstalled Skype in Windows 10 doesn't recognize the virtual cam, but Skype Desktop from skype.com does.

It's not perfect, but - I think - a decent setup, quickly built with only a laptop, one extra webcam and free software.

clan · on Jan 15, 2021

Congratulations. The teacher must be really happy.

My better half teaches online as well. Your setup goes the extra mile and leaves all other setups in the dust. Most parents cannot be bothered to make sure a basic setup is working.

For music education even a $5 lav mic would be a huge help.

crazygringo · on Jan 15, 2021

This is great.

Now I wonder about performing together? Everything I've seen that looks like a live Zoom performance of 10-20 musicians is actually recorded separately (with a click track) and then edited together in a video to make it look like it's Zoom.

But now that the music quality is there... Zoom could conceivably broadcast a click track (e.g. the host's audio that would go to performers' speakers but not the audience's, or even the host being a conductor or lead performer), performers would play "solo" (without hearing each other), but then the central Zoom server would wait for and cache every performer's audio until they were all received, with a suitable buffer, and then output it mixed, to the audience, synchronized to the host's (click-track) audio (timecodes).

And unlike normal Zoom calls where it only ever transmits 1 or 2 audio streams (the loudest ones at the moment), it would always mix all of them.

It's probably complicated enough that it wouldn't be worth Zoom's effort, and also niche enough not to be worth a startup developing a separate product for it... but it certainly seems doable, no?

jefftk · on Jan 15, 2021

Another way to do this is to put everyone in a chain, and have each person be listening to what the person before them was playing a few seconds ago. I wrote some software that does this, and anyone is welcome to play with it: https://echo.jefftk.com

It's open source: https://github.com/gwillen/solstice-audio-test

Here's an example with some informal singing my friends and I were doing this evening: https://echo.jefftk.com/recordings/2021-01-15-023012.wav (it makes ephemeral recordings)

flatcakes · on Jan 15, 2021

Not the same as your idea, but similar is NINJAM, which is tempo synced, but deliberately makes the latency equal to one bar, phrase, or other segment of time. You are always playing live to what everyone else played the very last time around, but you are still playing on the beat together. It obviously works better for repetitive jam sessions than for structured songs, but it looks very interesting. I haven't yet tried it with my music buddies, and should.

[1] https://www.cockos.com/ninjam/

loosetypes · on Jan 15, 2021

Bit unrelated, but I recall an old webpage that had a grid of 10 or so YouTube videos embedded, each with a different instrument playing (they didn’t auto play).

This was pre-mobile, so it was expected that you could play multiple videos at once. I was amazed, and it was the intent, that independent of how few or many videos you played and no matter when you started each of them, all sounds blended together nicely.

I don’t know what to call it when there’s no inter-sound coordination - a timing independent harmony? It was so cool.

Anyways, I would be super interested in attending a zoom concept of that sort that might side step some technical difficulties.

pmiller2 · on Jan 15, 2021

This is correct. The reason is more or less that humans are really, really good at staying in sync, but, even with only 2 parts to listen to, it starts to get almost impossible levels of effort to keep proper time. In person, without any network lag, musicians can make adjustments in real time (e.g. always, always, always follow the soloist, even if the soloist is off tempo or off pitch). The conductor is listening to the soloist, too, so you don't need to worry about conflicting directions.

Here are a couple of examples I know of on YouTube where a single musician plays multiple parts, which are then edited together:

Now professional cellist, graduating senior at time of recording, Sarah Chaffee playing an original transcription of Alice Cooper's Poison with 5 cello parts (with a short guest appearance by Shostakovich at about 2:42, then showing up again in the outro to play us out at 3:33): https://www.youtube.com/watch?v=PjYwQHFsld0

Professional cellist Wells Cunningham playing the Handel- Halvorsen Passacaglia: https://www.youtube.com/watch?v=NseBdxfHk5k

This guy has a really, uhm, interesting violin technique. ;) Just for the fact that he chose to play the instrument the way he did tells me he does not actually know how to play violin. What he's doing is playing it like a teeny tiny cello. Since the notes are so much closer together on the respective fingerboards on a violin than on the cello, I'm betting he needed an extra take or 2 on that. His performance is passable, but nothing when put up against the performance of an actual, professional violinist.

smolder · on Jan 15, 2021

Zoom is not the right software for this. It already exists. There are many online streaming musical collaboration tools, including free ones. Jamkazam is one example.

singingfish · on Jan 15, 2021

My band did that for a while. It was a lot of work for the producers.

We also played a specially composed piece for playing over zoom. We each recorded separately then sent our files in later. But it was live. And the music was super-simple.

galaxyLogic · on Jan 15, 2021

Possible solution: First everybody plays to the same click track without hearing each other. Then everybody plays the same thing again but this time hearing what others played during the first round.

This can be repeated as many times as wanted. On 3rd time you will hear what others played during the 2nd round and that is presumably better played than what they played the first time when they only heard the click-track.

Thus you can "jam" with the other musicians the way they were last time around.

travisjungroth · on Jan 15, 2021

Imagine something like this but as an endless jam session. People choose who to turn on and off and can go through it as many times as they like.

vbsteven · on Jan 15, 2021

Something like a live recording session? Where the musicians input is synchronized and gets streamed into the DAW of each participant so they can mix their own version. This could be fun.

During the first lockdown I watched a demo video of a remote VST plugin combo that allowed the recording artist to record his vocals, and the recording engineer to manipulate the audio each from their own homes.

travisjungroth · on Jan 15, 2021

Yes, but sort of decentralized. I can play with any combination of recordings that were recorded over each other. There’s some neat graph theory stuff in there.

jdprt · on Jan 15, 2021

Endlesss is trying to do basically that https://endlesss.fm/products/studio

There is an iOS version available now too, which has inbuilt synths and effects but also allows external sources. That in essence means people are doing exactly that!

The app is alot of fun even with little musical ability. With studio it seems they're trying to achieve your vision but in the DAW.

cheschire · on Jan 15, 2021

The flaw here is when one musician makes a timing mistake, the others may play off it as they naturally would while jamming, but then the mistake is compounded. Similar to a copy of a copy, the flaws are amplified.

TheSpiceIsLife · on Jan 15, 2021

That's how studio records are typically done, as per Areading314's comment here https://news.ycombinator.com/item?id=25787241

pgreenwood · on Jan 14, 2021

During lockdown I was able to have music jams with high quality uncompressed audio at 10ms (effectively real time), using Jamulus for the audio, and Zoom for the video. Then the whole thing was live streamed to Twitch.

wmwmwm · on Jan 14, 2021

Me too - jamming at about 15ms lag with Jamulus felt really natural. In fact after months of talking to other people through zoom, just chatting with my friends with low latency, high quality, and without the auto processing/muting that zoom seems to do, was amazing. Conversation just flowed back and forth better!

There’s a bunch of similar bits of software but none of them are super welcoming for the less technically inclined. I guess that the experience is often going to be limited by network performance (specifically jitter and loss on UDP) so it’s a challenging thing to make super slick/consistent. I think that low latency 5G with latency guarantees might help solve it when its widely available though.

MrManatee · on Jan 15, 2021

I can also report a positive experience with Jamulus. There's probably people in this discussion who have only experienced the latency through Zoom or Skype, and assume it's about as good as you can get over the internet. It's not even close.

If you have special software like Jamulus, non-wireless internet, players geographically close to each other, and an external audio interface, the latency is surprisingly good. If you have most but not all of that, it can still be usable. Our end-to-end latency was around 30 ms, so not as low as parent's, but we were still able to play even "jam songs" where just followed each other for chord changes instead of having agreed them beforehand.

patal · on Jan 15, 2021

This is my experience with Jamulus, too. However, to get down to 10-40ms, you need to setup your tech as follows.

- ethernet instead of wlan

- dedicated audio interface, takes calculation overhead off your computer

- a proper microphone, improves acoustics greatly

All of these steps will increase Zoom/Skype experience, too. However, Zoom/Skype will hardly go below 100s of ms of latency, just by themselves.

Analog24 · on Jan 15, 2021

How far apart (roughly) were you and your bandmates? I'm guessing in the same town/city?

pgreenwood · on Jan 15, 2021

We were within about 10kms. Ethernet connection, no wifi.

ndiscussion · on Jan 14, 2021

I find it sad that schools are having to use for-profit foreign-owned services like Zoom. If I had a child, I would have severe qualms with it.

carabiner · on Jan 15, 2021

> If I had a child, I would have severe qualms with it.

Yeah, I'm skeptical. If you had a child, you'd be so spread thin with keeping your kid healthy and safe, while paying for it all, that the licensing of some software they use would be the least of your concerns. You'd be worried that his girlfriend might be manipulative, if he's smoking too much weed, if he's partying too much, if he is being bullied, or if saving up for the PS5 is a wise use of his summer job funds, most likely.

bluGill · on Jan 15, 2021

I have 3 kids, and I'm glad those are not my worries. Marrying the right person helps. (though there is some luck there)

My worries is while I'm working with one kid the other will start watching Thomas the train or the like on youtube.

isatty · on Jan 15, 2021

What's wrong with Thomas the tank engine?

eindiran · on Jan 15, 2021

I think it has to do with this sort of issue: https://www.thesun.co.uk/news/1418668/kids-left-traumatised-...

For the last few years, there have been complaints that Youtube Kids has issues where the algorithm stumbles onto content using cartoon characters in ways that are disturbing for young kids.

bluGill · on Jan 15, 2021

Hours of that instead of the school work they are supposed to do.

Plus they run around acting like the naughty diesel 10. Instead of imanginative play.

closeparen · on Jan 15, 2021

Do you think imaginative play is supposed to be from whole cloth? I thought that was half the point of children’s media, to give raw material for children to synthesize into imaginative play.

bluGill · on Jan 15, 2021

My kids play very different when they are not given media. They can create their own stories. They dig in the mud instead of grabbing some dirt and calling it thomas.

kfarr · on Jan 15, 2021

I loved Thomas as a kid and will let my kids watch it, but if you actually watch the plots they have very odd morals to the stories and are generally male chauvinist. But trains and it's generally harmless IMHO as long as you discuss the shortcomings with your children.

koolba · on Jan 14, 2021

I find it mind boggling that US government entities use a service run by company with such deep ties with China.

flukus · on Jan 14, 2021

I think both are a rightful condemnation of previous video conferencing "solutions".

evad3r · on Jan 15, 2021

I can tell you right now they're just gonna default to Zoom as that's the service that is mostly talked about, and probably the easiest to set up and use. Even my grandmother knows how to set up a Zoom meeting.

eunos · on Jan 15, 2021

What? Zoom HQ is in Cali.

ndiscussion · on Jan 15, 2021

I'm not highly educated on the matter, but here's an instance of a Zoom employee illegally working with the CCP (and other zoom employees) to crack down on dissidents: https://fortune.com/2020/12/18/zoom-china-employee-charged-c...

ineedasername · on Jan 14, 2021

Many use google classroom, which is at least US owned, but then probably mines significantly more data about the behavior of minors.

midasuni · on Jan 15, 2021

Microsoft is a for profit foreign owned service. So is google. And Apple. And facebook.

kiwijamo · on Jan 14, 2021

As a teacher of the Deaf, teaching in sign language, I've found the video quality of various video conferencing platforms to be very variable. I wish Zoom had more tuning options for Video so we can optimize our video streams for better video clarity (e.g. tweaking FPS or enforcing a minimum bitrate/resolution/quality/etc for the video stream). Interesting to know there are also issues for the audio stream as well.

ZoomZoomZoom · on Jan 15, 2021

I also recommend Jamulus (https://jamulus.io/) for really low-latency audio. It got a fair bit of attention and development done during 2020 and now works better than ever.

I regularly stumbled upon open jamulus rooms which some music teachers used for their private lessons. They usually spend a lot of time (painfully so) explaining the students how to connect everything and use Zoom simultaneously for video.

hungryhobo · on Jan 15, 2021

Piano is a notoriously difficult instrument even for regular music production, just because of its broad frequency range and harmonics are a pain to deal with. So kudos for zoom for at least making it sound palatable.

I do wish there are more examples for other types of instruments or even voices. Guess i'll have to try it out myself.

klodolph · on Jan 15, 2021

> Piano is a notoriously difficult instrument even for regular music production…

Er, sort of. I think there’s some incorrect information in this comment.

Piano is an acoustic instrument, and to record an acoustic instrument well, you generally need to have a decent instrument in a good acoustic space with good microphone placement. For most people, getting all three is quite hard. Microphone placement for piano ain’t exactly easy but it’s not really harder than, say, guitar or violin. The hard part is that you inevitably capture a lot of ambiance from the room. With a room that sounds bad or with poor mic placement you’re going to get an inferior piano sound.

What makes, for example, guitar easier is really just the fact that if you have any decent room somewhere, you can fit somebody with a guitar into it. It’s small & portable. You still need a good instrument, a good room, and good mic placement but you don’t need to have the guitar permanently set up.

The frequency range of a piano & its harmonics are not especially difficult to tame in music production.

There are some instruments that are kind of “easier” in a sense, like vocals, electric guitar, or electric bass—but I don’t think the differences are all that stark. The main difference is that you are much more likely to use only a single microphone for these instruments rather than multiple mics, and you’re more likely use multiple mics for piano / acoustic guitar / violin / etc.

If there’s one common instrument that requires the most effort in music production I’ll say that it’s the humble drum kit, hands down. If you’re in a home studio, the drum kit might spur you to add more channels, make you work harder setting up the instrument and mics, and make you do more work with acoustic treatment.

robotmay · on Jan 15, 2021

Ooh something I have direct personal experience in. I currently use Zoom once a week for a music lesson. The latency is poor but my friend is not super great with computers so it's a reasonable compromise.

Now _playing_ together is much more challenging, and not something especially viable for students (which is why it's a bit weird targeting this at Zoom): you definitely need a proper audio interface, most likely with ~5ms of latency. You then also absolutely have to be on a wired network connection, which on most modern laptops probably requires another adapter, as wireless introduces too much latency. Overall you need < 25ms of latency in order to be able to play "live" together. This is doable but you need the right tools.

I'm a folk musician who plays in pubs and you can even see the effects of latency in a real-life setting like that - if the room is too large then people on the other side start to drop out of time. What I do in this situation is follow their hands rather than what I can hear. Unfortunately this too doesn't work online due to video latency being even worse. Metronomes don't really work for our music because it doesn't follow a standard BPM, it can and should vary.

We did manage a few sessions using JamKazam (https://jamkazam.com/) which works well enough with the tweaks listed above. It's largely restricted to people who know what they're doing, sadly.

ineedasername · on Jan 14, 2021

I can use products like Geforce now and manually best my server to a location a few thousand miles away. Ping time increases from 10/15 ms to about 70ms, and for anything but twitch gaming I can get high fidelity graphics and controls & sound synced indistinguishably from playing on my local PC.

This is not a technology bottleneck, it is an implementation bottleneck. Zoom has benefited from the WFH and remote learning movement, but the market is ripe for disruption with a higher quality experience.

qmmmur · on Jan 14, 2021

There is so much low hanging fruit in zoom they just refuse to add options for. The "turn on original audio" is fairly good at removing noticeable DSP processing but the audio can offer suffer from issues due to compression.

mnw21cam · on Jan 15, 2021

I have been trying to follow the instructions on Zoom's web site to turn on original audio (because I always use headphones on a call), but the options just aren't where the web site says they are. How did you manage it?

qmmmur · on Jan 15, 2021

To my knowledge sometimes are setting are locked behind an institution's setup. You might need to log in to the Zoom website where your preferences are and see if there is a setting to enable advanced settings or something to that effect.

Then prior to screen sharing or audio sharing there is a button on the top left for "original audio".