I know the authors of the blog post quite well. Say what you will about the firm, but one of the authors has been investing in machine learning since 2016, and another has a PhD in CS (including a SIGCOMM test of time award!)
I come from a strong ML background (multiple publications, PhD dropout), and I would say that the canon is actually quite good.
> one of the authors has been investing in machine learning since 2016
Ditto.
I have been doing something (in another field) since the mid '90s, and I would say most people would consider me an expert. I get referrals for what I do from 'top' people and investors in tech, and I went to what most would consider 'a top college'. Still, I would never want to be positioned as right, or as an expert, because of the amount of time I've spent doing something, the college I went to, or who trusts me; I'd want it to be because of actual things I have done that point to my expertise (not a halo of some type).
I wouldn't go that far. I know the authors quite well, and as someone with multiple publications at machine learning conferences (and who started a PhD in ML), I can say they know their stuff.
Yeah, I strongly agree. While Nvidia is working on better hardware (and they're doing a great job at it!), we believe that better training methods should be a big source of efficiency. We've released a new PyTorch library for efficient training at http://github.com/mosaicml/composer.
Our combinations of methods can train models ~4x faster to the same accuracy on CV tasks, and ~2x faster to the same perplexity/GLUE score on NLP tasks!
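For a concrete flavor of what "better training methods" can look like, here is a minimal plain-PyTorch sketch of two such methods (progressive image resizing and label smoothing). This is a generic illustration under my own assumptions, not Composer's actual API; see the repo above for the real library.

    import torch
    import torch.nn.functional as F
    from torchvision import models

    # Generic sketch of two common training speed-ups; NOT the Composer API.
    model = models.resnet50(num_classes=10)
    optimizer = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)

    def train_epoch(loader, epoch, total_epochs):
        # Progressive resizing: start with small images and grow toward full
        # size, so early epochs are much cheaper at little cost to accuracy.
        size = int(128 + (224 - 128) * epoch / max(total_epochs - 1, 1))
        for images, labels in loader:
            images = F.interpolate(images, size=(size, size), mode="bilinear",
                                   align_corners=False)
            logits = model(images)
            # Label smoothing (PyTorch >= 1.10) regularizes the targets and
            # often reaches the target accuracy in fewer steps.
            loss = F.cross_entropy(logits, labels, label_smoothing=0.1)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()

The real library stacks many such methods and handles the scheduling for you; the point is just that algorithmic changes, not only hardware, are where the claimed speed-ups come from.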
I looked at the headers returned from the server using developer tools. 503 is "Service Unavailable" and you often see it when a proxy can't reach the backend server.
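If you want to check the same thing outside the browser's developer tools, a small Python sketch with the requests library does it (the URL here is just a placeholder):

    import requests

    # Print the status code and response headers; a 503 whose headers come
    # from a proxy/load balancer (e.g. "Server: nginx", "Via: ...") suggests
    # the proxy answered because it couldn't reach the backend.
    resp = requests.get("https://example.com", timeout=10)
    print(resp.status_code)  # e.g. 503
    for name, value in resp.headers.items():
        print(f"{name}: {value}")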
What do you think about the train/test discrepancy? I.e., will practitioners have to fine-tune Nubia's models on their training dataset in order to evaluate on their test dataset?
- The general dataset used to train the language model before it is fine-tuned to extract semantic similarity, logical entailment, and grammaticality (i.e., Wikipedia)
- The dataset used to fine-tune the semantic similarity module and logical inference scorer
- The dataset used to predict human judgement
So far, the experiments have actually shown that, without any fine-tuning, the NUBIA model trained to assess machine translations does better at agreeing with human judgement for image captions than the metrics specifically designed to assess image captions.
For more advanced cases like, say, scoring medical reports where, for example, grammaticality doesn't matter as much, it may have to be fine-tuned. This is not unlike human training actually where experts are trained on "what to look for".
The nice thing about this modular architecture and the interpretable scores is that it provides a lot of flexibility to study individual components and their emergent properties, and to make a judgement call on whether or not to fine-tune.
The aggregators in Nubia are pretrained to correlate with human judgement, so for now it should only be used for inference, but the idea is that you can use it as a loss function to optimize translation/image captioning/summarization. It's too big for that as is, but that's what we're working towards.
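To make the modular picture concrete, here is a rough, hypothetical sketch of that kind of architecture: a few interpretable feature scores feeding a small learned aggregator. None of the names or implementations below are the actual NUBIA code; the feature extractors are stand-in placeholders for the pretrained models.

    import torch
    import torch.nn as nn

    # Hypothetical placeholders for the pretrained sub-scorers.
    def semantic_similarity(ref: str, cand: str) -> float:
        # Stand-in: a real system would use a pretrained sentence encoder.
        ref_set, cand_set = set(ref.split()), set(cand.split())
        return len(ref_set & cand_set) / max(len(ref_set), 1)

    def logical_entailment(ref: str, cand: str) -> float:
        # Stand-in: a real system would use an NLI model's entailment score.
        return 1.0 if cand in ref or ref in cand else 0.5

    def grammaticality(cand: str) -> float:
        # Stand-in: a real system would use a language model's fluency score.
        return 1.0

    class Aggregator(nn.Module):
        # Small learned head mapping the interpretable features to one score;
        # in the real metric this part is trained to match human judgement.
        def __init__(self, n_features: int = 3):
            super().__init__()
            self.mlp = nn.Sequential(nn.Linear(n_features, 16), nn.ReLU(),
                                     nn.Linear(16, 1))

        def forward(self, features: torch.Tensor) -> torch.Tensor:
            return self.mlp(features)

    def score(reference: str, candidate: str, aggregator: Aggregator) -> float:
        features = torch.tensor([[semantic_similarity(reference, candidate),
                                  logical_entailment(reference, candidate),
                                  grammaticality(candidate)]])
        with torch.no_grad():  # inference only, as described above
            return aggregator(features).item()

    # Untrained here, so the number is meaningless; it just shows the flow.
    print(score("the cat sat on the mat", "a cat sat on a mat", Aggregator()))

Keeping the features exposed like this is what lets you study or re-weight an individual component (say, grammaticality for radiology reports) without retraining the whole metric.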
I think the question here is more along the lines of: "If I now have, say, radiology reports, do I use Nubia out of the box, or do I need to make it read radiology reports and get a sense of what high-quality radiology reports look like before using it?"
Jenks High School class of 2016, MIT class of 2020. Honestly, Jenks was an incredible high school to go to since it was public; having a $20M Math and Science center let me go to places like MIT.
My high school having a brand new computer lab completely rebuilt every year, an agricultural sciences program that put most corn country colleges to shame, and a $xx million a year budget surplus let me drop out at 16.
It _sounds_ like he plans on quitting his job and trying to "get by" for a while, hence the need for a cheaper cost of living. While Cambridge would definitely scratch the intellectual itch, I wouldn't exactly call it "cheap"...
Freshman at MIT here taking this class -- the lectures are actually taught in a flipped classroom format, so I wouldn't imagine they would release the course considering there are no lectures to follow. I could see them releasing problem sets, however.
Anyone have thoughts/ideas on what people need that developers could create for times like these? I.e., what are the biggest problems in today's political climate that a developer may be able to solve?