About as open source as a binary blob without the training data
-
[email protected]replied to [email protected] last edited by
Just look at blender vs maya for example.
-
[email protected]replied to [email protected] last edited by
Never have I used open source software that has achieved that, or was even close to achieving it. Usually it is opinionated (you need to do it this way in this exact order, because that's how we coded it. No, you cannot do the same thing but select from the back), lacks features and breaks. Especially CAD - comparing Solidworks to FreeCAD for instance, where in FreeCAD any change to previous ops just breaks everything. Modelling software too - Blender compared to 3ds Max - can't do half the things.
-
[email protected]replied to [email protected] last edited by
The training data would be incredible big. And it would contain copyright protected material (which is completely okay in my opinion, but might invoice criticism). Hell, it might even be illegal to publish the training data with the copyright protected material.
They published the weights AND their training methods which is about as open as it gets.
-
[email protected]replied to [email protected] last edited by
They could disclose how they sourced the training data, what the training data is and how you could source it. Also, did they publish their hyperparameters?
They could jpst not call it Open Source, if you can't open source it.
-
[email protected]replied to [email protected] last edited by
You downloaded the weights. That's something different.
-
There were e|forts. Facebook didn't like those.
-
That sounds like a segment on “My 600lb Life”
-
[email protected]replied to [email protected] last edited by
I reckon C++ > Delphi
-
[email protected]replied to [email protected] last edited by
You can do sneaky things with weights that are virtually undetectable.
-
magic_lobster_partyreplied to Fushuan [he/him] last edited by
Ok I understand now why people are upset. There’s a disagreement with terminology.
The source code for the model is open source. It’s defined in PyTorch. The source code for it is available with the MIT license. Anyone can download it and do whatever they want with it.
The weights for the model are open, but it’s not open source, as it’s not source code (or an executable binary for that matter). No one is arguing that the model weights are open source, but there seem to be an argument against that the model is open source.
And even if they provided the source code for the training script (and all its data), it’s unlikely anyone would reproduce the same model weights due to randomness involved. Training model weights is not like compiling an executable, because you’ll get different results every time.
-
[email protected]replied to [email protected] last edited by
I may misunderstand, but are the weights typically several hundred gigabytes large?
-
[email protected]replied to [email protected] last edited by
Even worse is calling a proprietary, absolutely closed source, closed data and closed weight company "OpeanAI"
-
Is it even really software, or just a datablob with a dedicated interpreter?
-
[email protected]replied to [email protected] last edited by
That's fat shaming
-
[email protected]replied to [email protected] last edited by
For neural nets the method matters more. Data would be useful, but at the amount these things get trained on the specific data matters little.
They can be trained on anything, and a diverse enough data set would end up making it function more or less the same as a different but equally diverse set. Assuming publicly available data is in the set, there would also be overlap.
The training data is also by necessity going to be orders of magnitude larger than the model itself. Sharing becomes impractical at a certain point before you even factor in other issues.
-
[email protected]replied to [email protected] last edited by
I mean that's all a model is so....
Once again someone who doesn't understand anything about training or models is posting borderline misinformation about ai.Shocker
-
[email protected]replied to [email protected] last edited by
Hell, for all we know it could be full of classified data. I guess depending on what country you're in it definitely is full of classified data...
-
[email protected]replied to [email protected] last edited by
Yet another so-called AI evangelist accusing others of not understanding computer science if they don't want to worship their machine god.
-
[email protected]replied to [email protected] last edited by
You don't download the training data when running locally. You are downloading the sheet baked model.
-
[email protected]replied to [email protected] last edited by
Praise the Omnisiah! ... I'll see myself out.