• prenatal_confusion@feddit.org
    link
    fedilink
    arrow-up
    3
    ·
    9 months ago

    Honest Questions Can a model be open source? Dont we need the data it was trained on to truly be able to reproduce, understand and modify it?

    • lordnikon@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      1
      ·
      9 months ago

      No a model is a model now if you are talking a chat bot or ai app yeah you need the data and that data changes all the time but if I have the model I can supply my own and get simalr results to their model if the data I need is the same. The code is right here https://github.com/deepseek-ai/DeepSeek-V3?tab=readme-ov-file I can get the models from https://huggingface.co/deepseek-ai/DeepSeek-V3-Base and train them further and redistub that model legally. Per

      Section III: CONDITIONS OF USAGE DISTRIBUTION AND REDISTRIBUTION

      1. Distribution and Redistribution. You may host for Third Party remote access purposes (e.g. software-as-a-service), reproduce and distribute copies of the Model or Derivatives of the Model thereof in any medium, with or without modifications, provided that You meet the following conditions: a. Use-based restrictions as referenced in paragraph 5 MUST be included as an enforceable provision by You in any type of legal agreement (e.g. a license) governing the use and/or distribution of the Model or Derivatives of the Model, and You shall give notice to subsequent users You Distribute to, that the Model or Derivatives of the Model are subject to paragraph 5. This provision does not apply to the use of Complementary Material. b. You must give any Third Party recipients of the Model or Derivatives of the Model a copy of this License; c. You must cause any modified files to carry prominent notices stating that You changed the files; d. You must retain all copyright, patent, trademark, and attribution notices excluding those notices that do not pertain to any part of the Model, Derivatives of the Model. e. You may add Your own copyright statement to Your modifications and may provide additional or different license terms and conditions - respecting paragraph 4.a. – for use, reproduction, or Distribution of Your modifications, or for any such Derivatives of the Model as a whole, provided Your use, reproduction, and Distribution of the Model otherwise complies with the conditions stated in this License.

      Use-based restrictions. The restrictions set forth in Attachment A are considered Use-based restrictions. Therefore You cannot use the Model and the Derivatives of the Model for the specified restricted uses. You may use the Model subject to this License, including only for lawful purposes and in accordance with the License. Use may include creating any content with, finetuning, updating, running, training, evaluating and/or reparametrizing the Model. You shall require all of Your users who use the Model or a Derivative of the Model to comply with the terms of this paragraph (paragraph 5).

      • prenatal_confusion@feddit.org
        link
        fedilink
        arrow-up
        3
        ·
        9 months ago

        Okay let’s talk terminology

        I meant the useable application as a for example chat bot.

        The model itself is useful as far as it enables me to understand the underlying logic of the whole thing right? But without the data and I guess a training period with it I can’t use it if I understood it correctly.

        If I am right with the above this wouldnt make it useful as in I can reproduce it as a functioning application and I wouldn’t call that opensourve in my understanding of the word.

    • Swedneck@discuss.tchncs.de
      link
      fedilink
      arrow-up
      2
      ·
      edit-2
      8 months ago

      sure a model could theoretically be open source, but in this case that source would be 1) the software to train the model and 2) the training data, and good fucking luck getting either of those honestly.

      the AI sphere has co-opted the term “open source” because it sounds nice, what they actually mean is just “a free copy”, a free copy of deepseek is available for download.

      They seem to be able to fiddle a lot with the models, but in the end calling it open source is just blatantly not fucking true.