Use instructions to create a machine learning python notebook to train and call GPT-J model

Chiuso Pubblicato 2 anni fa Pagato alla consegna
Chiuso Pagato alla consegna

I'm tired of negotiating infrastructure and TPUs and GPU and CUDA errors. I want code that works.

The deliverable for this project may be either of the following:

A Google Colab notebook (shared with me so I can make a copy)

A Paperspace Gradient python notebook (shared with me so I can make a copy)

A .ipynb notebook I can upload to one of those platforms. (Specify which platform. It has to work there.)

I have Google Colab Pro and/or am fine using paid instances on Paperspace. Your solution does not have to run for free. It just has to run.

Either as the first cell in the notebook or as a separate document, provide step-by-step instructions for setting up the environment. What do I need to select for the connection? Etc.

I will also accept ipynb code and instructions for some other paid service (AWS, Azure, Google Cloud, etc) but the instructions need to start from square one with the platform: What clusters do I need to set up? Where do I put this python code? How do I connect it to the training data and python code? Etc.

The notebook should be the code to create an instance of the GPT-J transformer-based text generation model, located here:

[login to view URL]

Program functionality:

Download/install GPT-J model and dependencies

Prepare the data (attached csv) for training

Train the model on the data

Save the tuned model

Call the tuned model

Using any sample prompt

The creators of the model also have a sample Collab notebook showing how to install and run the pre-trained model here:

[login to view URL]

Your notebook should then follow the fine-tuning instructions here: [login to view URL]

(The github training example above uses Google Cloud TPUs. You can use any of the above platforms.) Your instructions on how to perform the steps like creating the proper project, installing dependencies should be more detailed than the link above. Document every step so it is very clear.)

In text cells of the notebook, include detailed instructions on where to upload the training data. Then your code should prepare the data for training.

Your code should then execute fine-tuning on the attached data set for 2 epochs.

Acceptance criteria:

The deliverable is not the fine-tuned model. The deliverable is the notebook and any instructions, so that I can do this repeatably myself.. I will test your code by running the notebook code on the platform you indicate, using the instructions you give me. If I am successfully able to execute the steps above:

Download/install GPT-J model and dependencies

Prepare the data (attached csv) for training

Train the model on the data

Save the tuned model

Call the tuned model on a sample prompt.

Then the project is a success. I will not be evaluating the project based on the quality of the response generated in step 5, only that I can successfully run the code.

Note: The attached file is utf-8 format, but I think it may have been saved with a Byte Order Mark (BOM) for easy reading into Windows apps. Opening it with encoding='utf-8-sig' will ensure python can read it properly.

Additional resources:

You may find the notebook linked here helpful as well.

[login to view URL]

Python Machine Learning (ML) Servizi web di Amazon Cloud Computing Google Cloud Platform

Rif. progetto: #31012966

Info sul progetto

14 proposte Progetto a distanza Attivo 2 anni fa

14 freelance hanno fatto un'offerta media di $262 per questo lavoro

taranchenkovlady

Vladyslav is here. Andrea C I have +6 years of experience in Machine Learning && Artificial Intelligence && Website development. Especially I am good in: - Python/Django, JavaFx, Java spring boot,C++, C#, QT, ASP.NET, Altro

$200 USD in 3 giorni
(4 valutazioni)
4.8
iPingDataLabsLLP

Hi, Greetings for the day! We are a team of passionate Machine Learning Engineers based in Mumbai, India, who have developed and deployed complex AI solutions using Machine Learning and Computer Vision. We have a g Altro

$500 USD in 7 giorni
(8 valutazioni)
4.6
KolaPeters

"Use instructions to create a machine learning python notebook to train and call GPT-J model" Hello, I’m a Data Scientist and Machine Learning engineer. My area of interest is statistical analysis of datasets/images Altro

$30 USD in 5 giorni
(5 valutazioni)
4.0
Sandeep2805

Hello, Dear Client. Thanks for your posting! I am a machine learning expert with over 10 years of experience in tensorflow, darknet, keras, pytorch, opencv and open vino, etc. I have developed lots of LPR projects for Altro

$140 USD in 7 giorni
(6 valutazioni)
3.5
amrmooohamed

Hello There, I have a profound practical knowledge in Machine learning, Deep learning, Computer vision, and NLP. Kindly find below a brief description of my skills and past projects. Waiting to discuss the job role wi Altro

$250 USD in 7 giorni
(0 valutazioni)
0.0
blysh

Hi, there. Good day! Thanks for job posting. I've read your job requirements carefully and am very interesting in your project. *******************Why choose me?************** - My skill is match as your requirement Altro

$140 USD in 7 giorni
(0 valutazioni)
0.0
shafiqueqadri

Hi, I can help you with this. Since everything is provided in your job posting and I can begin working as soon as you want. I'm an ML expert with 5 years of experience. This job right in my scope. So let's jump to c Altro

$200 USD in 7 giorni
(0 valutazioni)
0.0