DistilGPT-2 and generating a descriptive summary of a Strava activity

#15

by robinverma - opened Nov 20, 2024

I'm new to this area and wondering whether it is possible to train distilgpt2 on a training set with entries like the one below (assuming enough variety in activity types, statistics, and the way the output is phrased). I have around 1,800 such entries for training:

{"input": "Date: 2014-12-20 08:16:45+0:0, Timezone: (GMT+05:30) Asia/Kolkata, Athlete: Sammy, Gender: Male, Sport: Run, Activity: 10.13 km, Elapsed: 1.10 hrs, Moving: 1.09 hrs, Elevation Gain: 0.0 m, Kudos: 0, Avg Pace: 6.45 min/km, Max Pace: 4.27 min/km, Photos: 0", "output": "2014-12-20 08:16:45+0:0 saw Sammy went for a run with a distance of 10.13 km with 0.0 m meters of elevation gain. It took 1.10 hrs, including 1.09 hrs of moving time. They garnered 0 kudos. Maintaining an average pace of 6.45 min/km , their fastest pace was 4.27 min/km. No photos were taken during this activity"}

I expect that if I provide data like the "input" above, I get a response similar to the "output".
However, when I provide input in that format, the response is just an echo of the input.
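
For context, here is a minimal sketch of how one entry might be joined into a single training string for a causal language model, and how an echoed prompt could be stripped from decoded text. Decoder-only models such as GPT-2 return the prompt followed by the continuation, so some echo is expected unless it is removed. The separator `SEP` and the helper names `to_training_text`, `to_prompt`, and `strip_prompt` are assumptions made for illustration, not part of my actual training script, and the sample entry is a shortened version of the one above.

```python
import json

EOS = "<|endoftext|>"   # GPT-2's end-of-text token
SEP = "\nSummary: "     # hypothetical separator between input and output (an assumption)


def to_training_text(entry: dict) -> str:
    """Join input and output into one string the model can learn to continue."""
    return entry["input"] + SEP + entry["output"] + EOS


def to_prompt(activity: str) -> str:
    """Build the inference prompt: the input plus the same separator used in training."""
    return activity + SEP


def strip_prompt(decoded: str, prompt: str) -> str:
    """Drop the echoed prompt and anything after the end-of-text token."""
    if decoded.startswith(prompt):
        decoded = decoded[len(prompt):]
    return decoded.split(EOS, 1)[0].strip()


# Shortened entry in the same {"input": ..., "output": ...} shape as the data above.
entry = json.loads(
    '{"input": "Sport: Run, Activity: 10.13 km", '
    '"output": "Sammy went for a run of 10.13 km."}'
)
text = to_training_text(entry)
prompt = to_prompt(entry["input"])

# Simulate a decoded generation that echoes the prompt before the summary.
summary = strip_prompt(text, prompt)
print(summary)
```

With this layout the model sees where the input ends and the summary begins, and the end-of-text token marks where generation should stop. Whether this matches the cause of the echo in my case is not confirmed.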

Also, attached are the loss curves from training and evaluation.

Any suggestions would be appreciated.
