Skip to content

Commit 975d164

Browse files
Lingjiao ChenLingjiao Chen
authored andcommitted
update ReadMe
1 parent fe7eb0b commit 975d164

File tree

2 files changed

+18
-2
lines changed

2 files changed

+18
-2
lines changed

README.md

Lines changed: 18 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,8 +17,8 @@ performances of GPT-4 and GPT-3.5 can vary substantially over time, and for the
1717
</p>
1818

1919

20-
What are the main findings? In a nutshell, there are many interesting performance shifts over time. For example, GPT-4 (March 2023) was very good at identifying prime numbers (accuracy 97.6%)
21-
but GPT-4 (June 2023) was very poor on these same questions (accuracy 2.4%). Interestingly
20+
What are the main findings? In a nutshell, there are many interesting performance shifts over time. For example, GPT-4 (March 2023) was very good at identifying prime numbers (accuracy 84.0%)
21+
but GPT-4 (June 2023) was very poor on these same questions (accuracy 51.1%). Interestingly
2222
GPT-3.5 (June 2023) was much better than GPT-3.5 (March 2023) in this task. We hope releasing the datasets and generations can help the community to understand how LLM services drift better. The above figure gives a quantatitive summary.
2323

2424
## 🚀 Reproducing the Results
@@ -41,10 +41,26 @@ The above figure shows the first few rows in the ```generation/PRIME_EVAL.csv```
4141

4242
## 📚 Read More
4343

44+
You can get an overview via our Twitter threads:
45+
46+
[**Introducing LLM Drifts**](https://twitter.com/matei_zaharia/status/1681467961905926144) (July 18, 2023)
47+
48+
[**Additional Explanations**](https://twitter.com/james_y_zou/status/1681519439613956099) (July 23rd, 2023)
4449

4550
You can find more details in the academic paper:
4651
* [**How Is ChatGPT’s Behavior Changing over Time?**](https://arxiv.org/pdf/2307.09009.pdf)
4752

53+
54+
## 📣 Updates & Changelog
55+
56+
### 🔹 [2023.08.01] - Added Tasks & Expanded Queries & Analysis
57+
- ✅ Added four new tasks to the repository
58+
- 🎢 Expanded one existing task with more diverse queries
59+
- 🔍 Additional analysis in the paper
60+
61+
### 🔹 [2023.07.18] - Initial Release
62+
- 🎊 The project is now live!
63+
4864
## 🎯 Reference
4965

5066
If you use our findings and/or datasets in a research paper, please cite our work as follows:

asset/intro.png

395 KB
Loading

0 commit comments

Comments
 (0)