Update README.md
README.md
CHANGED
@@ -146,10 +146,10 @@ We can use the following code to get a sense of peak memory usage during inferen
 
 ## Results
 
-| Benchmark
-
-|
-| Peak Memory
+| Benchmark        |                |                                |
+|------------------|----------------|--------------------------------|
+|                  | Phi-4 mini-Ins | Phi-4-mini-instruct-int4wo-hqq |
+| Peak Memory (GB) | 8.91           | 2.98 (67% reduction)           |
 
 
 ## Benchmark Peak Memory