Commit e3b4b85

Update LICENSE and TODOs in README (ggml-org#14)
* Update LICENSE with our copyright notice
* Update README.md
* fix readme anchor
* Update README.md
1 parent 7cebaba commit e3b4b85

2 files changed: +12 -10 lines changed

LICENSE

Lines changed: 1 addition & 0 deletions
@@ -1,6 +1,7 @@
 MIT License
 
 Copyright (c) 2023 Georgi Gerganov
+Copyright (c) 2023 SJTU-IPADS
 
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal

README.md

Lines changed: 11 additions & 10 deletions
@@ -53,14 +53,19 @@ We have tested PowerInfer on the following platforms:
 - x86-64 CPU (with AVX2 instructions) on Linux
 - x86-64 CPU and NVIDIA GPU on Linux
 - Apple M Chips on macOS (As we do not optimize for Mac, the performance improvement is not significant now.)
-
 
+And new features coming soon:
+
+- Mistral-7B model
+- Online fine-grained FFN offloading to GPU
+- Metal backend for sparse inference on macOS
+
 ## Getting Started
 
-- [Installation](##setup--installation)
-- [Model Weights](##model-weights)
+- [Installation](#setup-and-installation)
+- [Model Weights](#model-weights)
 
-## Setup & Installation
+## Setup and Installation
 ### Get the Code
 
 ```bash
@@ -70,12 +75,7 @@ cd PowerInfer
 ### Build
 In order to build PowerInfer you have two different options. These commands are supposed to be run from the root directory of the project.
 
-Using `make` on Linux or macOS:
-```bash
-make
-```
-
-Using `CMake`:
+Using `CMake` on Linux or macOS:
 * If you have one GPU:
 ```bash
 cmake -S . -B build -DLLAMA_CUBLAS=ON
@@ -130,6 +130,7 @@ PowerInfer achieves up to 11x and 8x speedup for FP16 and INT4 models!
 We will release the code and data in the following order, please stay tuned!
 
 - [x] Release core code of PowerInfer, supporting Llama-2, Falcon-40B.
+- [ ] Support Mistral-7B
 - [ ] Release perplexity evaluation code
 - [ ] Support Metal for Mac
 - [ ] Release code for OPT models
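
A note on the "fix readme anchor" part of this commit: the README links changed from `##setup--installation` to `#setup-and-installation` and the heading from `Setup & Installation` to `Setup and Installation`. The sketch below is an approximation of how a GitHub-style Markdown renderer derives a heading anchor, to show why the renamed heading gives a cleaner link target. The function name `github_style_slug` is hypothetical, and the rule is an assumption that covers only simple ASCII headings (real renderers also handle duplicate headings and other cases).

```python
import re


def github_style_slug(heading: str) -> str:
    """Approximate the anchor a GitHub-style renderer derives from a heading.

    Assumption: lowercase the text, drop punctuation other than hyphens,
    then turn each remaining space into a hyphen (runs are not collapsed).
    """
    text = heading.strip().lower()
    # Drop everything except word characters, hyphens, and spaces.
    text = re.sub(r"[^\w\- ]", "", text)
    # Each space becomes one hyphen, so "a & b" keeps two hyphens after "&" is dropped.
    return text.replace(" ", "-")


print(github_style_slug("Setup & Installation"))    # setup--installation
print(github_style_slug("Setup and Installation"))  # setup-and-installation
print(github_style_slug("Model Weights"))           # model-weights
```

Under this rule the old heading yields a double-hyphen anchor, which is easy to mistype. The old links also began with `##`, whereas a link fragment takes a single `#`.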
