
Commit ce45c0d

Fix docs, update command
Signed-off-by: Rafael Vasquez <[email protected]>
1 parent 6f638e9 commit ce45c0d


43 files changed (+346, -332 lines)

docs/README.md

Lines changed: 1 addition & 0 deletions
@@ -16,4 +16,5 @@ make html
 ```bash
 python -m http.server -d build/html/
 ```
+
 Launch your browser and open localhost:8000.
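Incidentally, `python -m http.server` also accepts a port argument; a quick sketch in case 8000 is already taken (the port number here is illustrative, not from the commit):

```console
# serve the built docs on an alternate port; -d selects the directory to serve
python -m http.server 8080 -d build/html/
```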

docs/source/api/multimodal/index.md

Lines changed: 0 additions & 1 deletion
@@ -13,7 +13,6 @@ via the `multi_modal_data` field in {class}`vllm.inputs.PromptType`.
 
 Looking to add your own multi-modal model? Please follow the instructions listed [here](#enabling-multimodal-inputs).
 
-
 ## Module Contents
 
 ```{eval-rst}

docs/source/api/params.md

Lines changed: 0 additions & 1 deletion
@@ -19,4 +19,3 @@ Optional parameters for vLLM APIs.
 .. autoclass:: vllm.PoolingParams
    :members:
 ```
-

docs/source/community/sponsors.md

Lines changed: 2 additions & 0 deletions
@@ -6,13 +6,15 @@ vLLM is a community project. Our compute resources for development and testing a
 <!-- Note: Please keep these consistent with README.md. -->
 
 Cash Donations:
+
 - a16z
 - Dropbox
 - Sequoia Capital
 - Skywork AI
 - ZhenFund
 
 Compute Resources:
+
 - AMD
 - Anyscale
 - AWS

docs/source/contributing/overview.md

Lines changed: 0 additions & 2 deletions
@@ -37,8 +37,6 @@ pytest tests/
 Currently, the repository is not fully checked by `mypy`.
 ```
 
-# Contribution Guidelines
-
 ## Issues
 
 If you encounter a bug or have a feature request, please [search existing issues](https://github.com/vllm-project/vllm/issues?q=is%3Aissue) first to see if it has already been reported. If not, please [file a new issue](https://github.com/vllm-project/vllm/issues/new/choose), providing as much relevant information as possible.

docs/source/deployment/docker.md

Lines changed: 2 additions & 2 deletions
@@ -28,8 +28,8 @@ memory to share data between processes under the hood, particularly for tensor p
 You can build and run vLLM from source via the provided <gh-file:Dockerfile>. To build vLLM:
 
 ```console
-$ # optionally specifies: --build-arg max_jobs=8 --build-arg nvcc_threads=2
-$ DOCKER_BUILDKIT=1 docker build . --target vllm-openai --tag vllm/vllm-openai
+# optionally specifies: --build-arg max_jobs=8 --build-arg nvcc_threads=2
+DOCKER_BUILDKIT=1 docker build . --target vllm-openai --tag vllm/vllm-openai
 ```
 
 ```{note}
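For context, once the image is built it is typically launched along these lines; treat the GPU flags, cache mount, and model name below as illustrative assumptions rather than text from this commit:

```console
# run the image built above; the model and mounts are placeholders
docker run --runtime nvidia --gpus all \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    -p 8000:8000 \
    vllm/vllm-openai --model NousResearch/Llama-2-7b-chat-hf
```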

docs/source/deployment/frameworks/cerebrium.md

Lines changed: 5 additions & 5 deletions
@@ -13,14 +13,14 @@ vLLM can be run on a cloud based GPU machine with [Cerebrium](https://www.cerebr
 To install the Cerebrium client, run:
 
 ```console
-$ pip install cerebrium
-$ cerebrium login
+pip install cerebrium
+cerebrium login
 ```
 
 Next, to create your Cerebrium project, run:
 
 ```console
-$ cerebrium init vllm-project
+cerebrium init vllm-project
 ```
 
 Next, to install the required packages, add the following to your cerebrium.toml:

@@ -58,10 +58,10 @@ def run(prompts: list[str], temperature: float = 0.8, top_p: float = 0.95):
 Then, run the following command to deploy it to the cloud:
 
 ```console
-$ cerebrium deploy
+cerebrium deploy
 ```
 
-If successful, you should be returned a CURL command that you can call inference against. Just remember to end the url with the function name you are calling (in our case` /run`)
+If successful, you should be returned a curl command that you can run inference against. Just remember to end the URL with the function name you are calling (in our case, `/run`).
 
 ```python
 curl -X POST https://api.cortex.cerebrium.ai/v4/p-xxxxxx/vllm/run \
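A plausible completion of that request, inferred from the `run()` signature shown above; the auth header and payload values are assumptions, not part of the diff:

```console
# hypothetical full request; <YOUR_API_KEY> and the prompt are placeholders
curl -X POST https://api.cortex.cerebrium.ai/v4/p-xxxxxx/vllm/run \
  -H "Authorization: Bearer <YOUR_API_KEY>" \
  -H "Content-Type: application/json" \
  -d '{"prompts": ["Hello, world!"], "temperature": 0.8, "top_p": 0.95}'
```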

docs/source/deployment/frameworks/dstack.md

Lines changed: 5 additions & 5 deletions
@@ -13,16 +13,16 @@ vLLM can be run on a cloud based GPU machine with [dstack](https://dstack.ai/),
 To install the dstack client, run:
 
 ```console
-$ pip install "dstack[all]"
-$ dstack server
+pip install "dstack[all]"
+dstack server
 ```
 
 Next, to configure your dstack project, run:
 
 ```console
-$ mkdir -p vllm-dstack
-$ cd vllm-dstack
-$ dstack init
+mkdir -p vllm-dstack
+cd vllm-dstack
+dstack init
 ```
 
 Next, to provision a VM instance with an LLM of your choice (`NousResearch/Llama-2-7b-chat-hf` for this example), create the following `serve.dstack.yml` file for the dstack `Service`:
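Once that file is in place, the service would be submitted with dstack's run command; a sketch only, since the exact CLI invocation is assumed rather than shown in this commit:

```console
# submit the service definition to dstack (command form assumed for this CLI era)
dstack run . -f serve.dstack.yml
```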

docs/source/deployment/frameworks/skypilot.md

Lines changed: 1 addition & 1 deletion
@@ -338,7 +338,7 @@ run: |
 sky launch -c gui ./gui.yaml --env ENDPOINT=$(sky serve status --endpoint vllm)
 ```
 
-2. Then, we can access the GUI at the returned gradio link:
+1. Then, we can access the GUI at the returned gradio link:
 
 ```console
 | INFO | stdout | Running on public URL: https://6141e84201ce0bb4ed.gradio.live
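As a side note, the `--env ENDPOINT=$(sky serve status --endpoint vllm)` substitution above assumes the service is already up; one way to sanity-check it first (service name as used in the docs):

```console
# inspect the running SkyServe service before launching the GUI
sky serve status vllm
```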

docs/source/deployment/integrations/llamastack.md

Lines changed: 1 addition & 1 deletion
@@ -7,7 +7,7 @@ vLLM is also available via [Llama Stack](https://github.com/meta-llama/llama-sta
 To install Llama Stack, run:
 
 ```console
-$ pip install llama-stack -q
+pip install llama-stack -q
 ```
 
 ## Inference using OpenAI Compatible API
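That section points Llama Stack's remote vLLM provider at an OpenAI-compatible vLLM server; a minimal sketch of starting one (model name and port are illustrative assumptions):

```console
# start an OpenAI-compatible vLLM server for Llama Stack to call
vllm serve NousResearch/Llama-2-7b-chat-hf --port 8000
```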
