{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":178075572,"defaultBranch":"master","name":"kserve","ownerLogin":"kserve","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2019-03-27T21:14:14.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/83512434?v=4","public":true,"private":false,"isOrgOwned":true},"refInfo":{"name":"","listCacheKey":"v0:1722186656.0","currentOid":""},"activityList":{"items":[{"before":"72490bb684f6f28b5e33179647aff27d0df077f3","after":"e7d9ac8c48900bfd6db4821305b762bc51d8a67b","ref":"refs/heads/release-0.13","pushedAt":"2024-07-28T16:16:28.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Publish 0.13.1 Release (#3824)\n\nSigned-off-by: Johnu George ","shortMessageHtmlLink":"Publish 0.13.1 Release (#3824)"}},{"before":"386217551c750cd2c5225c2eb83d7364c1adf81b","after":"f9e7d5a7a08d007ba6fd8f97f590fab92d3b1808","ref":"refs/heads/master","pushedAt":"2024-07-27T20:56:46.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Bump python to 3.11 for serving runtime images and Bump poetry to 1.8.3 (#3812)\n\n* Upgrade serving runtime python version to 3.11 and debian to bookworm\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Upgrade poetry to 1.8.3\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Upgrade openjdk to 17 for pmml\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Fix 'AS' casing warning\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Fix pmml server\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n---------\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Bump python to 3.11 for serving runtime images and Bump poetry to 1.8…"}},{"before":"32d3fc09206bebb4b87453a20ba320535027abc2","after":"386217551c750cd2c5225c2eb83d7364c1adf81b","ref":"refs/heads/master","pushedAt":"2024-07-27T19:11:14.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Add tests for vLLM (#3771)\n\n* Add tests for vLLM\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* resolve comments\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Uncomment tests for fixed bugs\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n---------\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Add tests for vLLM (#3771)"}},{"before":"1c51eeee174330b076e4171e6d71e9138f2510b3","after":"72490bb684f6f28b5e33179647aff27d0df077f3","ref":"refs/heads/release-0.13","pushedAt":"2024-07-27T15:43:53.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"0.13.1 Patch Release (#3821)\n\n* Add nccl package and Bump vLLM to 0.4.3 for huggingface runtime (#3723)\r\n\r\n* Add nccl and flash_attn dependencies for huggingface runtime\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Fix vllm generate argument\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Upgrade to cuda 12.4\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n---------\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\nSigned-off-by: Gavrish Prabhu \r\n\r\n* Propagate `trust_remote_code` flag throughout vLLM startup (#3729)\r\n\r\n* propagate trc bool across vllm init\r\n\r\nSigned-off-by: Calvin Woo \r\nSigned-off-by: calvin d. woo \r\n\r\n* use args directly to avoid undefined var\r\n\r\nSigned-off-by: Calvin Woo \r\nSigned-off-by: calvin d. woo \r\n\r\n* Remove trailing space\r\n\r\nSigned-off-by: Dan Sun \r\nSigned-off-by: calvin d. woo \r\n\r\n* move params to newline\r\n\r\nSigned-off-by: calvin d. woo \r\n\r\n---------\r\n\r\nSigned-off-by: Calvin Woo \r\nSigned-off-by: calvin d. woo \r\nSigned-off-by: Dan Sun \r\nCo-authored-by: Dan Sun \r\nSigned-off-by: Gavrish Prabhu \r\n\r\n* Use add_generation_prompt while creating chat template (#3775)\r\n\r\nUse add_generation_rompt for chat template\r\n\r\nSigned-off-by: Dattu Sharma \r\nSigned-off-by: Gavrish Prabhu \r\n\r\n* Fix logprobs for vLLM (#3738)\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\nSigned-off-by: Gavrish Prabhu \r\n\r\n* Install packages needed for vllm model load\r\n\r\nSigned-off-by: Gavrish Prabhu \r\n\r\n---------\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\nSigned-off-by: Gavrish Prabhu \r\nSigned-off-by: Calvin Woo \r\nSigned-off-by: calvin d. woo \r\nSigned-off-by: Dan Sun \r\nSigned-off-by: Dattu Sharma \r\nCo-authored-by: Sivanantham <90966311+sivanantha321@users.noreply.github.com>\r\nCo-authored-by: Calvin Daniel Woo <32230627+calwoo@users.noreply.github.com>\r\nCo-authored-by: Dan Sun \r\nCo-authored-by: Datta Nimmaturi ","shortMessageHtmlLink":"0.13.1 Patch Release (#3821)"}},{"before":"de96bd3f516597a2e07b9c92936f06357f94efd2","after":"32d3fc09206bebb4b87453a20ba320535027abc2","ref":"refs/heads/master","pushedAt":"2024-07-27T15:34:57.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Upgrade CI deps (#3822)\n\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Upgrade CI deps (#3822)"}},{"before":"bb9641f970431aeb6ac4979e2f8f738e62c9161a","after":"de96bd3f516597a2e07b9c92936f06357f94efd2","ref":"refs/heads/master","pushedAt":"2024-07-25T11:10:20.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Increase timeout to make unit test stable (#3808)\n\nagent watcher unit test is always flaky so increase timeout to make it stable\r\n\r\nSigned-off-by: jooho lee ","shortMessageHtmlLink":"Increase timeout to make unit test stable (#3808)"}},{"before":"5736731f08ec19d2dfc2df97f9cc48b24041414d","after":"bb9641f970431aeb6ac4979e2f8f738e62c9161a","ref":"refs/heads/master","pushedAt":"2024-07-21T14:03:11.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Several bug fixes for vLLM completion endpoint (#3788)\n\n* Fix logprobs\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Fix a scenario where stream completion fails if echo is true and logprobs is nil\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Fix a scenario where completion fails if the prompt is token_ids and echo is set to true\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Respect tokenizer revision\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Add workaround for adding None to token_logprobs and top_logprobs\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n---------\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Several bug fixes for vLLM completion endpoint (#3788)"}},{"before":"e24d07f45c10d6df86ba44530121b7005afeaa39","after":"90274a628c5511bc41e105edff16071c59815783","ref":"refs/heads/yuzisun-patch-1","pushedAt":"2024-07-21T13:33:03.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Add llm gateway\n\nSigned-off-by: Dan Sun ","shortMessageHtmlLink":"Add llm gateway"}},{"before":null,"after":"e24d07f45c10d6df86ba44530121b7005afeaa39","ref":"refs/heads/yuzisun-patch-1","pushedAt":"2024-07-21T13:24:36.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Update ROADMAP.md\n\nSigned-off-by: Dan Sun ","shortMessageHtmlLink":"Update ROADMAP.md"}},{"before":"31d2395c5c655d26b0f55603d4c467f37b01acd8","after":"5736731f08ec19d2dfc2df97f9cc48b24041414d","ref":"refs/heads/master","pushedAt":"2024-07-20T14:16:41.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Add readiness probe for MLServer and Increase memory for pmml in CI (#3789)\n\n* Add readiness probe for mlserver in CI\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Increase memory limit for pmml test to prevent OOMKilled and read timeout error\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n---------\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Add readiness probe for MLServer and Increase memory for pmml in CI (#…"}},{"before":"580a0f372b2d5a9b3c05d0649c7fccf55abf732e","after":"31d2395c5c655d26b0f55603d4c467f37b01acd8","ref":"refs/heads/master","pushedAt":"2024-07-20T13:49:39.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Make gRPC max message length configurable (#3741)\n\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Make gRPC max message length configurable (#3741)"}},{"before":"29731befb0a1721d42e642f09f5ec62f1eafd522","after":"580a0f372b2d5a9b3c05d0649c7fccf55abf732e","ref":"refs/heads/master","pushedAt":"2024-07-18T21:59:47.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Install packages needed for vllm model load (#3802)\n\n* Install packages needed for model load\r\n\r\nSigned-off-by: Gavrish Prabhu \r\n\r\n* make all apt get into a single line\r\n\r\nSigned-off-by: Gavrish Prabhu \r\n\r\n---------\r\n\r\nSigned-off-by: Gavrish Prabhu ","shortMessageHtmlLink":"Install packages needed for vllm model load (#3802)"}},{"before":"1a70e118d5b2e77c27c41ee5ac30be55379adf15","after":"29731befb0a1721d42e642f09f5ec62f1eafd522","ref":"refs/heads/master","pushedAt":"2024-07-15T11:21:12.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Make Virtual Service case-insensitive (#3779)\n\nvirtual service case insensitive\r\n\r\nSigned-off-by: Andrews Arokiam ","shortMessageHtmlLink":"Make Virtual Service case-insensitive (#3779)"}},{"before":"164ea839eeb8b94b6c39cf0cc0bf1a94bfb5a1fc","after":"1a70e118d5b2e77c27c41ee5ac30be55379adf15","ref":"refs/heads/master","pushedAt":"2024-07-12T10:56:44.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Deduplicate the names for the additional domain names (#3773)\n\n* Deduplicate the names for the additional domain names\r\n\r\nSigned-off-by: Vincent Hou \r\n\r\n* Refactoring the functions\r\n\r\nSigned-off-by: Vincent Hou \r\n\r\n---------\r\n\r\nSigned-off-by: Vincent Hou ","shortMessageHtmlLink":"Deduplicate the names for the additional domain names (#3773)"}},{"before":"ca691f728ac0fe6a711b2953a88abb1b3d532658","after":"164ea839eeb8b94b6c39cf0cc0bf1a94bfb5a1fc","ref":"refs/heads/master","pushedAt":"2024-07-11T10:13:56.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Use add_generation_prompt while creating chat template (#3775)\n\nUse add_generation_rompt for chat template\r\n\r\nSigned-off-by: Dattu Sharma ","shortMessageHtmlLink":"Use add_generation_prompt while creating chat template (#3775)"}},{"before":"68b04ef8232cac7e519673daba452bc1068a6b3e","after":"ca691f728ac0fe6a711b2953a88abb1b3d532658","ref":"refs/heads/master","pushedAt":"2024-06-29T23:46:00.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"pillow - Buffer Overflow (#3598)\n\nempty commit\r\n\r\nSigned-off-by: Spolti ","shortMessageHtmlLink":"pillow - Buffer Overflow (#3598)"}},{"before":"d4349ea63a10237a1a597b0dea9f95f3081ac60e","after":"68b04ef8232cac7e519673daba452bc1068a6b3e","ref":"refs/heads/master","pushedAt":"2024-06-29T22:08:04.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Fix model name not properly parsed by inference graph (#3746)\n\n* Fix model name not properly parsed by inference graph\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Handle single string arg with excess whitespace\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Handle duplicate arguments\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n---------\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\nSigned-off-by: Dan Sun \r\nCo-authored-by: Dan Sun ","shortMessageHtmlLink":"Fix model name not properly parsed by inference graph (#3746)"}},{"before":"74c9ff4036c478653d05fa38dc9beb1bfdc4cc29","after":"d4349ea63a10237a1a597b0dea9f95f3081ac60e","ref":"refs/heads/master","pushedAt":"2024-06-29T19:55:37.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Fix logprobs for vLLM (#3738)\n\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Fix logprobs for vLLM (#3738)"}},{"before":"fbd87784da4ff08edc5305f7013abf837b071b28","after":"74c9ff4036c478653d05fa38dc9beb1bfdc4cc29","ref":"refs/heads/master","pushedAt":"2024-06-29T17:23:22.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Initial implementation of Inference client (#3401)\n\n* Initial implementation of inference client\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Add tests\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Use Inference client for e2e tests\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nUpgrade pytest-asyncio to 0.23.4\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nFix mutable object initialization in default parameters\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nFix graph e2e tests\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nFix pmml test\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Add explain, support dict response, use inference client for internal requests\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nFix inference graph test and grpc headers\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nRemove v1 datamodels\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Introduce protocol in client config\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Support inference graph\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nremove logging configs\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nUpdate default timeout to 60 seconds\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Add retry config for grpc client\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nFix infer model_name parameter\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Add tests for graph endpoint\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\ndebug\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nfix http client param mismatch\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nskip graph test\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nfix timeout in grpc client\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nFix url construction\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nFix explain\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* configure logger for e2e tests\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nRebase master\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nRebase master\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nFix grpc retry config\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nIncrease request timeout\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* configure logger for e2e tests\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nRebase master\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nRebase master\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nFix grpc retry config\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\nIncrease request timeout\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Rebase\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Use fixtures for rest client\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n---------\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Initial implementation of Inference client (#3401)"}},{"before":"d19e31040c558ef88774736f67a0c2af7dbc6bc2","after":"fbd87784da4ff08edc5305f7013abf837b071b28","ref":"refs/heads/master","pushedAt":"2024-06-27T09:04:19.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Fix No model ready error in multi model serving (#3758)\n\n* Fix No model ready error in multi model serving\r\n\r\n- Fixes the regression introduced by #3275\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Mark transformer model ready in init method\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n---------\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Fix No model ready error in multi model serving (#3758)"}},{"before":"352e030568779e1ee6cb0ff477ff6f3a58c6dfa0","after":"d19e31040c558ef88774736f67a0c2af7dbc6bc2","ref":"refs/heads/master","pushedAt":"2024-06-24T12:00:23.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Fix model is ready even if there is no model (#3275)\n\ncheck empty model final.\r\n\r\nSigned-off-by: HAO \r\nCo-authored-by: koshino17 ","shortMessageHtmlLink":"Fix model is ready even if there is no model (#3275)"}},{"before":"56a4e7d3c8e4f0a5f9def43a94687c81f6645346","after":"352e030568779e1ee6cb0ff477ff6f3a58c6dfa0","ref":"refs/heads/master","pushedAt":"2024-06-24T11:46:41.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Fix dead links on PyPI (#3754)\n\nThe KServe Python SDK README.md uses relative URLs that work well on GitHub but return a 404 error when visited on PyPI.\r\n\r\nThis change updates the README.md to use absolute URLs that work well on both GitHub and PyPI.\r\n\r\nSigned-off-by: kevinbazira ","shortMessageHtmlLink":"Fix dead links on PyPI (#3754)"}},{"before":"7f0f5e07ad64d09213e196402d88a0399c69b1e1","after":"56a4e7d3c8e4f0a5f9def43a94687c81f6645346","ref":"refs/heads/master","pushedAt":"2024-06-21T10:18:05.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Propagate `trust_remote_code` flag throughout vLLM startup (#3729)\n\n* propagate trc bool across vllm init\r\n\r\nSigned-off-by: Calvin Woo \r\nSigned-off-by: calvin d. woo \r\n\r\n* use args directly to avoid undefined var\r\n\r\nSigned-off-by: Calvin Woo \r\nSigned-off-by: calvin d. woo \r\n\r\n* Remove trailing space\r\n\r\nSigned-off-by: Dan Sun \r\nSigned-off-by: calvin d. woo \r\n\r\n* move params to newline\r\n\r\nSigned-off-by: calvin d. woo \r\n\r\n---------\r\n\r\nSigned-off-by: Calvin Woo \r\nSigned-off-by: calvin d. woo \r\nSigned-off-by: Dan Sun \r\nCo-authored-by: Dan Sun ","shortMessageHtmlLink":"Propagate trust_remote_code flag throughout vLLM startup (#3729)"}},{"before":"b3b1c09ce235d54cff2bae96fd3e108907443e3d","after":"7f0f5e07ad64d09213e196402d88a0399c69b1e1","ref":"refs/heads/master","pushedAt":"2024-06-19T16:24:19.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Add nccl package and Bump vLLM to 0.4.3 for huggingface runtime (#3723)\n\n* Add nccl and flash_attn dependencies for huggingface runtime\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Fix vllm generate argument\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Upgrade to cuda 12.4\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n---------\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Add nccl package and Bump vLLM to 0.4.3 for huggingface runtime (#3723)"}},{"before":"0ec176345e9b59e5f11c9ce7c813981502e1646f","after":"b3b1c09ce235d54cff2bae96fd3e108907443e3d","ref":"refs/heads/master","pushedAt":"2024-06-19T16:23:22.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"inferenceservice controller: fix error check in Serverless mode (#3753)\n\nSigned-off-by: Daniele Trifirò ","shortMessageHtmlLink":"inferenceservice controller: fix error check in Serverless mode (#3753)"}},{"before":"4c78aaf34e4ea6307db1169fbf8629c9b155780f","after":"0ec176345e9b59e5f11c9ce7c813981502e1646f","ref":"refs/heads/master","pushedAt":"2024-06-19T15:32:30.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Refactor storage initializer to log model download time for all storage types (#3735)\n\n* Refactor storage initializer to log model download time for all storage types\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n* Add more tests for local storage\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan \r\n\r\n---------\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Refactor storage initializer to log model download time for all stora…"}},{"before":"32d3e19d26f0e2622a345e483d7bbf9b083f3039","after":"4c78aaf34e4ea6307db1169fbf8629c9b155780f","ref":"refs/heads/master","pushedAt":"2024-06-19T15:26:19.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Bump MLServer to 1.5.0 (#3740)\n\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Bump MLServer to 1.5.0 (#3740)"}},{"before":"212a77c2fd76255db49569d1efa12b4c075466a8","after":"32d3e19d26f0e2622a345e483d7bbf9b083f3039","ref":"refs/heads/master","pushedAt":"2024-06-18T19:30:02.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Extract openai predict logic into smaller methods (#3716)\n\n* refactor into smaller methods\r\n\r\nSigned-off-by: grandbora \r\n\r\n* address comments\r\n\r\nSigned-off-by: grandbora \r\n\r\n* format\r\n\r\nSigned-off-by: grandbora \r\n\r\n---------\r\n\r\nSigned-off-by: grandbora ","shortMessageHtmlLink":"Extract openai predict logic into smaller methods (#3716)"}},{"before":"1c51eeee174330b076e4171e6d71e9138f2510b3","after":"212a77c2fd76255db49569d1efa12b4c075466a8","ref":"refs/heads/master","pushedAt":"2024-06-09T12:31:58.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Prevent the PassthroughCluster for clients/workloads in the service mesh (#3711)\n\nPrevent the PassthroughCluster for clients in the service mesh\r\n\r\nThe KServe Ingress VirtualServices are created with configurations targeting only the Gateways. Although this works, the omission of the Istio sidecars has the following downsides for workloads that belong to the Istio mesh:\r\n\r\n* Requests to InferenceServices will be treated as going to external services (i.e. not part of the mesh), because the sidecars are unaware of the routing rules.\r\n* In consequence, the requests will be hanlded as with any external (non-mesh) workload: the ingress gateway will first receive the request and will forward it to itself doing the URL rewrite to the relevant -predictor, -explainer or -transformer hostname. Such forwarding can be avoided (for mesh-workloads) and the rewrite can be performed by the sidecars with the right VirtualService configuration.\r\n\r\n This is adding the missing configurations in the KServe-created VirtualService, so that Istio sidecars are aware of the KServe services/hostnames and do the rewrite in the sidecar, rather than delaying/deferring the rewrite to the Gateway.\r\n\r\n For workloads that belong to the mesh, slightly better performance may be seen (given one request forwarding is saved) and better observability from Istio may also be possible.\r\n\r\nSigned-off-by: Edgar Hernández <23639005+israel-hdez@users.noreply.github.com>","shortMessageHtmlLink":"Prevent the PassthroughCluster for clients/workloads in the service m…"}},{"before":"6c37dce17c652ebc82ad956f04a5c2badb04c8a4","after":"1c51eeee174330b076e4171e6d71e9138f2510b3","ref":"refs/heads/release-0.13","pushedAt":"2024-06-05T13:34:07.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"yuzisun","name":"Dan Sun","path":"/yuzisun","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7662095?s=80&v=4"},"commit":{"message":"Update serving runtimes package version 0.13.0 (#3720)\n\nupdate version to 0.13.0\r\n\r\nSigned-off-by: Sivanantham Chinnaiyan ","shortMessageHtmlLink":"Update serving runtimes package version 0.13.0 (#3720)"}}],"hasNextPage":true,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAEizlfZwA","startCursor":null,"endCursor":null}},"title":"Activity · kserve/kserve"}