[SkyServe] Cache output of `sky serve status --endpoint` #2915

cblmemo · 2023-12-28T08:48:21Z

Currently, every run of `sky serve status --endpoint` contacts the SkyServe controller to fetch the endpoint. That is unfriendly to bash scripts that call this CLI repeatedly, e.g. `curl $(sky serve status --endpoint svc1)/v1/models`. We should keep a local database as a cache for the service endpoint, since the endpoint does not change while the service is running.

Notice that we still use the remote database on the SkyServe controller as the only source of truth. If we don't find the service endpoint in the local cache, then we contact the SkyServe controller for the endpoint.
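A minimal sketch of the lookup order described above, assuming a standalone local sqlite3 cache. The table name `endpoint_cache`, the helper names, and the `query_controller` callback are all hypothetical stand-ins for illustration; none of them are existing SkyServe APIs.

```python
import sqlite3
from typing import Callable


def open_cache(path: str) -> sqlite3.Connection:
    """Open (or create) the local cache database (hypothetical schema)."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS endpoint_cache ("
        "service_name TEXT PRIMARY KEY, endpoint TEXT NOT NULL)")
    return conn


def get_endpoint(conn: sqlite3.Connection, service_name: str,
                 query_controller: Callable[[str], str]) -> str:
    """Return the cached endpoint, asking the controller only on a miss."""
    row = conn.execute(
        "SELECT endpoint FROM endpoint_cache WHERE service_name = ?",
        (service_name,)).fetchone()
    if row is not None:
        return row[0]
    # Cache miss: the controller remains the only source of truth.
    # `query_controller` stands in for the slow round trip to it.
    endpoint = query_controller(service_name)
    conn.execute(
        "INSERT OR REPLACE INTO endpoint_cache (service_name, endpoint) "
        "VALUES (?, ?)", (service_name, endpoint))
    conn.commit()
    return endpoint
```

Because the cache lives in a file rather than in process memory, it would survive across separate CLI invocations, which is what a script calling the command in a loop needs. A real implementation would also have to drop the entry when the service is torn down, since the endpoint is only stable while the service is running.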

github-actions · 2024-04-27T01:45:14Z

This issue is stale because it has been open 120 days with no activity. Remove stale label or comment or this will be closed in 10 days.

github-actions · 2024-10-20T02:04:39Z

This issue is stale because it has been open 120 days with no activity. Remove stale label or comment or this will be closed in 10 days.

github-actions · 2025-02-18T01:58:38Z

This issue is stale because it has been open 120 days with no activity. Remove stale label or comment or this will be closed in 10 days.

kyuds · 2025-03-18T13:47:36Z

@cblmemo just wondering if this is still relevant

cblmemo · 2025-03-19T23:43:14Z

Yes

kyuds · 2025-03-20T00:12:46Z

Could I try to work on this?

cblmemo · 2025-03-20T05:20:17Z

Sounds great! cc @Michaelvll for a look here

kyuds · 2025-03-20T05:30:05Z

Perfect. @cblmemo if you could assign this issue to me, I'll get started this weekend.

cblmemo · 2025-03-20T17:07:51Z

Done

kyuds · 2025-03-24T08:45:09Z

@cblmemo
OK, I had a chance to look through the SkyServe code over the weekend. Here is my general plan for implementing this issue. If it looks good, I'll go ahead with the implementation and submit a draft PR with more detailed specs:

1. The caching will be done by adding a new column to the serve_state sqlite3 database (under ~/.sky/serve/service.db). The column will be named "endpoint", default to None, and be populated by the "initial" status query. However, since the sky API server is moving to a client-server architecture, API server DBs may no longer be local, so it might be better to create a new local database (maybe under ~/.sky/serve/cache?) that just caches the state instead of querying the API server. I would appreciate feedback on this part, as my understanding is that the local sky API server is what currently stores state data under the ~/.sky folder.
2. I am assuming we want caching only for the --endpoint flag (correct me if I am wrong). When we receive such a request, we check the cache for the endpoint; if it exists, we return it, otherwise we query the controller and store the result.
3. I am also thinking of adding a --force-refresh flag (name open to change) so that users can force a re-query against the ground-truth SkyServe controller.
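For discussion, here is a rough sketch of the column-based variant with a force-refresh switch. It assumes the service table is called `services` and is keyed by a `name` column, which I have not verified against the real serve_state schema; the function names and the `query_controller` callback are likewise hypothetical.

```python
import sqlite3
from typing import Callable


def ensure_endpoint_column(conn: sqlite3.Connection) -> None:
    """Add a nullable `endpoint` column if the table does not have one yet."""
    # PRAGMA table_info yields one row per column; index 1 is the column name.
    columns = [row[1] for row in conn.execute("PRAGMA table_info(services)")]
    if "endpoint" not in columns:
        conn.execute(
            "ALTER TABLE services ADD COLUMN endpoint TEXT DEFAULT NULL")
        conn.commit()


def status_endpoint(conn: sqlite3.Connection, name: str,
                    query_controller: Callable[[str], str],
                    force_refresh: bool = False) -> str:
    """Return the endpoint, skipping the cache when force_refresh is set."""
    if not force_refresh:
        row = conn.execute(
            "SELECT endpoint FROM services WHERE name = ?",
            (name,)).fetchone()
        if row is not None and row[0] is not None:
            return row[0]
    # Miss or forced refresh: ask the controller (hypothetical callback)
    # and store the answer. The UPDATE is a no-op if no such row exists.
    endpoint = query_controller(name)
    conn.execute(
        "UPDATE services SET endpoint = ? WHERE name = ?", (endpoint, name))
    conn.commit()
    return endpoint
```

The migration check makes `ensure_endpoint_column` safe to call on every startup, and leaving the column NULL by default means existing rows simply count as cache misses until the first status query fills them in.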

cblmemo · 2025-03-25T21:21:46Z

I think putting that into the service DB on the API server makes sense, since this cache should be shared by all users of that API server.
`--force-refresh` looks good to me.

cc @Michaelvll for a look here

cblmemo added good first issue Good for newcomers feature-request labels Dec 28, 2023

github-actions bot added the Stale label Apr 27, 2024

cblmemo removed the Stale label Apr 27, 2024

cblmemo added the serve features/bugs related to sky serve label Jun 21, 2024

github-actions bot added the Stale label Oct 20, 2024

cblmemo removed the Stale label Oct 20, 2024

github-actions bot added the Stale label Feb 18, 2025

cblmemo removed the Stale label Feb 18, 2025

cblmemo assigned kyuds Mar 20, 2025

kyuds linked a pull request Mar 28, 2025 that will close this issue

[Serve] Add endpoint caching for status --endpoint #5052

Open
