Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

125k errors/second and $800 of costs #975

Closed
jakajancar opened this issue Jan 1, 2023 · 4 comments
Closed

125k errors/second and $800 of costs #975

jakajancar opened this issue Jan 1, 2023 · 4 comments

Comments

@jakajancar
Copy link

jakajancar commented Jan 1, 2023

Hey,

I'm not sure whether to be impressed or scared. My application (using tokio-postgres) started producing ~125k errors per second (and incurred $800 of AWS CloudWatch Logs costs). The specific error was:

connection error: error communicating with the server: Socket not connected (os error 107)

Screenshot 2023-01-01 at 3 24 58 PM

This is coming from here:

async fn create_client(config: &tokio_postgres::Config) -> Result<tokio_postgres::Client, tokio_postgres::Error> {
    let mut config = config.clone();
    config.connect_timeout(Duration::from_secs(5)); // default is no timeout

    let (client, mut connection) = config.connect(make_tls()).await?;
    tokio::spawn(async move {
        let mut messages = futures::stream::poll_fn(move |cx| connection.poll_message(cx));
        loop {
            match messages.next().await {
                Some(Ok(AsyncMessage::Notice(notice))) => {
                    tracing::debug!("connection notice: {}", notice);
                }
                Some(Ok(_)) => {}
                Some(Err(error)) => {
                    tracing::error!("connection error: {}", error);
                }
                None => break
            }
        }
    });
    Ok(client)
}

I suspect the reason is that poll_message calls poll_shutdown, which calls Framed/Sink::poll_close, which returns Err. According to the documentation of Sink, "If this function encounters an error, the sink should be considered to have failed permanently, and no more Sink methods should be called.", however, tokio-postgres does not remember this and keeps calling it.

Or perhaps this is expected and I should stop reading after Err is returned, in which case it would be nice to explicitly document this. Because of the Option and the semantics of a stream, I assumed the interface was Stream-y and terminates after None is returned.

@sfackler
Copy link
Owner

sfackler commented Jan 1, 2023

The general expectation is that errors returned from poll_message are fatal and it shouldn't be called again. I can update the documentation to make that more clear.

@jakajancar
Copy link
Author

👍🏻 So None is expected only for graceful disconnect?

Also, out of curiosity, is there a reason for not just implementing Stream?

@sfackler
Copy link
Owner

sfackler commented Jan 1, 2023

👍🏻 So None is expected only for graceful disconnect?

Yep

Also, out of curiosity, is there a reason for not just implementing Stream?

There's a lot of method name overlap between FutureExt and StreamExt, and the connection type already implements Future. Having both impls would prevent those combinators from being callable cleanly if both traits were in scope.

@jakajancar
Copy link
Author

Good point, thanks!

lukoktonos pushed a commit to readysettech/rust-postgres that referenced this issue Aug 31, 2023

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Closes sfackler#975
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants