Implemented get_name in StatsLogger, updated Otel and StatsD #43340

ArshiaZr · 2024-10-24T06:10:26Z

Added get_name method in StatsLogger (base_stat_logger.py) to handle metric name preparation and tag validation.
Updated Otel and StatsD loggers to inherit from StatsLogger and utilize the new get_name method.
Refactored tag preparation and validation logic into get_name for a cleaner and more consistent implementation.

^ Add meaningful description above
Read the Pull Request Guidelines for more information.
In case of fundamental code changes, an Airflow Improvement Proposal (AIP) is needed.
In case of a new dependency, check compliance with the ASF 3rd Party License Policy.
In case of backwards incompatible changes please leave a note in a newsfragment file, named {pr_number}.significant.rst or {issue_number}.significant.rst, in newsfragments.

- Added get_name method in StatsLogger (base_stat_logger.py) to handle metric name preparation and tag validation. - Updated Otel and StatsD loggers to inherit from StatsLogger and utilize the new get_name method. - Refactored tag preparation and validation logic into get_name for a cleaner and more consistent implementation.

ArshiaZr · 2024-10-24T06:16:21Z

@ferruzzi, @howardyoo, @o-nikolas Can you please take a look at it? If anything needs to change please let me know.

howardyoo · 2024-10-24T23:58:13Z

@ferruzzi, @howardyoo, @o-nikolas Can you please take a look at it? If anything needs to change please let me know.

@ArshiaZr, I am not a reviewer, and thus cannot review this PR, but I believe it would be nice if you can add unit tests for your changes as part of this PR, so that the changes can be properly checked to work properly. cc to @ferruzzi

potiuk · 2024-10-25T17:11:08Z

@ArshiaZr, I am not a reviewer, and thus cannot review this PR, but I believe it would be nice if you can add unit tests for your changes as part of this PR, so that the changes can be properly checked to work properly. cc to @ferruzzi

Actually - anyone can review PRs - and we encourage it. You can even approve it @howardyoo and it might guide other maintainers with their approvals.

howardyoo · 2024-10-25T17:17:10Z

No, as far as I know, I don’t think I have the ability to approve the PR, but if I get a chance I’ll sure do the review. Having the unit test would be very helpful for the testing too.Sent from my iPhoneOn Oct 25, 2024, at 12:11 PM, Jarek Potiuk ***@***.***> wrote: @ArshiaZr, I am not a reviewer, and thus cannot review this PR, but I believe it would be nice if you can add unit tests for your changes as part of this PR, so that the changes can be properly checked to work properly. cc to @ferruzzi Actually - anyone can review PRs - and we encourage it. You can even approve it @howardyoo and it might guide other maintainers with their approvals. —Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you were mentioned.Message ID: ***@***.***>

potiuk · 2024-10-25T17:26:18Z

No, as far as I know, I don’t think I have the ability to approve the PR

You can approve it (many contributors do that). But you need at least one approval from a commiter to merge it - but approvals can be done by absolutely anyone who has a GitHub account.

howardyoo · 2024-10-25T17:30:38Z

Ok, I see. Thanks!Sent from my iPhoneOn Oct 25, 2024, at 12:26 PM, Jarek Potiuk ***@***.***> wrote: No, as far as I know, I don’t think I have the ability to approve the PR You can approve it (many contributors do that). But you need at least one approval from a commiter to merge it - but approvals can be done by absolutely anyone who has a GitHub account. —Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you were mentioned.Message ID: ***@***.***>

howardyoo

Looks good to me, and ready. Thank you for pulling this up.

howardyoo · 2024-10-28T14:34:28Z

I reviewed the code, and to me, it is well implemented and ready to be merged.

…

On Fri, Oct 25, 2024 at 12:30 PM Howard Yoo ***@***.***> wrote: Ok, I see. Thanks! Sent from my iPhone On Oct 25, 2024, at 12:26 PM, Jarek Potiuk ***@***.***> wrote: No, as far as I know, I don’t think I have the ability to approve the PR You can approve it (many contributors do that). But you need at least one approval from a commiter to merge it - but approvals can be done by absolutely anyone who has a GitHub account. — Reply to this email directly, view it on GitHub <#43340 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AHZNLLU25JM2URXAA4NZZRLZ5J5M7AVCNFSM6AAAAABQQJT7UOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDIMZYGQYDEMRYGA> . You are receiving this because you were mentioned.Message ID: ***@***.***>

potiuk · 2024-10-28T17:47:31Z

Do we expect any incompatibilities in produced metrics @ArshiaZr ? I would love to understand what was the intention of "cleaner and more consistent implementation." before I get into details.

ferruzzi

Left some comments and questions, overall looking pretty good but still needs a little work.

airflow/metrics/base_stats_logger.py

airflow/metrics/otel_logger.py

airflow/metrics/statsd_logger.py

tests/core/test_otel_logger.py

…tsdLogger for enhanced validation - Modified StatsLogger base class to enforce stricter validation on metric names and prefixes, ensuring all names are strings and conform to required formats. - Enhanced SafeOtelLogger with additional validation to support OpenTelemetry standards and added regression tests for metrics with and without tags. - Updated SafeStatsdLogger to align with new validation standards and to handle edge cases more robustly. - Added regression tests to catch issues with missing tags, empty names, and overly long names, ensuring consistency across changes.

ArshiaZr · 2024-10-29T04:19:03Z

I just made the code more robust based on @ferruzzi reviews I'd appreciate it if you could look at it.

cc @howardyoo, @dannyl1u

ferruzzi

Looking much better, left a few more comments. One big thing that has to be fixed, but the rest are little.

tests/core/test_otel_logger.py

airflow/metrics/otel_logger.py

airflow/metrics/statsd_logger.py

tests/core/test_otel_logger.py

…ility

ArshiaZr · 2024-10-29T21:43:56Z

@ferruzzi Thanks for the reviews. Here's the fixed version. I had to add the instance back in StatsLogger as I noticed we're using it somewhere else.

airflow/metrics/otel_logger.py

airflow/metrics/statsd_logger.py

ArshiaZr · 2024-10-31T01:25:28Z

I raised another commit. I tried to address all the issues and reviewed it myself multiple times.
I'll be grateful if you can take another look at it @ferruzzi.

airflow/metrics/statsd_logger.py

ferruzzi · 2024-11-01T21:38:03Z

I can def check whether the name validation works on python otel sdk side

Thanks @howardyoo

I agree we should have short names where possible, but lets not artificially limit it in the logger to 32 characters

@ashb I concur. Can we agree that the Otel names should continue to be the name without the embedded variables, as it is now, and (assuming Howard confirms the external verification is actually working) just remove our truncating and validating the name?

The datadog statsd format natively supports tags -- we should use those there/and it should be updated to inherit form StatsLogger too

I'm not very familiar with datadog. Looking at the existing code, its get_name implementation should be similar to OTel's and just pass the name through as provided since it also gets the tags added later, right?

@ArshiaZr - Please implement the datadog part and either you and/or Howard can sort out what is going on with the otel name validation after that.

howardyoo · 2024-11-01T21:42:33Z

Yes, we shouldn’t have the limit to 32 chars - that is way too small and not useful.

- Added prepare_stat_with_tags decorator to standardize tag processing across all metrics functions. - Ensured tags default to an empty list [] when no valid tags are provided or self.metrics_tags is disabled, matching expected behavior in tests. - Updated all metric functions (e.g., incr, decr, gauge, timing, timer) to use the new decorator for consistent tag validation and formatting.

ArshiaZr · 2024-11-03T03:55:03Z

Rationale for Using Decorators Instead of Inheritance with Datadog

Using get_name with the Datadog protocol is unnecessary since we don’t combine tags and the stat name when calling DogStatsd functions. The functions in DogStatsd (e.g., increment, decrement, gauge, timing) expect the metric name and tags separately, so there’s no need to modify or combine them beforehand.

Even if tags need to be prepared, we already apply the @prepare_metric_name_with_tags decorator before get_name, which impacts only the values passed to get_name and not the final metric name used in DogStatsd calls.

Therefore, making SafeDogStatsdLogger a subclass of StatsLogger doesn’t add any functional value here. Instead, adding @prepare_metric_name_with_tags to each function in SafeDogStatsdLogger keeps the code organized and consistent without requiring inheritance specifically for Datadog. This approach provides clarity while keeping SafeDogStatsdLogger focused solely on Datadog-specific logic.
Additionally, I added the @prepare_stat_with_tags decorator for a more consistent way of validating tags, allowing for cleaner code by centralizing tag preparation and validation across all metric functions.

@ashb @ferruzzi if you have any other suggestions just let me know

ashb · 2024-11-04T14:03:27Z

Not being a subclass sounds fine as long as it follows the same interface/protocol. I'll check out the changes when I can.

ArshiaZr · 2024-11-05T04:13:02Z

I have also added inheritance for consistency. Just let me know which approach is better. @ashb, @ferruzzi

ferruzzi · 2024-11-05T17:27:15Z

We discussed on slack, but I'll also reply here for posterity: I think inheritance is the right way to go here. Should someone make a change tot he base class in the future, it would be confusing to troubleshoot why StatsD and OTel both work but Datadog doesn't. So it should inherit for the sake of consistency, if nothing else.

airflow/metrics/datadog_logger.py

howardyoo

Thanks!

ferruzzi · 2024-11-12T16:49:58Z

@ashb - Can we get this merged?

ashb · 2024-11-13T10:52:52Z

Looking at this right now, sorry for the delay

airflow/metrics/datadog_logger.py

airflow/metrics/statsd_logger.py

tests/core/test_stats.py

This review actioned, new changes but less blocking now

ferruzzi · 2024-11-13T17:12:45Z

@ArshiaZr - You can also remove the portion(s) of code that limit the OTel name length now. I think we've all agreed that we can keep the names short as Best Practice but should no longer be truncating them.

…apply minor refinements

ferruzzi · 2024-11-14T21:11:06Z

airflow/metrics/validators.py

-    if len(proposed_stat_name) > OTEL_NAME_MAX_LENGTH:
-        # If the name is in the exceptions list, do not fail it for being too long.
-        # It may still be deemed invalid for other reasons below.
-        for exemption in BACK_COMPAT_METRIC_NAMES:


My IDE is currently borked so I can't double check this easily; is BACK_COMPAT_METRIC_NAMES used anywhere else? It's possible that can also be pruned out now if it was only ever used as a work-around for the name length bug.

ferruzzi · 2024-11-18T17:43:53Z

They increased the name length in September of last year 😅 open-telemetry/opentelemetry-python#3442

ArshiaZr closed this Oct 24, 2024

ArshiaZr reopened this Oct 24, 2024

ArshiaZr closed this Oct 24, 2024

ArshiaZr reopened this Oct 24, 2024

howardyoo approved these changes Oct 28, 2024

View reviewed changes

ferruzzi reviewed Oct 28, 2024

View reviewed changes

ArshiaZr requested a review from ferruzzi October 29, 2024 04:11

Merge branch 'main' into main

ed8ae52

ferruzzi reviewed Oct 29, 2024

View reviewed changes

ArshiaZr added 2 commits October 29, 2024 17:21

Refactor metric name validation to preserve Allow/Block list compatib…

e0a6233

…ility

Merge branch 'main' of https://github.com/ArshiaZr/airflow

6d14631

ferruzzi reviewed Oct 29, 2024

View reviewed changes

ArshiaZr and others added 4 commits October 29, 2024 19:30

Merge branch 'main' into main

981405d

Merge branch 'main' into main

8a243ec

Refactored and deleted unnecessary parts of the code

459fa28

Merge branch 'main' into main

a6246bf

ferruzzi reviewed Oct 31, 2024

View reviewed changes

airflow/metrics/statsd_logger.py Show resolved Hide resolved

dannyl1u mentioned this pull request Nov 3, 2024

Metrics Improvement Project - bootstrap metrics registry #43618

Draft

Implemented inheritance for SafeDogStatsdLogger

cecf8d5

ferruzzi reviewed Nov 5, 2024

View reviewed changes

airflow/metrics/datadog_logger.py Show resolved Hide resolved

ArshiaZr requested review from ashb and howardyoo November 5, 2024 18:55

ArshiaZr and others added 2 commits November 5, 2024 13:59

Merge branch 'main' into main

f06adfc

Merge branch 'main' into main

8940606

howardyoo approved these changes Nov 5, 2024

View reviewed changes

dannyl1u approved these changes Nov 12, 2024

View reviewed changes

ferruzzi mentioned this pull request Nov 12, 2024

[AIP-49] OpenTelemetry Traces for Apache Airflow Part 2 #40802

Merged

ashb reviewed Nov 13, 2024

View reviewed changes

refactor: remove redundant checks, lift OTel length restriction, and …

ed094ad

…apply minor refinements

ArshiaZr requested review from ferruzzi, ashb, dannyl1u and howardyoo November 14, 2024 01:30

ferruzzi reviewed Nov 14, 2024

View reviewed changes

ferruzzi mentioned this pull request Nov 18, 2024

Make Open Telemetry the default instead of StatsD for monitoring #40800

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implemented get_name in StatsLogger, updated Otel and StatsD #43340

Implemented get_name in StatsLogger, updated Otel and StatsD #43340

ArshiaZr commented Oct 24, 2024

ArshiaZr commented Oct 24, 2024

howardyoo commented Oct 24, 2024 •

edited

Loading

potiuk commented Oct 25, 2024

howardyoo commented Oct 25, 2024 via email

potiuk commented Oct 25, 2024

howardyoo commented Oct 25, 2024 via email

howardyoo left a comment

howardyoo commented Oct 28, 2024 via email

potiuk commented Oct 28, 2024

ferruzzi left a comment

ArshiaZr commented Oct 29, 2024

ferruzzi left a comment

ArshiaZr commented Oct 29, 2024

ArshiaZr commented Oct 31, 2024

ferruzzi commented Nov 1, 2024 •

edited

Loading

howardyoo commented Nov 1, 2024 via email •

edited by ashb

Loading

ArshiaZr commented Nov 3, 2024

ashb commented Nov 4, 2024

ArshiaZr commented Nov 5, 2024

ferruzzi commented Nov 5, 2024

howardyoo left a comment

ferruzzi commented Nov 12, 2024

ashb commented Nov 13, 2024

ferruzzi commented Nov 13, 2024

ferruzzi Nov 14, 2024 •

edited

Loading

ferruzzi commented Nov 18, 2024

Implemented get_name in StatsLogger, updated Otel and StatsD #43340

Are you sure you want to change the base?

Implemented get_name in StatsLogger, updated Otel and StatsD #43340

Conversation

ArshiaZr commented Oct 24, 2024

ArshiaZr commented Oct 24, 2024

howardyoo commented Oct 24, 2024 • edited Loading

potiuk commented Oct 25, 2024

howardyoo commented Oct 25, 2024 via email

potiuk commented Oct 25, 2024

howardyoo commented Oct 25, 2024 via email

howardyoo left a comment

Choose a reason for hiding this comment

howardyoo commented Oct 28, 2024 via email

potiuk commented Oct 28, 2024

ferruzzi left a comment

Choose a reason for hiding this comment

ArshiaZr commented Oct 29, 2024

ferruzzi left a comment

Choose a reason for hiding this comment

ArshiaZr commented Oct 29, 2024

ArshiaZr commented Oct 31, 2024

ferruzzi commented Nov 1, 2024 • edited Loading

howardyoo commented Nov 1, 2024 via email • edited by ashb Loading

ArshiaZr commented Nov 3, 2024

Rationale for Using Decorators Instead of Inheritance with Datadog

ashb commented Nov 4, 2024

ArshiaZr commented Nov 5, 2024

ferruzzi commented Nov 5, 2024

howardyoo left a comment

Choose a reason for hiding this comment

ferruzzi commented Nov 12, 2024

ashb commented Nov 13, 2024

ferruzzi commented Nov 13, 2024

ferruzzi Nov 14, 2024 • edited Loading

Choose a reason for hiding this comment

ferruzzi commented Nov 18, 2024

howardyoo commented Oct 24, 2024 •

edited

Loading

ferruzzi commented Nov 1, 2024 •

edited

Loading

howardyoo commented Nov 1, 2024 via email •

edited by ashb

Loading

ferruzzi Nov 14, 2024 •

edited

Loading