github.com/pyroscope-io/pyroscope@v0.37.3-0.20230725203016-5f6947968bd0/examples/python/README.md (about) 1 ## Continuous Profiling for Python applications 2 ### Profiling a Python Rideshare App with Pyroscope 3 ![python_example_architecture_05_00](https://user-images.githubusercontent.com/23323466/135728737-0c5e54ca-1e78-4c6d-933c-145f441c96a9.gif) 4 5 #### _Read this in other languages._ 6 <kbd>[简体中文](README_zh.md)</kbd> 7 8 Note: For documentation on the Pyroscope pip package visit [our website](https://pyroscope.io/docs/python/) 9 ## Background 10 In this example we show a simplified, basic use case of Pyroscope. We simulate a "ride share" company which has three endpoints found in `server.py`: 11 - `/bike` : calls the `order_bike(search_radius)` function to order a bike 12 - `/car` : calls the `order_car(search_radius)` function to order a car 13 - `/scooter` : calls the `order_scooter(search_radius)` function to order a scooter 14 15 We also simulate running 3 distinct servers in 3 different regions (via [docker-compose.yml](https://github.com/pyroscope-io/pyroscope/blob/main/examples/python/docker-compose.yml)) 16 - us-east 17 - eu-north 18 - ap-south 19 20 One of the most useful capabilities of Pyroscope is the ability to tag your data in a way that is meaningful to you. In this case, we have two natural divisions, and so we "tag" our data to represent those: 21 - `region`: statically tags the region of the server running the code 22 - `vehicle`: dynamically tags the endpoint (similar to how one might tag a controller rails) 23 24 25 ## Tagging static region 26 Tagging something static, like the `region`, can be done in the initialization code in the `config.tags` variable: 27 ``` 28 pyroscope.configure( 29 application_name = "ride-sharing-app", 30 server_address = "http://pyroscope:4040", 31 tags = { 32 "region": f'{os.getenv("REGION")}', # Tags the region based off the environment variable 33 } 34 ) 35 ``` 36 37 ## Tagging dynamically within functions 38 Tagging something more dynamically, like we do for the `vehicle` tag can be done inside our utility `find_nearest_vehicle()` function using a `with pyroscope.tag_wrapper()` block 39 ``` 40 def find_nearest_vehicle(n, vehicle): 41 with pyroscope.tag_wrapper({ "vehicle": vehicle}): 42 i = 0 43 start_time = time.time() 44 while time.time() - start_time < n: 45 i += 1 46 ``` 47 48 What this block does, is: 49 1. Add the tag `{ "vehicle" => "car" }` 50 2. execute the `find_nearest_vehicle()` function 51 3. Before the block ends it will (behind the scenes) remove the `{ "vehicle" => "car" }` from the application since that block is complete 52 53 ## Resulting flamegraph / performance results from the example 54 ### Running the example 55 To run the example run the following commands: 56 ``` 57 # Pull latest pyroscope image: 58 docker pull pyroscope/pyroscope:latest 59 60 # Run the example project: 61 docker-compose up --build 62 63 # Reset the database (if needed): 64 # docker-compose down 65 ``` 66 67 What this example will do is run all the code mentioned above and also send some mock-load to the 3 servers as well as their respective 3 endpoints. If you select our application: `ride-sharing-app.cpu` from the dropdown, you should see a flamegraph that looks like this (below). After we give 20-30 seconds for the flamegraph to update and then click the refresh button we see our 3 functions at the bottom of the flamegraph taking CPU resources _proportional to the size_ of their respective `search_radius` parameters. 68 69 ## Where's the performance bottleneck? 70 ![python_first_slide_05](https://user-images.githubusercontent.com/23323466/135881284-c75a5b65-6151-44fb-a459-c1f9559cb51a.jpg) 71 72 The first step when analyzing a profile outputted from your application, is to take note of the _largest node_ which is where your application is spending the most resources. In this case, it happens to be the `order_car` function. 73 74 The benefit of using the Pyroscope package, is that now that we can investigate further as to _why_ the `order_car()` function is problematic. Tagging both `region` and `vehicle` allows us to test two good hypotheses: 75 - Something is wrong with the `/car` endpoint code 76 - Something is wrong with one of our regions 77 78 To analyze this we can select one or more tags from the "Select Tag" dropdown: 79 80 ![image](https://user-images.githubusercontent.com/23323466/135525308-b81e87b0-6ffb-4ef0-a6bf-3338483d0fc4.png) 81 82 ## Narrowing in on the Issue Using Tags 83 Knowing there is an issue with the `order_car()` function we automatically select that tag. Then, after inspecting multiple `region` tags, it becomes clear by looking at the timeline that there is an issue with the `eu-north` region, where it alternates between high-cpu times and low-cpu times. 84 85 We can also see that the `mutex_lock()` function is consuming almost 70% of CPU resources during this time period. 86 ![python_second_slide_05](https://user-images.githubusercontent.com/23323466/135805908-ae9a1650-51fc-457a-8c47-0b56e8538b08.jpg) 87 88 ## Comparing two time periods 89 Using Pyroscope's "comparison view" we can actually select two different time ranges from the timeline to compare the resulting flamegraphs. The pink section on the left timeline results in the left flamegraph, and the blue section on the right represents the right flamegraph. 90 91 When we select a period of low-cpu utilization and a period of high-cpu utilization we can see that there is clearly different behavior in the `mutex_lock()` function where it takes **51% of CPU** during low-cpu times and **78% of CPU** during high-cpu times. 92 ![python_third_slide_05](https://user-images.githubusercontent.com/23323466/135805969-55fdee40-fe0c-412d-9ec0-0bbc6a748ed4.jpg) 93 94 ## Visualizing Diff Between Two Flamegraphs 95 While the difference _in this case_ is stark enough to see in the comparison view, sometimes the diff between the two flamegraphs is better visualized with them overlayed over each other. Without changing any parameters, we can simply select the diff view tab and see the difference represented in a color-coded diff flamegraph. 96 ![python_fourth_slide_05](https://user-images.githubusercontent.com/23323466/135805986-594ffa3b-e735-4f91-875d-4f76fdff2b60.jpg) 97 98 ### More use cases 99 We have been beta testing this feature with several different companies and some of the ways that we've seen companies tag their performance data: 100 - Tagging controllers 101 - Tagging regions 102 - Tagging jobs from a redis / sidekiq / rabbitmq queue 103 - Tagging commits 104 - Tagging staging / production environments 105 - Tagging different parts of their testing suites 106 - Etc... 107 108 ### Future Roadmap 109 We would love for you to try out this example and see what ways you can adapt this to your python application. Continuous profiling has become an increasingly popular tool for the monitoring and debugging of performance issues (arguably the fourth pillar of observability). 110 111 We'd love to continue to improve this pip package by adding things like integrations with popular tools, memory profiling, etc. and we would love to hear what features _you would like to see_.