github.com/twelsh-aw/go/src@v0.0.0-20230516233729-a56fe86a7c81/runtime/HACKING.md

github.com/twelsh-aw/go/src@v0.0.0-20230516233729-a56fe86a7c81/runtime/HACKING.md (about)

     1  This is a living document and at times it will be out of date. It is
     2  intended to articulate how programming in the Go runtime differs from
     3  writing normal Go. It focuses on pervasive concepts rather than
     4  details of particular interfaces.
     5  
     6  Scheduler structures
     7  ====================
     8  
     9  The scheduler manages three types of resources that pervade the
    10  runtime: Gs, Ms, and Ps. It's important to understand these even if
    11  you're not working on the scheduler.
    12  
    13  Gs, Ms, Ps
    14  ----------
    15  
    16  A "G" is simply a goroutine. It's represented by type `g`. When a
    17  goroutine exits, its `g` object is returned to a pool of free `g`s and
    18  can later be reused for some other goroutine.
    19  
    20  An "M" is an OS thread that can be executing user Go code, runtime
    21  code, a system call, or be idle. It's represented by type `m`. There
    22  can be any number of Ms at a time since any number of threads may be
    23  blocked in system calls.
    24  
    25  Finally, a "P" represents the resources required to execute user Go
    26  code, such as scheduler and memory allocator state. It's represented
    27  by type `p`. There are exactly `GOMAXPROCS` Ps. A P can be thought of
    28  like a CPU in the OS scheduler and the contents of the `p` type like
    29  per-CPU state. This is a good place to put state that needs to be
    30  sharded for efficiency, but doesn't need to be per-thread or
    31  per-goroutine.
    32  
    33  The scheduler's job is to match up a G (the code to execute), an M
    34  (where to execute it), and a P (the rights and resources to execute
    35  it). When an M stops executing user Go code, for example by entering a
    36  system call, it returns its P to the idle P pool. In order to resume
    37  executing user Go code, for example on return from a system call, it
    38  must acquire a P from the idle pool.
    39  
    40  All `g`, `m`, and `p` objects are heap allocated, but are never freed,
    41  so their memory remains type stable. As a result, the runtime can
    42  avoid write barriers in the depths of the scheduler.
    43  
    44  `getg()` and `getg().m.curg`
    45  ----------------------------
    46  
    47  To get the current user `g`, use `getg().m.curg`.
    48  
    49  `getg()` alone returns the current `g`, but when executing on the
    50  system or signal stacks, this will return the current M's "g0" or
    51  "gsignal", respectively. This is usually not what you want.
    52  
    53  To determine if you're running on the user stack or the system stack,
    54  use `getg() == getg().m.curg`.
    55  
    56  Stacks
    57  ======
    58  
    59  Every non-dead G has a *user stack* associated with it, which is what
    60  user Go code executes on. User stacks start small (e.g., 2K) and grow
    61  or shrink dynamically.
    62  
    63  Every M has a *system stack* associated with it (also known as the M's
    64  "g0" stack because it's implemented as a stub G) and, on Unix
    65  platforms, a *signal stack* (also known as the M's "gsignal" stack).
    66  System and signal stacks cannot grow, but are large enough to execute
    67  runtime and cgo code (8K in a pure Go binary; system-allocated in a
    68  cgo binary).
    69  
    70  Runtime code often temporarily switches to the system stack using
    71  `systemstack`, `mcall`, or `asmcgocall` to perform tasks that must not
    72  be preempted, that must not grow the user stack, or that switch user
    73  goroutines. Code running on the system stack is implicitly
    74  non-preemptible and the garbage collector does not scan system stacks.
    75  While running on the system stack, the current user stack is not used
    76  for execution.
    77  
    78  nosplit functions
    79  -----------------
    80  
    81  Most functions start with a prologue that inspects the stack pointer
    82  and the current G's stack bound and calls `morestack` if the stack
    83  needs to grow.
    84  
    85  Functions can be marked `//go:nosplit` (or `NOSPLIT` in assembly) to
    86  indicate that they should not get this prologue. This has several
    87  uses:
    88  
    89  - Functions that must run on the user stack, but must not call into
    90    stack growth, for example because this would cause a deadlock, or
    91    because they have untyped words on the stack.
    92  
    93  - Functions that must not be preempted on entry.
    94  
    95  - Functions that may run without a valid G. For example, functions
    96    that run in early runtime start-up, or that may be entered from C
    97    code such as cgo callbacks or the signal handler.
    98  
    99  Splittable functions ensure there's some amount of space on the stack
   100  for nosplit functions to run in and the linker checks that any static
   101  chain of nosplit function calls cannot exceed this bound.
   102  
   103  Any function with a `//go:nosplit` annotation should explain why it is
   104  nosplit in its documentation comment.
   105  
   106  Error handling and reporting
   107  ============================
   108  
   109  Errors that can reasonably be recovered from in user code should use
   110  `panic` like usual. However, there are some situations where `panic`
   111  will cause an immediate fatal error, such as when called on the system
   112  stack or when called during `mallocgc`.
   113  
   114  Most errors in the runtime are not recoverable. For these, use
   115  `throw`, which dumps the traceback and immediately terminates the
   116  process. In general, `throw` should be passed a string constant to
   117  avoid allocating in perilous situations. By convention, additional
   118  details are printed before `throw` using `print` or `println` and the
   119  messages are prefixed with "runtime:".
   120  
   121  For unrecoverable errors where user code is expected to be at fault for the
   122  failure (such as racing map writes), use `fatal`.
   123  
   124  For runtime error debugging, it may be useful to run with `GOTRACEBACK=system`
   125  or `GOTRACEBACK=crash`. The output of `panic` and `fatal` is as described by
   126  `GOTRACEBACK`. The output of `throw` always includes runtime frames, metadata
   127  and all goroutines regardless of `GOTRACEBACK` (i.e., equivalent to
   128  `GOTRACEBACK=system`). Whether `throw` crashes or not is still controlled by
   129  `GOTRACEBACK`.
   130  
   131  Synchronization
   132  ===============
   133  
   134  The runtime has multiple synchronization mechanisms. They differ in
   135  semantics and, in particular, in whether they interact with the
   136  goroutine scheduler or the OS scheduler.
   137  
   138  The simplest is `mutex`, which is manipulated using `lock` and
   139  `unlock`. This should be used to protect shared structures for short
   140  periods. Blocking on a `mutex` directly blocks the M, without
   141  interacting with the Go scheduler. This means it is safe to use from
   142  the lowest levels of the runtime, but also prevents any associated G
   143  and P from being rescheduled. `rwmutex` is similar.
   144  
   145  For one-shot notifications, use `note`, which provides `notesleep` and
   146  `notewakeup`. Unlike traditional UNIX `sleep`/`wakeup`, `note`s are
   147  race-free, so `notesleep` returns immediately if the `notewakeup` has
   148  already happened. A `note` can be reset after use with `noteclear`,
   149  which must not race with a sleep or wakeup. Like `mutex`, blocking on
   150  a `note` blocks the M. However, there are different ways to sleep on a
   151  `note`:`notesleep` also prevents rescheduling of any associated G and
   152  P, while `notetsleepg` acts like a blocking system call that allows
   153  the P to be reused to run another G. This is still less efficient than
   154  blocking the G directly since it consumes an M.
   155  
   156  To interact directly with the goroutine scheduler, use `gopark` and
   157  `goready`. `gopark` parks the current goroutine—putting it in the
   158  "waiting" state and removing it from the scheduler's run queue—and
   159  schedules another goroutine on the current M/P. `goready` puts a
   160  parked goroutine back in the "runnable" state and adds it to the run
   161  queue.
   162  
   163  In summary,
   164  
   165  <table>
   166  <tr><th></th><th colspan="3">Blocks</th></tr>
   167  <tr><th>Interface</th><th>G</th><th>M</th><th>P</th></tr>
   168  <tr><td>(rw)mutex</td><td>Y</td><td>Y</td><td>Y</td></tr>
   169  <tr><td>note</td><td>Y</td><td>Y</td><td>Y/N</td></tr>
   170  <tr><td>park</td><td>Y</td><td>N</td><td>N</td></tr>
   171  </table>
   172  
   173  Atomics
   174  =======
   175  
   176  The runtime uses its own atomics package at `runtime/internal/atomic`.
   177  This corresponds to `sync/atomic`, but functions have different names
   178  for historical reasons and there are a few additional functions needed
   179  by the runtime.
   180  
   181  In general, we think hard about the uses of atomics in the runtime and
   182  try to avoid unnecessary atomic operations. If access to a variable is
   183  sometimes protected by another synchronization mechanism, the
   184  already-protected accesses generally don't need to be atomic. There
   185  are several reasons for this:
   186  
   187  1. Using non-atomic or atomic access where appropriate makes the code
   188     more self-documenting. Atomic access to a variable implies there's
   189     somewhere else that may concurrently access the variable.
   190  
   191  2. Non-atomic access allows for automatic race detection. The runtime
   192     doesn't currently have a race detector, but it may in the future.
   193     Atomic access defeats the race detector, while non-atomic access
   194     allows the race detector to check your assumptions.
   195  
   196  3. Non-atomic access may improve performance.
   197  
   198  Of course, any non-atomic access to a shared variable should be
   199  documented to explain how that access is protected.
   200  
   201  Some common patterns that mix atomic and non-atomic access are:
   202  
   203  * Read-mostly variables where updates are protected by a lock. Within
   204    the locked region, reads do not need to be atomic, but the write
   205    does. Outside the locked region, reads need to be atomic.
   206  
   207  * Reads that only happen during STW, where no writes can happen during
   208    STW, do not need to be atomic.
   209  
   210  That said, the advice from the Go memory model stands: "Don't be
   211  [too] clever." The performance of the runtime matters, but its
   212  robustness matters more.
   213  
   214  Unmanaged memory
   215  ================
   216  
   217  In general, the runtime tries to use regular heap allocation. However,
   218  in some cases the runtime must allocate objects outside of the garbage
   219  collected heap, in *unmanaged memory*. This is necessary if the
   220  objects are part of the memory manager itself or if they must be
   221  allocated in situations where the caller may not have a P.
   222  
   223  There are three mechanisms for allocating unmanaged memory:
   224  
   225  * sysAlloc obtains memory directly from the OS. This comes in whole
   226    multiples of the system page size, but it can be freed with sysFree.
   227  
   228  * persistentalloc combines multiple smaller allocations into a single
   229    sysAlloc to avoid fragmentation. However, there is no way to free
   230    persistentalloced objects (hence the name).
   231  
   232  * fixalloc is a SLAB-style allocator that allocates objects of a fixed
   233    size. fixalloced objects can be freed, but this memory can only be
   234    reused by the same fixalloc pool, so it can only be reused for
   235    objects of the same type.
   236  
   237  In general, types that are allocated using any of these should be
   238  marked as not in heap by embedding `runtime/internal/sys.NotInHeap`.
   239  
   240  Objects that are allocated in unmanaged memory **must not** contain
   241  heap pointers unless the following rules are also obeyed:
   242  
   243  1. Any pointers from unmanaged memory to the heap must be garbage
   244     collection roots. More specifically, any pointer must either be
   245     accessible through a global variable or be added as an explicit
   246     garbage collection root in `runtime.markroot`.
   247  
   248  2. If the memory is reused, the heap pointers must be zero-initialized
   249     before they become visible as GC roots. Otherwise, the GC may
   250     observe stale heap pointers. See "Zero-initialization versus
   251     zeroing".
   252  
   253  Zero-initialization versus zeroing
   254  ==================================
   255  
   256  There are two types of zeroing in the runtime, depending on whether
   257  the memory is already initialized to a type-safe state.
   258  
   259  If memory is not in a type-safe state, meaning it potentially contains
   260  "garbage" because it was just allocated and it is being initialized
   261  for first use, then it must be *zero-initialized* using
   262  `memclrNoHeapPointers` or non-pointer writes. This does not perform
   263  write barriers.
   264  
   265  If memory is already in a type-safe state and is simply being set to
   266  the zero value, this must be done using regular writes, `typedmemclr`,
   267  or `memclrHasPointers`. This performs write barriers.
   268  
   269  Runtime-only compiler directives
   270  ================================
   271  
   272  In addition to the "//go:" directives documented in "go doc compile",
   273  the compiler supports additional directives only in the runtime.
   274  
   275  go:systemstack
   276  --------------
   277  
   278  `go:systemstack` indicates that a function must run on the system
   279  stack. This is checked dynamically by a special function prologue.
   280  
   281  go:nowritebarrier
   282  -----------------
   283  
   284  `go:nowritebarrier` directs the compiler to emit an error if the
   285  following function contains any write barriers. (It *does not*
   286  suppress the generation of write barriers; it is simply an assertion.)
   287  
   288  Usually you want `go:nowritebarrierrec`. `go:nowritebarrier` is
   289  primarily useful in situations where it's "nice" not to have write
   290  barriers, but not required for correctness.
   291  
   292  go:nowritebarrierrec and go:yeswritebarrierrec
   293  ----------------------------------------------
   294  
   295  `go:nowritebarrierrec` directs the compiler to emit an error if the
   296  following function or any function it calls recursively, up to a
   297  `go:yeswritebarrierrec`, contains a write barrier.
   298  
   299  Logically, the compiler floods the call graph starting from each
   300  `go:nowritebarrierrec` function and produces an error if it encounters
   301  a function containing a write barrier. This flood stops at
   302  `go:yeswritebarrierrec` functions.
   303  
   304  `go:nowritebarrierrec` is used in the implementation of the write
   305  barrier to prevent infinite loops.
   306  
   307  Both directives are used in the scheduler. The write barrier requires
   308  an active P (`getg().m.p != nil`) and scheduler code often runs
   309  without an active P. In this case, `go:nowritebarrierrec` is used on
   310  functions that release the P or may run without a P and
   311  `go:yeswritebarrierrec` is used when code re-acquires an active P.
   312  Since these are function-level annotations, code that releases or
   313  acquires a P may need to be split across two functions.
   314  
   315  go:uintptrkeepalive
   316  -------------------
   317  
   318  The //go:uintptrkeepalive directive must be followed by a function declaration.
   319  
   320  It specifies that the function's uintptr arguments may be pointer values that
   321  have been converted to uintptr and must be kept alive for the duration of the
   322  call, even though from the types alone it would appear that the object is no
   323  longer needed during the call.
   324  
   325  This directive is similar to //go:uintptrescapes, but it does not force
   326  arguments to escape. Since stack growth does not understand these arguments,
   327  this directive must be used with //go:nosplit (in the marked function and all
   328  transitive calls) to prevent stack growth.
   329  
   330  The conversion from pointer to uintptr must appear in the argument list of any
   331  call to this function. This directive is used for some low-level system call
   332  implementations.