ooni-probe-cli/internal/tutorial/experiment/torsf/chapter04/README.md


# Chapter IV: writing minimal torsf experiment

In this chapter we will replace the code written in the previous
chapter that simulates running the torsf experiment with code that
uses the `ooni/probe-cli` library to run the real experiment.

(This file is auto-generated from the corresponding source file,
so make sure you don't edit it manually.)

## Updating the imports

We need to update the imports of `torsf.go` first to look like this:

```Go

import (
```

These are standard library imports.

```Go
	"context"
	"path"
	"time"

```

As we have already seen, the `model` package defines the
generic data model used by all experiments.

```Go
	"github.com/ooni/probe-cli/v3/internal/model"

```

The `tracex` package contains code used to format internal
measurements representations to the OONI data format.

```Go
	"github.com/ooni/probe-cli/v3/internal/tracex"

```

The `ptx` package contains pluggable transport code. It includes
code to dial with obfs4 and snowflake and code to create a
pluggable transport listener.

```Go
	"github.com/ooni/probe-cli/v3/internal/ptx"

```

The `tunnel` package contains code to create a tunnel. We will
use this package to start a `tor` tunnel, which executes the `tor`
binary using specified command line arguments.

```Go
	"github.com/ooni/probe-cli/v3/internal/tunnel"
)

```


## Rewriting the run method

Let us now rewrite the `run` method to run a real `torsf`
test rather than just pretending to do it.

```Go
func (m *Measurer) run(ctx context.Context,
	sess model.ExperimentSession, testkeys *TestKeys, errch chan<- error) {
```

As a first step, we create a dialer for snowflake using the
`ptx` package. This dialer will allow us to create a `net.Conn`-like
network connection where traffic is sent using the Snowflake
pluggable transport. There are several optional fields in
`SnowflakeDialer`; the `NewSnowflakeDialer` constructor will
give us a suitable configured dialer with default settings.

```Go
	sfdialer := ptx.NewSnowflakeDialer()
```

Let us now create a listener. The `ptx.Listener` is a listener
that listens on a local port and speaks the SOCKS5 protocol. When
tor connect to this port, the listener will forward the traffic
to the Snowflake dialer we previously created. We will also
use the session's logger to emit logging messages.

```Go
	ptl := &ptx.Listener{
		PTDialer: sfdialer,
		Logger:   sess.Logger(),
	}
```

Now we start the listener. This entails opening a port on the
local host. If this operation fails, we return an error. In fact,
a failure here means a hard error that prevented us from even
starting the experiment. Therefore, it's consistent with the
`Run`'s expectations to return an error here.

```Go
	if err := ptl.Start(); err != nil {
		testkeys.Failure = tracex.NewFailure(err)
		errch <- err
		return
	}
	defer ptl.Stop()
```

Next, we start `tor` using the `tunnel` package. Note how we
pass specific `TorArgs` that cause `tor` to know about the
pluggable transport created by `ptl` and `sfdialer`.

```Go
	tun, _, err := tunnel.Start(ctx, &tunnel.Config{
		Name:      "tor",
		Session:   sess,
		TunnelDir: path.Join(sess.TempDir(), "torsf"),
		Logger:    sess.Logger(),
		TorArgs: []string{
			"UseBridges", "1",
			"ClientTransportPlugin", ptl.AsClientTransportPluginArgument(),
			"Bridge", sfdialer.AsBridgeArgument(),
		},
	})
```

In case of error, we convert `err` to a OONI failure using
the `NewFailure` function of `tracex`. This function reduces
Go error strings to the error strings used by OONI. You can
read the [errors spec](https://github.com/ooni/spec/blob/master/data-formats/df-007-errors.md)
at the [github.com/ooni/spec repo](https://github.com/ooni/spec).

Note that, in this case, we return `nil` to the caller, because
a failure here is not a fundamental failure in running the
experiment, but rather a possibly interesting anomaly.

```Go
	if err != nil {
		testkeys.Failure = tracex.NewFailure(err)
		errch <- nil
		return
	}
```

Otherwise, we successfully created a tor tunnel using Snowflake,
so we just close the tunnel and record the bootstrap time.

```Go
	defer tun.Stop()
	testkeys.BootstrapTime = tun.BootstrapTime().Seconds()
	errch <- nil
}

```

## Running the code

We can now run the code as follows to obtain:

```
$ go run ./experiment/torsf/chapter04 | jq
[...]
Jun 21 23:40:50.000 [notice] Bootstrapped 100% (done): Done
2021/06/21 23:40:50  info [100.0%] torsf experiment is finished
Jun 21 23:40:50.000 [notice] Catching signal TERM, exiting cleanly
{
  "data_format_version": "",
  "input": null,
  "measurement_start_time": "",
  "probe_asn": "",
  "probe_cc": "",
  "probe_network_name": "",
  "report_id": "",
  "resolver_asn": "",
  "resolver_ip": "",
  "resolver_network_name": "",
  "software_name": "",
  "software_version": "",
  "test_keys": {
    "bootstrap_time": 48.122813,
    "failure": null
  },
  "test_name": "",
  "test_runtime": 0,
  "test_start_time": "",
  "test_version": ""
}
```

## Concluding remarks

Congratulations, we have now rewritten together (a simplified version of)
the `torsf` experiment! In this journey, we have learned how experiments
interact with the rest of OONI Probe, how they are typically organized,
and how to use lower-level libraries to implement them.
feat: tutorial on how to write the torsf experiment (#390) Original tracking issue for Sprint 41: https://github.com/ooni/probe/issues/1507 Follow-up work in Sprint 42 tracked by: https://github.com/ooni/probe/issues/1689 2021-06-22 00:12:03 +02:00
			`# Chapter IV: writing minimal torsf experiment`

			`In this chapter we will replace the code written in the previous`
			`chapter that simulates running the torsf experiment with code that`
			uses the `ooni/probe-cli` library to run the real experiment.

			`(This file is auto-generated from the corresponding source file,`
			`so make sure you don't edit it manually.)`

			`## Updating the imports`

			We need to update the imports of `torsf.go` first to look like this:

			```Go

			`import (`
			```

			`These are standard library imports.`

			```Go
			`"context"`
			`"path"`
			`"time"`

			```

			As we have already seen, the `model` package defines the
			`generic data model used by all experiments.`

			```Go
refactor: interfaces and data types into the model package (#642) ## Checklist - [x] I have read the [contribution guidelines](https://github.com/ooni/probe-cli/blob/master/CONTRIBUTING.md) - [x] reference issue for this pull request: https://github.com/ooni/probe/issues/1885 - [x] related ooni/spec pull request: N/A Location of the issue tracker: https://github.com/ooni/probe ## Description This PR contains a set of changes to move important interfaces and data types into the `./internal/model` package. The criteria for including an interface or data type in here is roughly that the type should be important and used by several packages. We are especially interested to move more interfaces here to increase modularity. An additional side effect is that, by reading this package, one should be able to understand more quickly how different parts of the codebase interact with each other. This is what I want to move in `internal/model`: - [x] most important interfaces from `internal/netxlite` - [x] everything that was previously part of `internal/engine/model` - [x] mocks from `internal/netxlite/mocks` should also be moved in here as a subpackage 2022-01-03 13:53:23 +01:00			`"github.com/ooni/probe-cli/v3/internal/model"`
feat: tutorial on how to write the torsf experiment (#390) Original tracking issue for Sprint 41: https://github.com/ooni/probe/issues/1507 Follow-up work in Sprint 42 tracked by: https://github.com/ooni/probe/issues/1689 2021-06-22 00:12:03 +02:00
			```

refactor(netx): merge archival, trace, and the savers (#772) This diff creates a new package under netx called tracex that contains everything we need to perform measurements using events tracing and postprocessing (which is the technique with which we implement most network experiments). The general idea here is to (1) create a unique package out of all of these packages; (2) clean up the code a bit (improve tests, docs, apply more recent code patterns); (3) move the resulting code as a toplevel package inside of internal. Once this is done, netx can be further refactored to avoid subpackages and we can search for more code to salvage/refactor. See https://github.com/ooni/probe/issues/2121 2022-05-31 21:53:01 +02:00			The `tracex` package contains code used to format internal
feat: tutorial on how to write the torsf experiment (#390) Original tracking issue for Sprint 41: https://github.com/ooni/probe/issues/1507 Follow-up work in Sprint 42 tracked by: https://github.com/ooni/probe/issues/1689 2021-06-22 00:12:03 +02:00			`measurements representations to the OONI data format.`

			```Go
refactor: move tracex outside of engine/netx (#782) * refactor: move tracex outside of engine/netx Consistently with https://github.com/ooni/probe/issues/2121 and https://github.com/ooni/probe/issues/2115, we can now move tracex outside of engine/netx. The main reason why this makes sense now is that the package is now changed significantly from the one that we imported from ooni/probe-engine. We have improved its implementation, which had not been touched significantly for quite some time, and converted it to unit testing. I will document tomorrow some extra work I'd like to do with this package but likely could not do $soon. * go fmt * regen tutorials 2022-06-02 00:50:55 +02:00			`"github.com/ooni/probe-cli/v3/internal/tracex"`
feat: tutorial on how to write the torsf experiment (#390) Original tracking issue for Sprint 41: https://github.com/ooni/probe/issues/1507 Follow-up work in Sprint 42 tracked by: https://github.com/ooni/probe/issues/1689 2021-06-22 00:12:03 +02:00
			```

			The `ptx` package contains pluggable transport code. It includes
			`code to dial with obfs4 and snowflake and code to create a`
			`pluggable transport listener.`

			```Go
			`"github.com/ooni/probe-cli/v3/internal/ptx"`

			```

			The `tunnel` package contains code to create a tunnel. We will
			use this package to start a `tor` tunnel, which executes the `tor`
			`binary using specified command line arguments.`

			```Go
			`"github.com/ooni/probe-cli/v3/internal/tunnel"`
			`)`

			```


			`## Rewriting the run method`

			Let us now rewrite the `run` method to run a real `torsf`
			`test rather than just pretending to do it.`

			```Go
			`func (m *Measurer) run(ctx context.Context,`
			`sess model.ExperimentSession, testkeys *TestKeys, errch chan<- error) {`
			```

			`As a first step, we create a dialer for snowflake using the`
			`ptx` package. This dialer will allow us to create a `net.Conn`-like
			`network connection where traffic is sent using the Snowflake`
			`pluggable transport. There are several optional fields in`
feat(torsf): collect tor logs, select rendezvous method, count bytes (#683) This diff contains significant improvements over the previous implementation of the torsf experiment. We add support for configuring different rendezvous methods after the convo at https://github.com/ooni/probe/issues/2004. In doing that, I've tried to use a terminology that is consistent with the names being actually used by tor developers. In terms of what to do next, this diff basically instruments torsf to always rendezvous using domain fronting. Yet, it's also possible to change the rendezvous method from the command line, when using miniooni, which allows to experiment a bit more. In the same vein, by default we use a persistent tor datadir, but it's also possible to use a temporary datadir using the cmdline. Here's how a generic invocation of `torsf` looks like: ```bash ./miniooni -O DisablePersistentDatadir=true \ -O RendezvousMethod=amp \ -O DisableProgress=true \ torsf ``` (The default is `DisablePersistentDatadir=false` and `RendezvousMethod=domain_fronting`.) With this implementation, we can start measuring whether snowflake and tor together can boostrap, which seems the most important thing to focus on at the beginning. Understanding why the bootstrap most often does not converge with a temporary datadir on Android devices remains instead an open problem for now. (I'll also update the relevant issues or create new issues after commit this.) We also address some methodology improvements that were proposed in https://github.com/ooni/probe/issues/1686. Namely: 1. we record the tor version; 2. we include the bootstrap percentage by reading the logs; 3. we set the anomaly key correctly; 4. we measure the bytes send and received (by `tor` not by `snowflake`, since doing it for snowflake seems more complex at this stage). What remains to be done is the possibility of including Snowflake events into the measurement, which is not possible until the new improvements at common/event in snowflake.git are included into a tagged version of snowflake itself. (I'll make sure to mention this aspect to @cohosh in https://github.com/ooni/probe/issues/2004.) 2022-02-07 17:05:36 +01:00			`SnowflakeDialer`; the `NewSnowflakeDialer` constructor will
			`give us a suitable configured dialer with default settings.`
feat: tutorial on how to write the torsf experiment (#390) Original tracking issue for Sprint 41: https://github.com/ooni/probe/issues/1507 Follow-up work in Sprint 42 tracked by: https://github.com/ooni/probe/issues/1689 2021-06-22 00:12:03 +02:00
			```Go
feat(torsf): collect tor logs, select rendezvous method, count bytes (#683) This diff contains significant improvements over the previous implementation of the torsf experiment. We add support for configuring different rendezvous methods after the convo at https://github.com/ooni/probe/issues/2004. In doing that, I've tried to use a terminology that is consistent with the names being actually used by tor developers. In terms of what to do next, this diff basically instruments torsf to always rendezvous using domain fronting. Yet, it's also possible to change the rendezvous method from the command line, when using miniooni, which allows to experiment a bit more. In the same vein, by default we use a persistent tor datadir, but it's also possible to use a temporary datadir using the cmdline. Here's how a generic invocation of `torsf` looks like: ```bash ./miniooni -O DisablePersistentDatadir=true \ -O RendezvousMethod=amp \ -O DisableProgress=true \ torsf ``` (The default is `DisablePersistentDatadir=false` and `RendezvousMethod=domain_fronting`.) With this implementation, we can start measuring whether snowflake and tor together can boostrap, which seems the most important thing to focus on at the beginning. Understanding why the bootstrap most often does not converge with a temporary datadir on Android devices remains instead an open problem for now. (I'll also update the relevant issues or create new issues after commit this.) We also address some methodology improvements that were proposed in https://github.com/ooni/probe/issues/1686. Namely: 1. we record the tor version; 2. we include the bootstrap percentage by reading the logs; 3. we set the anomaly key correctly; 4. we measure the bytes send and received (by `tor` not by `snowflake`, since doing it for snowflake seems more complex at this stage). What remains to be done is the possibility of including Snowflake events into the measurement, which is not possible until the new improvements at common/event in snowflake.git are included into a tagged version of snowflake itself. (I'll make sure to mention this aspect to @cohosh in https://github.com/ooni/probe/issues/2004.) 2022-02-07 17:05:36 +01:00			`sfdialer := ptx.NewSnowflakeDialer()`
feat: tutorial on how to write the torsf experiment (#390) Original tracking issue for Sprint 41: https://github.com/ooni/probe/issues/1507 Follow-up work in Sprint 42 tracked by: https://github.com/ooni/probe/issues/1689 2021-06-22 00:12:03 +02:00			```

			Let us now create a listener. The `ptx.Listener` is a listener
			`that listens on a local port and speaks the SOCKS5 protocol. When`
			`tor connect to this port, the listener will forward the traffic`
			`to the Snowflake dialer we previously created. We will also`
			`use the session's logger to emit logging messages.`

			```Go
			`ptl := &ptx.Listener{`
			`PTDialer: sfdialer,`
			`Logger: sess.Logger(),`
			`}`
			```

			`Now we start the listener. This entails opening a port on the`
			`local host. If this operation fails, we return an error. In fact,`
			`a failure here means a hard error that prevented us from even`
			`starting the experiment. Therefore, it's consistent with the`
			`Run`'s expectations to return an error here.

			```Go
			`if err := ptl.Start(); err != nil {`
refactor(netx): merge archival, trace, and the savers (#772) This diff creates a new package under netx called tracex that contains everything we need to perform measurements using events tracing and postprocessing (which is the technique with which we implement most network experiments). The general idea here is to (1) create a unique package out of all of these packages; (2) clean up the code a bit (improve tests, docs, apply more recent code patterns); (3) move the resulting code as a toplevel package inside of internal. Once this is done, netx can be further refactored to avoid subpackages and we can search for more code to salvage/refactor. See https://github.com/ooni/probe/issues/2121 2022-05-31 21:53:01 +02:00			`testkeys.Failure = tracex.NewFailure(err)`
feat: tutorial on how to write the torsf experiment (#390) Original tracking issue for Sprint 41: https://github.com/ooni/probe/issues/1507 Follow-up work in Sprint 42 tracked by: https://github.com/ooni/probe/issues/1689 2021-06-22 00:12:03 +02:00			`errch <- err`
			`return`
			`}`
			`defer ptl.Stop()`
			```

			Next, we start `tor` using the `tunnel` package. Note how we
			pass specific `TorArgs` that cause `tor` to know about the
			pluggable transport created by `ptl` and `sfdialer`.

			```Go
refactor(netx): merge archival, trace, and the savers (#772) This diff creates a new package under netx called tracex that contains everything we need to perform measurements using events tracing and postprocessing (which is the technique with which we implement most network experiments). The general idea here is to (1) create a unique package out of all of these packages; (2) clean up the code a bit (improve tests, docs, apply more recent code patterns); (3) move the resulting code as a toplevel package inside of internal. Once this is done, netx can be further refactored to avoid subpackages and we can search for more code to salvage/refactor. See https://github.com/ooni/probe/issues/2121 2022-05-31 21:53:01 +02:00			`tun, _, err := tunnel.Start(ctx, &tunnel.Config{`
feat: tutorial on how to write the torsf experiment (#390) Original tracking issue for Sprint 41: https://github.com/ooni/probe/issues/1507 Follow-up work in Sprint 42 tracked by: https://github.com/ooni/probe/issues/1689 2021-06-22 00:12:03 +02:00			`Name: "tor",`
			`Session: sess,`
			`TunnelDir: path.Join(sess.TempDir(), "torsf"),`
			`Logger: sess.Logger(),`
			`TorArgs: []string{`
			`"UseBridges", "1",`
			`"ClientTransportPlugin", ptl.AsClientTransportPluginArgument(),`
			`"Bridge", sfdialer.AsBridgeArgument(),`
			`},`
			`})`
			```

			In case of error, we convert `err` to a OONI failure using
refactor(netx): merge archival, trace, and the savers (#772) This diff creates a new package under netx called tracex that contains everything we need to perform measurements using events tracing and postprocessing (which is the technique with which we implement most network experiments). The general idea here is to (1) create a unique package out of all of these packages; (2) clean up the code a bit (improve tests, docs, apply more recent code patterns); (3) move the resulting code as a toplevel package inside of internal. Once this is done, netx can be further refactored to avoid subpackages and we can search for more code to salvage/refactor. See https://github.com/ooni/probe/issues/2121 2022-05-31 21:53:01 +02:00			the `NewFailure` function of `tracex`. This function reduces
feat: tutorial on how to write the torsf experiment (#390) Original tracking issue for Sprint 41: https://github.com/ooni/probe/issues/1507 Follow-up work in Sprint 42 tracked by: https://github.com/ooni/probe/issues/1689 2021-06-22 00:12:03 +02:00			`Go error strings to the error strings used by OONI. You can`
			`read the [errors spec](https://github.com/ooni/spec/blob/master/data-formats/df-007-errors.md)`
			`at the [github.com/ooni/spec repo](https://github.com/ooni/spec).`

			Note that, in this case, we return `nil` to the caller, because
			`a failure here is not a fundamental failure in running the`
			`experiment, but rather a possibly interesting anomaly.`

			```Go
			`if err != nil {`
refactor(netx): merge archival, trace, and the savers (#772) This diff creates a new package under netx called tracex that contains everything we need to perform measurements using events tracing and postprocessing (which is the technique with which we implement most network experiments). The general idea here is to (1) create a unique package out of all of these packages; (2) clean up the code a bit (improve tests, docs, apply more recent code patterns); (3) move the resulting code as a toplevel package inside of internal. Once this is done, netx can be further refactored to avoid subpackages and we can search for more code to salvage/refactor. See https://github.com/ooni/probe/issues/2121 2022-05-31 21:53:01 +02:00			`testkeys.Failure = tracex.NewFailure(err)`
feat: tutorial on how to write the torsf experiment (#390) Original tracking issue for Sprint 41: https://github.com/ooni/probe/issues/1507 Follow-up work in Sprint 42 tracked by: https://github.com/ooni/probe/issues/1689 2021-06-22 00:12:03 +02:00			`errch <- nil`
			`return`
			`}`
			```

			`Otherwise, we successfully created a tor tunnel using Snowflake,`
			`so we just close the tunnel and record the bootstrap time.`

			```Go
			`defer tun.Stop()`
			`testkeys.BootstrapTime = tun.BootstrapTime().Seconds()`
			`errch <- nil`
			`}`

			```

			`## Running the code`

			`We can now run the code as follows to obtain:`

			```
			`$ go run ./experiment/torsf/chapter04 \| jq`
			`[...]`
			`Jun 21 23:40:50.000 [notice] Bootstrapped 100% (done): Done`
			`2021/06/21 23:40:50 info [100.0%] torsf experiment is finished`
			`Jun 21 23:40:50.000 [notice] Catching signal TERM, exiting cleanly`
			`{`
			`"data_format_version": "",`
			`"input": null,`
			`"measurement_start_time": "",`
			`"probe_asn": "",`
			`"probe_cc": "",`
			`"probe_network_name": "",`
			`"report_id": "",`
			`"resolver_asn": "",`
			`"resolver_ip": "",`
			`"resolver_network_name": "",`
			`"software_name": "",`
			`"software_version": "",`
			`"test_keys": {`
			`"bootstrap_time": 48.122813,`
			`"failure": null`
			`},`
			`"test_name": "",`
			`"test_runtime": 0,`
			`"test_start_time": "",`
			`"test_version": ""`
			`}`
			```

			`## Concluding remarks`

			`Congratulations, we have now rewritten together (a simplified version of)`
			the `torsf` experiment! In this journey, we have learned how experiments
			`interact with the rest of OONI Probe, how they are typically organized,`
			`and how to use lower-level libraries to implement them.`