To write parsed data (as `csv`) to a file, invoke the following (default is to write all parsed data to the specified file).
-The default `--format` is `csv`. Thus the parameter is elided from the below examples.
+The default `--format` is `csv`. Thus, the parameter is elided from the below examples.
```sh
readstat data /some/dir/to/example.sas7bdat --output /some/dir/to/example.csv
@@ -209,9 +209,9 @@ readstat data /some/dir/to/example.sas7bdat --output /some/dir/to/example.parque
### Parallelism
The `data` subcommand includes a parameter for `--parallel`— if invoked, the _**reading**_ of a `sas7bdat` will occur in parallel. If the total rows to process is greater than `stream-rows` (if unset, the default rows to stream is 10,000), then each chunk of rows is read in parallel. Note that all processors on the user's machine are used with the `--parallel` option. In the future, may consider allowing the user to throttle this number.
-Note that although reading is in parallel, _**writing**_ is still sequential. Thus one should only anticipate moderate speed-ups as much of the time is spent writing.
+Note that although reading is in parallel, _**writing**_ is still sequential. Thus, one should only anticipate moderate speed-ups as much of the time is spent writing.

-:heavy_exclamation_mark: Utilizing the `--parallel` parameter will increase memory usage — there will be multiple threads simultaneously reading chunks from the `sas7bdat`. In addition because all processors are utilized, CPU usage may be maxed out during reading.
+:heavy_exclamation_mark: Utilizing the `--parallel` parameter will increase memory usage — there will be multiple threads simultaneously reading chunks from the `sas7bdat`. In addition, because all processors are utilized, CPU usage may be maxed out during reading.
:warning: Also, note that utilizing the `--parallel` parameter may write rows out of order from the original `sas7bdat`.
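
The read-in-parallel, write-sequentially behavior described above can be illustrated with a minimal sketch. This is not the `readstat` implementation: `chunk_bounds`, `read_chunk`, and `read_parallel_write_sequential` are hypothetical names, the "file" is simulated with generated strings, and one thread is spawned per chunk for brevity, whereas a real tool would more likely use a pool sized to the number of processors. It only shows why rows may arrive out of order and why the single writer limits the speed-up.

```rust
use std::sync::mpsc;
use std::thread;

/// Split `total_rows` into half-open (start, end) ranges of at most
/// `stream_rows` rows each.
fn chunk_bounds(total_rows: usize, stream_rows: usize) -> Vec<(usize, usize)> {
    assert!(stream_rows > 0, "stream_rows must be positive");
    (0..total_rows)
        .step_by(stream_rows)
        .map(|start| (start, (start + stream_rows).min(total_rows)))
        .collect()
}

/// Hypothetical stand-in for reading one chunk of rows from a file.
fn read_chunk(start: usize, end: usize) -> Vec<String> {
    (start..end).map(|row| format!("row {row}")).collect()
}

/// Read every chunk on its own thread, then "write" the chunks from a
/// single thread in whatever order they finish.
fn read_parallel_write_sequential(total_rows: usize, stream_rows: usize) -> Vec<String> {
    let (tx, rx) = mpsc::channel();
    let mut written = Vec::with_capacity(total_rows);
    thread::scope(|scope| {
        for (start, end) in chunk_bounds(total_rows, stream_rows) {
            let tx = tx.clone();
            scope.spawn(move || {
                // Readers run concurrently; each sends its finished chunk.
                tx.send(read_chunk(start, end)).expect("writer hung up");
            });
        }
        // Drop the original sender so the channel closes once all readers finish.
        drop(tx);
        // Single writer: chunks are appended in arrival order, which need not
        // match the original row order.
        for chunk in rx {
            written.extend(chunk);
        }
    });
    written
}

fn main() {
    let rows = read_parallel_write_sequential(25, 10);
    println!("wrote {} rows", rows.len());
}
```

Because the writer consumes chunks as they arrive, every row is written exactly once, but chunk order depends on thread scheduling; sorting or buffering by chunk index would be needed to preserve the original order.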
@@ -369,8 +369,8 @@ Flamegraph is written to `readstat/flamegraph.svg`.
:memo: Have yet to utilize flamegraphs in order to improve performance.
-## Github Actions
-Below is the rough `git tag` dance to delete and/or add tags to [trigger Github Actions](https://github.com/curtisalexander/readstat-rs/blob/main/.github/workflows/main.yml#L7-L10).
+## GitHub Actions
+Below is the rough `git tag` dance to delete and/or add tags to [trigger GitHub Actions](https://github.com/curtisalexander/readstat-rs/blob/main/.github/workflows/main.yml#L7-L10).