You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/about/contact.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -14,7 +14,7 @@ If you have any issues with any of the particular Bullet sub-components, feel fr
14
14
15
15
## Mailing Lists
16
16
17
-
If you have a general question, reach out to us at the Users list and at Developers if you have a particular question for the developers.
17
+
If you have a general question, comment, or observation meant for general visibility, reach out to the Users list. If you want to keep it to just the developers, reach to that list instead.
Copy file name to clipboardExpand all lines: docs/about/contributing.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,14 +1,14 @@
1
1
# Contributing
2
2
3
-
We welcome all contributions! We also welcome all usage experiences, stories, annoyances and whatever else you want to say. Head on over to our [Contact](contact.md) and let us know!
3
+
We welcome all contributions! We also welcome all usage experiences, stories, annoyances and whatever else you want to say. Head on over to our [Contact Us page](contact.md) and let us know!
4
4
5
5
## Contributor License Agreement (CLA)
6
6
7
7
Bullet is hosted under the [Yahoo Github Organization](https://github.com/yahoo). In order to contribute to any Yahoo project, you will need to submit a CLA. When you submit a Pull Request to any Bullet repository, a CLABot will ask you to sign the CLA if you haven't signed one already.
8
8
9
9
## Future plans
10
10
11
-
Here is a list of features we are currently considering/working on. If the Status column is empty, we are still discussing how to approach/break them. They will be updated as they are solidified. Feel free to [contact us](contact.md) with any ideas/suggestions/PRs!
11
+
Here is a list of features we are currently considering/working on. If the Status column is empty, we are still discussing how to prioritize/approach/break them down. They will be updated as they are solidified. Feel free to [contact us](contact.md) with any ideas/suggestions/PRs!
12
12
13
13
This list is neither comprehensive nor in any particular order.
Copy file name to clipboardExpand all lines: docs/quick-start.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,7 +16,7 @@ By the following the steps in this section, you will:
16
16
17
17
## Setting up Storm
18
18
19
-
To set up a clean working environment, let's start with setting some directories.
19
+
To set up a clean working environment, let's start with creating some directories.
20
20
21
21
#### Step 1: Setup directories and examples
22
22
@@ -84,7 +84,7 @@ storm kill topology
84
84
85
85
!!! note "Local mode cleanup"
86
86
87
-
If you notice any problems while setting up storm or while relaunching a topology, it may be because some state is corrupted. When running Storm in this fashion, states and serializations are stored in ```storm-local```. You may want to ```rm -rf storm-local/*``` to remove everything in this directory before relaunching Storm components.
87
+
If you notice any problems while setting up storm or while relaunching a topology, it may be because some state is corrupted. When running Storm in this fashion, states and serializations are stored in ```storm-local``` and ```/tmp/```. You may want to ```rm -rf storm-local/* /tmp/dev-storm-zookeeper``` to clean up this state before relaunching Storm components. See the [tear down section](#teardown) on how to kill any running instances.
88
88
89
89
## Setting up the example Bullet topology
90
90
@@ -214,15 +214,15 @@ To cleanup all the components we bought up:
214
214
| Web Service | ```ps aux | grep [e]xample_context.properties | awk '{print $2}' | xargs kill``` |
You can also ```rm -rf $BULLET_HOME /tmp/dev-storm-zookeeper /tmp/jetty-*``` to clean up your file system too.
217
+
You can also ```rm -rf $BULLET_HOME /tmp/dev-storm-zookeeper /tmp/jetty-*``` to delete *everything* that we did so far.
218
218
219
219
## What did we do?
220
220
221
-
This section will cover the various custom pieces this example plugged into Bullet, so you can better understand what we did.
221
+
This section will go over the various custom pieces this example plugged into Bullet, so you can better understand what we did.
222
222
223
223
### Storm topology
224
224
225
-
The topology was the Bullet topology plugged in with a custom spout. This spout is implemented in this [example project](https://github.com/yahoo/bullet-docs/blob/master/examples/storm/) was already built for you when you [downloaded the examples](#step-1-setup-directories-and-examples). This spout produces a maximum number of records in a given period. Both these arguments are configurable. If you examine $BULLET_HOME/backend/storm/launch.sh, you'll see the following:
225
+
The topology was the Bullet topology plugged in with a custom spout. This spout is implemented in this [example project](https://github.com/yahoo/bullet-docs/blob/master/examples/storm/)and was already built for you when you [downloaded the examples](#step-1-setup-directories-and-examples). It does not read from any data source and just produces random, structured data. It also produces only up to a maximum number of records in a given period. Both this maximum and the length of a period are configurable. If you examine $BULLET_HOME/backend/storm/launch.sh, you'll see the following:
226
226
227
227
```bash
228
228
storm jar bullet-storm-example-1.0-SNAPSHOT-jar-with-dependencies.jar \
Copy file name to clipboardExpand all lines: docs/ui/usage.md
+6Lines changed: 6 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -158,3 +158,9 @@ In this example, we group by ```tuple_number```. Recall that this is the number
158
158
!!! note "Why no Count Distinct after Grouping"
159
159
160
160
At this time, we do not support counting distinct values per field because with the current implementation of Grouping, it would involve storing Data Sketches within Data Sketches. We are considering this in a future release however.
161
+
162
+
!!! note "Aha, sorting by tuple_number didn't sort properly!"
163
+
164
+
Good job, eagle eyes! Unfortunately, whenever we group on fields, those fields become strings under the current implementation. Rather than convert them back at the end, we have currently decided to leave it as is. This means that in your results, if you try and sort by a grouped field, it will perform a lexicographical sort.
165
+
166
+
This also means that you can actually group by any field - including non primitives such as maps and lists! The field will be converted to a string and that string will be used as the field's representation for uniqueness and grouping purposes.
0 commit comments