DataStax Brisk
==============
This package contains an HDFS-compatible layer (CFS) and a CassandraJobConf,
which can be used to run MapReduce (MR) jobs without HDFS or dedicated
job/task trackers. It also includes a Hive driver for accessing data in
Cassandra, as well as a Hive metastore implementation.

Hadoop jobs and Hive are set up to work with the MR cluster.

For detailed docs, please see:

  http://www.datastax.com/docs/0.8/brisk/index

You can also discuss Brisk on the freenode IRC channel #datastax-brisk

Required Setup
==============
On Linux systems, run the following as root:

  echo 1 > /proc/sys/vm/overcommit_memory

This avoids out-of-memory (OOM) errors when tasks are spawned.
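
To confirm the setting took effect, something like the following sketch may
help. It only reads the same /proc file the command above writes to and
prints what the value means. The function name describe_overcommit and the
OVERCOMMIT_FILE variable are made up for this example and are not part of
Brisk.

```shell
#!/bin/sh
# Sketch only: describe_overcommit and OVERCOMMIT_FILE are hypothetical names.

# Translate a vm.overcommit_memory value (0, 1 or 2) into words.
describe_overcommit() {
    case "$1" in
        0) echo "heuristic overcommit" ;;
        1) echo "always overcommit" ;;
        2) echo "strict accounting" ;;
        *) echo "unknown value: $1" ;;
    esac
}

# Read the current kernel setting; the path can be overridden for testing.
OVERCOMMIT_FILE="${OVERCOMMIT_FILE:-/proc/sys/vm/overcommit_memory}"
if [ -r "$OVERCOMMIT_FILE" ]; then
    describe_overcommit "$(cat "$OVERCOMMIT_FILE")"
else
    echo "cannot read $OVERCOMMIT_FILE"
fi
```

After the echo command in this section has been run, the script should
report "always overcommit". A value written through /proc does not survive a
reboot; to keep it, the usual approach is a "vm.overcommit_memory = 1" line
in /etc/sysctl.conf.
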
Getting Started
===============
To try it out, run:

  1. Compile and download all dependencies:

       ant

  2. Start Cassandra with the built-in job/task trackers:

       ./bin/brisk cassandra -t

  3. View the JobTracker:

       http://localhost:50030

  4. Examine CassandraFS:

       ./bin/brisk hadoop fs -lsr cfs:///

  5. Start the Hive shell or web UI:

       ./bin/brisk hive

     or

       ./bin/brisk hive --service hwi

     then open a web browser to http://localhost:9999/hwi
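
When scripting the steps above, the later commands can fail if they run
before the JobTracker has finished starting. One possible way to handle
that is a small retry helper, sketched below. The helper name wait_for, the
retry counts, and the assumption that the JobTracker page answers a plain
HTTP GET with a success status are illustrative only and not part of Brisk.

```shell
#!/bin/sh
# Sketch only: wait_for is a hypothetical helper, not shipped with Brisk.

# wait_for ATTEMPTS DELAY COMMAND [ARGS...]
# Runs COMMAND up to ATTEMPTS times, sleeping DELAY seconds after each
# failure. Returns 0 as soon as COMMAND succeeds, 1 if it never does.
wait_for() {
    attempts=$1
    delay=$2
    shift 2
    i=1
    while [ "$i" -le "$attempts" ]; do
        if "$@" >/dev/null 2>&1; then
            return 0
        fi
        i=$((i + 1))
        sleep "$delay"
    done
    return 1
}

# Example use (commented out so the sketch has no side effects):
#   ./bin/brisk cassandra -t &
#   wait_for 30 2 curl -sf http://localhost:50030/ \
#       && ./bin/brisk hadoop fs -lsr cfs:///
```

The probe command is passed in as arguments, so curl can be swapped for any
other check that exits with status 0 once the service is ready.
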