MARC details
000 -LEADER |
fixed length control field |
07760cam a2200385 i 4500 |
005 - DATE AND TIME OF LATEST TRANSACTION |
control field |
20221101221921.0 |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION |
fixed length control field |
150515t20152015nyua b 001 0 eng d |
011 ## - LINKING LIBRARY OF CONGRESS CONTROL NUMBER [OBSOLETE] |
Local cataloguing issues note |
BIB MATCHES WORLDCAT |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
International Standard Book Number |
1617290343 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
International Standard Book Number |
9781617290343 |
035 ## - SYSTEM CONTROL NUMBER |
System control number |
(OCoLC)909039685 |
040 ## - CATALOGING SOURCE |
Original cataloging agency |
SA$ |
Language of cataloging |
eng |
Description conventions |
rda |
Transcribing agency |
SA$ |
Modifying agency |
OCLCO |
-- |
DOV |
-- |
ATU |
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER |
Classification number |
658.4038 |
Edition number |
23 |
100 1# - MAIN ENTRY--PERSONAL NAME |
Personal name |
Marz, Nathan, |
Relator term |
author. |
9 (RLIN) |
825866 |
245 10 - TITLE STATEMENT |
Title |
Big data : |
Remainder of title |
principles and best practices of scalable real-time data systems / |
Statement of responsibility, etc. |
Nathan Marz and James Warren. |
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE |
Place of production, publication, distribution, manufacture |
Shelter Island, NY : |
Name of producer, publisher, distributor, manufacturer |
Manning, |
Date of production, publication, distribution, manufacture, or copyright notice |
2015. |
264 #4 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE |
Date of production, publication, distribution, manufacture, or copyright notice |
©2015 |
300 ## - PHYSICAL DESCRIPTION |
Extent |
xx, 308 pages : |
Other physical details |
illustrations ; |
Dimensions |
24 cm. |
336 ## - CONTENT TYPE |
Content type term |
text |
Content type code |
txt |
Source |
rdacontent |
337 ## - MEDIA TYPE |
Media type term |
unmediated |
Media type code |
n |
Source |
rdamedia |
338 ## - CARRIER TYPE |
Carrier type term |
volume |
Carrier type code |
nc |
Source |
rdacarrier |
500 ## - GENERAL NOTE |
General note |
Includes index. |
505 0# - FORMATTED CONTENTS NOTE |
Formatted contents note |
1. A new paradigm for Big Data -- Part 1. Batch layer : -- 2. Data model for Big Data -- 3. Data model for Big Data: Illustration -- 4. Data storage on the batch layer -- 5. Data storage on the batch layer: Illustration -- 6. Batch layer -- 7. Batch layer: Illustration -- 8. An example batch layer: Architecture and algorithms -- 9. An example batch layer: Implementation -- Part 2. Serving layer : -- 10. Serving layer -- 11. Serving layer: Illustration -- Part 3. Speed layer : -- 12. Realtime views -- 13. Realtime views: Illustration -- 14. Queuing and stream processing -- 15. Queuing and stream processing: Illustration -- 16. Micro-batch stream processing -- 17. Micro-batch stream processing: Illustration -- 18. Lambda Architecture in depth -- -- |
505 00 - FORMATTED CONTENTS NOTE |
Miscellaneous information |
1. |
Title |
A new paradigm for Big Data -- |
Miscellaneous information |
1.1. |
Title |
How this book is structured -- |
Miscellaneous information |
1.2. |
Title |
1.2Scaling with a traditional database -- |
Miscellaneous information |
1.3. |
Title |
NoSQL is not a panacea -- |
Miscellaneous information |
1.4. |
Title |
First principles -- |
Miscellaneous information |
1.5. |
Title |
Desired properties of a Big Data system -- |
Miscellaneous information |
1.6. |
Title |
The problems with fully incremental architectures -- |
Miscellaneous information |
1.7. |
Title |
Lambda Architecture -- |
Miscellaneous information |
1.8. |
Title |
Recent trends in technology -- |
Miscellaneous information |
1.9. |
Title |
Example application: SuperWebAnalytics.com -- |
Miscellaneous information |
1.10. |
Title |
Summary -- -- |
Miscellaneous information |
Part 1. |
Title |
Batch layer : -- -- |
Miscellaneous information |
2. |
Title |
Data model for Big Data -- |
Miscellaneous information |
2.1. |
Title |
The properties of data -- |
Miscellaneous information |
2.2. |
Title |
The fact-based model for representing data -- |
Miscellaneous information |
2.3. |
Title |
Graph schemas -- |
Miscellaneous information |
2.4. |
Title |
A complete data model for SuperWebAnalytics.com -- |
Miscellaneous information |
2.5. |
Title |
Summary -- -- |
Miscellaneous information |
3. |
Title |
Data model for Big Data: Illustration -- |
Miscellaneous information |
3.1. |
Title |
Why a serialization framework? -- |
Miscellaneous information |
3.2. |
Title |
Apache Thrift -- |
Miscellaneous information |
3.3. |
Title |
Limitations of serialization frameworks -- |
Miscellaneous information |
3.4. |
Title |
Summary -- -- |
Miscellaneous information |
4. |
Title |
Data storage on the batch layer -- |
Miscellaneous information |
4.1. |
Title |
Storage requirements for the master dataset -- |
Miscellaneous information |
4.2. |
Title |
Choosing a storage solution for the batch layer -- |
Miscellaneous information |
4.3. |
Title |
How distributed filesystems work -- |
Miscellaneous information |
4.4. |
Title |
Storing a master dataset with a distributed filesystem -- |
Miscellaneous information |
4.5. |
Title |
Vertical partitioning -- |
Miscellaneous information |
4.6. |
Title |
Low-level nature of distributed filesystems -- |
Miscellaneous information |
4.7. |
Title |
Storing the SuperWebAnalytics.com master dataset on a distributed filesystem -- |
Miscellaneous information |
4.8. |
Title |
Summary -- -- |
Miscellaneous information |
5. |
Title |
Data storage on the batch layer: Illustration -- |
Miscellaneous information |
5.1. |
Title |
Using the Hadoop Distributed File System -- |
Miscellaneous information |
5.2. |
Title |
Data storage in the batch layer with Pail -- |
Miscellaneous information |
5.3. |
Title |
Storing the master dataset for SuperWebAnalytics.com -- |
Miscellaneous information |
5.4. |
Title |
Summary -- -- |
Miscellaneous information |
6. |
Title |
Batch layer -- |
Miscellaneous information |
6.1. |
Title |
Motivating examples -- |
Miscellaneous information |
6.2. |
Title |
Computing on the batch layer -- |
Miscellaneous information |
6.3. |
Title |
Recomputation algorithms vs. incremental algorithms -- |
Miscellaneous information |
6.4. |
Title |
Scalability in the batch layer -- |
Miscellaneous information |
6.5. |
Title |
MapReduce: a paradigm for Big Data computing -- |
Miscellaneous information |
6.6. |
Title |
Low-level nature of MapReduce -- |
Miscellaneous information |
6.7. |
Title |
Pipe diagrams: a higher-level way of thinking about batch computation -- |
Miscellaneous information |
6.8. |
Title |
Summary -- -- |
Miscellaneous information |
7. |
Title |
Batch layer: Illustration -- |
Miscellaneous information |
7.1. |
Title |
An illustrative example -- |
Miscellaneous information |
7.2. |
Title |
Common pitfalls of data-processing tools -- |
Miscellaneous information |
7.3. |
Title |
An introduction to JCascalog -- |
Miscellaneous information |
7.4. |
Title |
Composition -- |
Miscellaneous information |
7.5. |
Title |
Summary -- -- |
Miscellaneous information |
8. |
Title |
An example batch layer: Architecture and algorithms -- |
Miscellaneous information |
8.1. |
Title |
Design of the SuperWebAnalytics.com batch layer -- |
Miscellaneous information |
8.2. |
Title |
Workflow overview -- |
Miscellaneous information |
8.3. |
Title |
Ingesting new data -- |
Miscellaneous information |
8.4. |
Title |
URL normalization -- |
Miscellaneous information |
8.5. |
Title |
User-identifier normalization -- |
Miscellaneous information |
8.6. |
Title |
Deduplicate pageviews -- |
Miscellaneous information |
8.7. |
Title |
Computing batch views -- |
Miscellaneous information |
8.8. |
Title |
Summary -- -- |
Miscellaneous information |
9. |
Title |
An example batch layer: Implementation -- |
Miscellaneous information |
9.1. |
Title |
Starting point -- |
Miscellaneous information |
9.2. |
Title |
Preparing the workflow -- |
Miscellaneous information |
9.3. |
Title |
Ingesting new data -- |
Miscellaneous information |
9.4. |
Title |
URL normalization -- |
Miscellaneous information |
9.5. |
Title |
User-identifier normalization -- |
Miscellaneous information |
9.6. |
Title |
Deduplicate pageviews -- |
Miscellaneous information |
9.7. |
Title |
Computing batch views -- |
Miscellaneous information |
9.8. |
Title |
Summary -- -- |
Miscellaneous information |
Part 2. |
Title |
Serving layer : -- -- |
Miscellaneous information |
10. |
Title |
Serving layer -- |
Miscellaneous information |
10.1. |
Title |
Performance metrics for the serving layer -- |
Miscellaneous information |
10.2. |
Title |
The serving layer solution to the normalization/denormalization problem -- |
Miscellaneous information |
10.3. |
Title |
Requirements for a serving layer database -- |
Miscellaneous information |
10.4. |
Title |
Designing a serving layer for SuperWebAnalytics.com -- |
Miscellaneous information |
10.5. |
Title |
Contrasting with a fully incremental solution -- |
Miscellaneous information |
10.6. |
Title |
Summary -- -- |
Miscellaneous information |
11. |
Title |
Serving layer: Illustration -- |
Miscellaneous information |
11.1. |
Title |
Basics of ElephantDB -- |
Miscellaneous information |
11.2. |
Title |
Building the serving layer for SuperWebAnalytics.com -- |
Miscellaneous information |
11.3. |
Title |
Summary -- -- |
Miscellaneous information |
Part 3. |
Title |
Speed layer : -- -- |
Miscellaneous information |
12. |
Title |
Realtime views -- |
Miscellaneous information |
12.1. |
Title |
Computing realtime views -- |
Miscellaneous information |
12.2. |
Title |
Storing realtime views -- |
Miscellaneous information |
12.3. |
Title |
Challenges of incremental computation -- |
Miscellaneous information |
12.4. |
Title |
Asynchronous versus synchronous updates -- |
Miscellaneous information |
12.5. |
Title |
Expiring realtime views -- |
Miscellaneous information |
12.6. |
Title |
Summary -- -- |
Miscellaneous information |
13. |
Title |
Realtime views: Illustration -- |
Miscellaneous information |
13.1. |
Title |
Cassandra's data model -- |
Miscellaneous information |
13.2. |
Title |
Using Cassandra -- |
Miscellaneous information |
13.3. |
Title |
Summary -- -- |
Miscellaneous information |
14. |
Title |
Queuing and stream processing -- |
Miscellaneous information |
14.1. |
Title |
Queuing -- |
Miscellaneous information |
14.2. |
Title |
Stream processing -- |
Miscellaneous information |
14.3. |
Title |
Higher-level, one-at-a-time stream processing -- |
Miscellaneous information |
14.4. |
Title |
SuperWebAnalytics.com speed layer -- |
Miscellaneous information |
14.5. |
Title |
Summary -- -- |
Miscellaneous information |
15. |
Title |
Queuing and stream processing: Illustration -- |
Miscellaneous information |
15.1. |
Title |
Defining topologies with Apache Storm -- |
Miscellaneous information |
15.2. |
Title |
Apache Storm clusters and deployment -- |
Miscellaneous information |
15.3. |
Title |
Guaranteeing message processing -- |
Miscellaneous information |
15.4. |
Title |
Implementing the SuperWebAnalytics.com uniques-over-time speed layer -- |
Miscellaneous information |
15.5. |
Title |
Summary -- -- |
Miscellaneous information |
16. |
Title |
Micro-batch stream processing -- |
Miscellaneous information |
16.1. |
Title |
Achieving exactly-once semantics -- |
Miscellaneous information |
16.2. |
Title |
Core concepts of micro-batch stream processing -- |
Miscellaneous information |
16.3. |
Title |
Extending pipe diagrams for micro-batch processing -- |
Miscellaneous information |
16.4. |
Title |
Finishing the speed layer for SuperWebAnalytics.com -- |
Miscellaneous information |
16.5. |
Title |
Pageviews over time 262 n Bounce-rate analysis -- |
Miscellaneous information |
16.6. |
Title |
Another look at the bounce-rate-analysis example -- |
Miscellaneous information |
16.7. |
Title |
Summary -- -- |
Miscellaneous information |
17. |
Title |
Micro-batch stream processing: Illustration -- |
Miscellaneous information |
17.1. |
Title |
Using Trident -- |
Miscellaneous information |
17.2. |
Title |
Finishing the SuperWebAnalytics.com speed layer -- |
Miscellaneous information |
17.3. |
Title |
Fully fault-tolerant, in-memory, micro-batch processing -- |
Miscellaneous information |
17.4. |
Title |
Summary -- -- |
Miscellaneous information |
18. |
Title |
Lambda Architecture in depth -- |
Miscellaneous information |
18.1. |
Title |
Defining data systems -- |
Miscellaneous information |
18.2. |
Title |
Batch and serving layers -- |
Miscellaneous information |
18.3. |
Title |
Speed layer -- |
Miscellaneous information |
18.4. |
Title |
Query layer -- |
Miscellaneous information |
18.5. |
Title |
Summary. |
520 ## - SUMMARY, ETC. |
Summary, etc. |
"Big Data teaches you to build big data systems using an architecture that takes advantage of clustered hardware along with new tools designed specifically to capture and analyze web-scale data. It describes a scalable, easy-to-understand approach to big data systems that can be built and run by a small team. Following a realistic example, this book guides readers through the theory of big data systems, how to implement them in practice, and how to deploy and operate them once they're built."--Publisher's website. |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name entry element |
Big data. |
9 (RLIN) |
348058 |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name entry element |
Business intelligence. |
9 (RLIN) |
314899 |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name entry element |
Data mining. |
9 (RLIN) |
328715 |
700 1# - ADDED ENTRY--PERSONAL NAME |
Personal name |
Warren, James, |
Relator term |
author. |
9 (RLIN) |
825871 |
907 ## - LOCAL DATA ELEMENT G, LDG (RLIN) |
a |
.b1412354x |
b |
22-04-20 |
c |
28-10-15 |
942 ## - ADDED ENTRY ELEMENTS (KOHA) |
Koha item type |
Book |
945 ## - LOCAL PROCESSING INFORMATION (OCLC) |
a |
658.4038 MAR |
g |
1 |
i |
A507726B |
j |
0 |
l |
cmain |
n |
27/3/17 DA Main collection item has the z item code 2 to stop this being requested by BONUS+ patrons |
o |
z |
p |
$61.69 |
q |
- |
r |
- |
s |
- |
t |
0 |
u |
8 |
v |
17 |
w |
1 |
x |
7 |
y |
.i13439182 |
z |
29-10-15 |
945 ## - LOCAL PROCESSING INFORMATION (OCLC) |
a |
658.4038 MAR |
g |
1 |
i |
A556463B |
j |
0 |
l |
cmain |
o |
- |
p |
$77.89 |
q |
- |
r |
- |
s |
- |
t |
0 |
u |
3 |
v |
11 |
w |
1 |
x |
3 |
y |
.i13439261 |
z |
29-10-15 |
998 ## - LOCAL CONTROL INFORMATION (RLIN) |
-- |
b |
-- |
c |
Operator's initials, OID (RLIN) |
06-04-16 |
Cataloger's initials, CIN (RLIN) |
m |
First date, FD (RLIN) |
a |
-- |
eng |
-- |
nyu |
-- |
0 |