SUMMARY This document highlights the security features of Power BI, according to the National Institute of Standards and Technology Cybersecurity Framework (NIST CSF). The NIST CSF is a guide for organizations to manage and reduce cybersecurity risk. Irrespective of the size of the business, [...]
This is the latest version of our course on big data. Watch the full course on Coursera at … [...]
The following diagram shows a typical Big Data Infrastructure Design. This is from one of Allied Consultant’s Big Data [...]
Cluster | Expected volume | Benchmark hardware | Project hardware requirements: Cores | RAM | # nodes | Disk
Source | 6 million records/month (~3 records per second) | | | | |
HDFS | 6 million/month | 1 namenode, 20 datanodes, 2 CPU/node, 64 GB RAM/node | 1 | 6G | 1 master, 3 slaves | 120% of 6G = 7.2 GB/month
Kafka 4 [...]
With the help of big data, small businesses can gain the competitive edge they need to stay ahead of the curve. For beginners: big data refers to large data sets that can reveal insights about your customers and help you make valuable business decisions. Data can help you to [...]
Data scientists are known for having a knack for statistics and data analysis, which they use to understand and extract insights from a given dataset, usually a very large one. If you don’t think you have this skill set, but the career really appeals to you, you can always take a course [...]
What is a Dashboard? A dashboard is a visual display of the most important information needed to achieve one or more objectives, consolidated and arranged on a single screen so the information can be monitored at a glance. A user interface that organizes, integrates, and presents mission-critical [...]
Resource Management in Information Technology There is a whole host of technology available nowadays to ensure that your IT hardware resources are managed efficiently. You may have an in-house data center and a few cloud nodes, services, or apps, which together may constitute your investment in hardware. [...]
Synchronous vs Async pipelines Synchronous big data pipelines are a series of data processing components that get triggered when a user invokes an action on a screen, e.g. clicking a button. The user typically waits until a response arrives with the results. In contrast in [...]
There is a lot of hype about “Big Data” solutions among most of our customers. I first looked at it a few years ago and found most things to be at a very early stage, with little genuine intent from customers to implement. However, in the recent past, I have seen an increase in the number of [...]