Building Hadoop clusters review

Building_Hadoop_Clusters

If you are interested in Hadoop technology probably this is an interesting video course you should evaluate. As you probably know, Apache Hadoop is an open-source software framework for storage and large-scale processing of data-sets on clusters of commodity hardware. All the modules in Hadoop are designed with the assumption that hardware failures are common and thus should be automatically handled in software by the framework.

Talking about the video course, we can divide the content in three main macro-sections:
1. how to create and set up a three machines cluster using Amazon EC2,
2. how to install an Hadoop cluster using Apache Ambari,
3. how to start using Hadoop cluster, in particular with Apache Hadoop User Interface (HUE).

The description of all the topics is clear and well done (Sean Mikha, the author, did a good job). All the relevant topics are always detailed before with an explanation of the logic structure and approach and only after with a demostration on how to do it in practice.

Useful also for other purposes, the creation of the virtual machines on Amazon EC2. The practical description and the step by step creation, is not limited to the server’s creation but is detailed also in what concerns the security and connection using, for example, putty ssh client.

apache hadoopIn my opinion the most relevant value of this video course is on the hidden details of the Hadoop cluster installation process. As you will see if you will decide to follow it, the tasks are quite easy to do (probably this a Sean’s merit) but the configuration details and settings are very important if you want to make it work in practice. Following the hints I’m sure every neophyte will gain days of work and lot of nights in googling. 😉

Enjoy your Hadoop Cluster video course… as usual by Packt Publishing.

Francesco Corti

Advertisements

Review of the Alfresco CMIS book by Martin Bergljung

Alfresco CMISAs you probably know (or you read it now for the first time) CMIS is an open standard that allows different ECMs to inter-operate over the Internet through the definition of a collection of services and a powerful query language (CMIS-QL), modeled along a subset of SQL.

The goal of this book is to share and explain all the basics of the CMIS, using a practical and technical approach that starts from the history (why the CMIS was born), going through the definition of the (several) services and the query language, and ending with a collection of examples describing how to use CMIS in practice.

CMISOk, CMIS is thought to make different ECMs interoperate, but the amount of different languages and examples described in this book is interesting and well done. Starting from Java (with Apache Chemistry libraries), Javascript + JQuery, Groovy and (bascis of) PHP. Yes, I’m agree with you if you are thinking that the CMIS libraries are more and more than this but the description (and explanation) of the CMIS services (and examples) is all you need to understand how to approach the development using all the different languages supported (.NET, Python, ecc.).

As you can read from the title, Martin Bergljung focuses his description on Alfresco. And this is true because all the examples are developed using an Alfresco repository as referred architecture. But inside the book you can find something more about Alfresco. Personally I have found very interesting the description of the Alfresco Surf together with CMIS standard. Probably this topic is less useful for the most part of the readers (and practical cases) but is an interesting example related to the basics of the Alfresco Share application. Quite interesting also the example on how to make Alfresco and Drupal interact, using CMIS.

packt-publishingLast but not least, I read the book very easily in the first part (the more descriptive) and in the last (full of practical examples in the different languages). I think I will use the book also as manual of the several CMIS services when I will develop something because I suggest you to remember that…

Standard is good!

😉

Francesco Corti

Alfresco 4 Enterprise Content Management review

Alfresco 4 Enterprise Content Management During this last period I read the new book about Alfresco. The book is titled “Alfresco 4 Enterprise Content Management Implementation” and is Published by Packt Publishing.

The book it’s the natural update and evolution of the famous “Alfresco 3 Enterprise Content Management Implementation”, published in June 2009 and today not so updated. And before, of the “Alfresco Enterprise Content Management Implementation”, published in January 2007 by Packt Publishing.

You know, Alfresco features are growing very rapidly, month after month and release after release.

The fact that the book has more than 500 pages and it is written by 8 different authors (all of them are skilled IT professionists of Cignex Datamatics) clearly means that the content is relevant and detailed. Reading this book, infact, you will find that the content is not so narrative but oriented on practical topics and solutions.

The goal of the authors is relevant: explain Alfresco to IT specialists, starting from scratch and diving deep in the user experience, the customization, the development and the administration. Not bad in a single book!

The book is not specifically for developers (like the authors says) but for administrators, experienced users, developers, in particular for IT consultants.

I have found the description of the installation of the so called “Alfresco bundle” release together with the installation of all the Alfresco components, very useful because it’s not so easy to find, around in the web.

Very interesting, in my opinion, the description of the workflow using Activiti (JBPM is not described even if it supported by Alfresco 4) because it is concrete, developing practical examples for real life cases.

Last but not least, one final consideration about the explanation strategy: in my opinion too much descriptions are developed on Alfresco Explorer. We all know that Alfresco Explorer won’t be improved by Alfresco developers and Alfresco Share is the “modern” user interface. I think could be a good idea, for the next edition of the book, to move the most part of the examples (and images) to it.

In conclusion, I suggest this book to all the people that are not Alfresco Experts but would like to understand better what Alfresco is and how Alfresco works, with a good level of detail.

Have a good read during the next holidays. 😉