• Home
  • Apply for Beta
  • Team
  • Publications
KEEP IN TOUCH

Rapid-I and Radoop enter partnership for user-friendly big data analytics

Feb23
2012
4 Comments Written by Zoltán Prekopcsák

The Radoop story started with a research project when we wanted to analyze a large dataset with Apache Hadoop. We have instantly realized that Hadoop lacks graphical user interfaces and generally, big data systems are really hard to deal with. We had a strong background in graphical data mining tools, so we have decided to create an extension for one of them. The evaluation did not take long, as one tool was way ahead of the others. RapidMiner had hundreds of operators, an easy-to-use interface, and a very flexible extension system.

Today, more than one year later, we are even more satisfied with our decision and we are happy to announce a partnership between Radoop and Rapid-I, the company behind RapidMiner and the leading provider of open-source solutions for data mining and predictive analytics. This partnership is the first step in a long-term collaboration, and it marks our commitment to keep Radoop and RapidMiner highly integrated.

You can read more in the press release.

Posted in Announcements - Tagged big data, partnership, rapid-i, rapidminer
SHARE THIS Twitter Facebook Delicious StumbleUpon E-mail

Radoop 0.3 released – fully graphical big data analytics

Feb15
2012
2 Comments Written by Zoltán Prekopcsák

Today, Radoop had a major step forward with its 0.3 release. The new version of the visual big data analytics package adds full support for all major Hadoop distributions used these days: Apache Hadoop 0.20.2, 0.20.203, 1.0 and Cloudera’s Distribution including Apache Hadoop 3 (CDH3). It also adds support for large clusters by allowing the namenode, the jobtracker and the Hive server to reside on different nodes.

As Radoop’s promise is to make big data analytics easier, the 0.3 release is also focused on improving the user interface. It has an enhanced breakpointing system which allows to investigate intermediate results, and it adds dozens of quick fixes, so common process design mistakes get much easier to solve.

There are many further improvements and fixes, so please consult the release notes for more details. Radoop is in private beta mode, but heading towards a public release in Q2 2012. If you would like to get early access, then please apply at the signup page or describe your use case in email (beta at radoop.eu).

Posted in Announcements - Tagged apache, beta, cloudera, hadoop, hive, release
SHARE THIS Twitter Facebook Delicious StumbleUpon E-mail

Radoop enters the Cloudera Connect Partner Program

Jan10
2012
Leave a Comment Written by Zoltán Prekopcsák

Having strong partnerships is one of the foundations of a successful company. We feel important to collaborate with groups leading the big data revolution and Cloudera is in the top of the league.

Today, we are happy to announce that Radoop has became a Cloudera Connect Member. This allows us to have a closer look on Cloudera’s technology and future products, so we are in a better position to ensure Radoop compatibility with clusters based on Cloudera’s Distribution Including Apache Hadoop (CDH).

CDH includes 13 open-source software components packaged together to form a fully functional big data distribution based on Apache Hadoop. Radoop is also built on Apache Hadoop and uses Apache Hive for data analytics and Apache Mahout for machine learning. All these software tools are included in CDH, so it is a great choice for using Radoop.

Radoop is still in private beta mode, so please apply if you would like to evaluate it for your big data analytics needs.

Posted in Announcements
SHARE THIS Twitter Facebook Delicious StumbleUpon E-mail

Big Data analytics made even easier – Radoop 0.2.2 released

Dec06
2011
Leave a Comment Written by Zoltán Prekopcsák

We are happy to announce a new release of Radoop, versioned 0.2.2. Since the 0.2 release in September, we have been working hard to add new features, fix bugs, improve the user experience and make the user interface more intuitive. You can read more on the changes in the release notes.

In the past few months, Radoop has been presented at Hadoop World and we have published several screencast tutorials to make the start even easier for first-time users. We have a strong focus on the interface to match our promise “Big Data analytics made easy”. Radoop can be used with limited technical knowledge, but it is flexible enough so even experienced analysts can use it in their daily work.

Radoop is in a private beta stage with dozens of companies and analysts evaluating it. If you would like to get access, then please apply for the beta programme or contact us directly and describe your big data analytics use-cases.

Posted in Announcements - Tagged beta, hadoop world, release
SHARE THIS Twitter Facebook Delicious StumbleUpon E-mail

Hello Hadoop World! Radoop is ready for your thoughts

Nov08
2011
Leave a Comment Written by Zoltán Prekopcsák

The annual Hadoop World conference has just started today in New York City. This is the third time that big data experts come together and present what they have achieved in the past year. The agenda is fully packed with presentations from Cloudera, Hortonworks, Facebook, LinkedIn, Oracle, Informatica, Ebay, NetApp, and other big data companies. This year, for the first time, Radoop will be presented by Gabor Makrai right after lunch tomorrow in the Data Scientist track. Feel free to catch Gabor at the conference, share your comments here in the blog, or start a discussion on Twitter.

Posted in Announcements - Tagged hadoop world, new york
SHARE THIS Twitter Facebook Delicious StumbleUpon E-mail

Data import and export tutorial

Nov01
2011
Leave a Comment Written by Zoltán Prekopcsák

The main goal of Radoop is to make it easy to do data analytics with Hadoop. In the first tutorial we have shown how to set up Radoop with your cluster. The next step is usually to import your data, do some transformations, and export the results. The following screencast shows you how to solve it with Radoop.

Posted in Announcements - Tagged data, export, import, screencast, tutorial
SHARE THIS Twitter Facebook Delicious StumbleUpon E-mail

Working on the next big thing…

Oct20
2011
Leave a Comment Written by Zoltán Prekopcsák

We have been silent on this blog for the past few weeks but it does not mean that we are slowing down. We are working hard to create the next big thing in big data analytics, but it will take several months to reach our goal and release Radoop publicly.

In the meantime, you can still apply for evaluating our beta release and if you do so then you will also get timely notification on our public release. You should also visit Hadoop World, where our CTO, Gabor Makrai will present the Radoop vision and many technical details on how we simplify Hadoop technologies.

Posted in Announcements
SHARE THIS Twitter Facebook Delicious StumbleUpon E-mail

Configuring Apache and Cloudera clusters for Radoop

Sep19
2011
Leave a Comment Written by Zoltán Prekopcsák

This is a technical post for people setting up a Hadoop cluster for Radoop.

Hadoop is a single brand, but there are many different versions and distributions nowadays. The two most popular ones are Apache Hadoop 0.20.2 and the Cloudera Distribution including Apache Hadoop version 3 (CDH3). Radoop can work with both of these distributions, but needs an additional Hive 0.7.x installation in case of Apache Hadoop (CDH3 already has it).

Radoop needs three open ports to operate properly:

  • HDFS port, which is 9000 or 8020 by default
  • MapReduce port, which is 9001 or 8021 by default
  • Hive server port, which is 10000 by default

That’s all, if you have these ports open and provide them on the connection management panel, then you are ready to use Radoop’s drag-and-drop interface. However, you might run into problems with these ports if you have the Cloudera distribution, especially if you are running the virtual machines.

First of all, on the Cloudera distribution, Hive is configured for single connection, and Hue is already connected. Second, the Hive server is not running, so you need to start it manually (the manual start is also needed in case of an Apache Hadoop installation). You need to go to the Terminal and type the following commands to stop Hue and start the Hive server:

sudo /etc/init.d/hue stop
sudo hive --service hiveserver

After these steps, the Hive server should be accessible from Radoop.

A further common problem with virtual machine distributions is that the HDFS and MapReduce ports are only open for localhost connections. The workaround to solve this is to install Radoop on the master virtual machine and define the connection to localhost.

Setting up a cluster and making it work with Radoop might seem tricky sometimes, but after the initial configuration, you can use the drag-and-drop interface to analyze and visualize your data which makes big data analytics much easier in the long run. We are working hard to make the first steps as easy as possible, so you can expect further solutions in this direction.

Posted in Announcements - Tagged apache, cloudera, hadoop, hive, port, technical
SHARE THIS Twitter Facebook Delicious StumbleUpon E-mail

Screencast on setting up Hadoop connections

Sep16
2011
2 Comments Written by Zoltán Prekopcsák

The first screencast on Radoop is out and you can see how you can configure the connection to a Hadoop cluster and retrieve data from a Hive database. More tutorials and screencasts will be shared in the coming weeks.

Posted in Announcements - Tagged connection, hadoop, hive, screencast, tutorial
SHARE THIS Twitter Facebook Delicious StumbleUpon E-mail

Radoop to be presented at Hadoop World

Sep13
2011
Leave a Comment Written by Zoltán Prekopcsák

We are happy to announce that Radoop will be presented at the largest Hadoop event this year in New York City. Gábor Makrai, co-founder and CTO, will talk about the importance of visual analytics for big data and he will explain the technical details of the Radoop package.

You can read the abstract here and don’t forget to register for Hadoop World 2011!

Posted in Announcements - Tagged big data, cloudera, hadoop world, new york
SHARE THIS Twitter Facebook Delicious StumbleUpon E-mail
« Older Entries

About

Radoop is an easy-to-use interface for Big Data analytics and machine learning over Hadoop. Read more

Tags

apache athens beta big data boston budapest california cloudera connection data data mining disrupt distributed ecml-pkdd export feedback genome hadoop hadoop world hive hortonworks ideas import interface launch mahout new york partnership port prezi rapid-i rapidminer rcomm release san francisco screencast skype techcrunch technical tutorial yahoo

Partners

EvoLve theme by Theme4Press  •  Powered by WordPress Radoop
Big Data analytics made easy