?
Solved

Looking for help with PIG Latin

Posted on 2014-07-25
4
Medium Priority
?
309 Views
Last Modified: 2014-08-08
Hello All,

I have recently started to use Apache PIG and came across shingling which looks like a very good way of doing document comparisons. Has anyone have any experience trying to implement Shingling in PIG?

Or has any references that can point to that I can look at.

Thanks,

LJ
0
Comment
Question by:Jordan_WM
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
4 Comments
 
LVL 101

Accepted Solution

by:
mlmcc earned 2000 total points
ID: 40231611
Here are some links to the Apache Pig documentation and some tutorials

Apache Pig home -  Includes links to Apache Pig documentation.
http://pig.apache.org/

Tutorial from Apache
http://pig.apache.org/docs/r0.8.1/tutorial.pdf

How To Process Data with Apache Pig - Basic explanation of Apache Pig and an example using a baseball database.
http://hortonworks.com/hadoop-tutorial/how-to-process-data-with-apache-pig/

Pig Tutorial - another tutorial from Apache.
http://pig.apache.org/docs/r0.7.0/tutorial.html

mlmcc
0

Featured Post

Microsoft Certification Exam 74-409

Veeam® is happy to provide the Microsoft community with a study guide prepared by MVP and MCT, Orin Thomas. This guide will take you through each of the exam objectives, helping you to prepare for and pass the examination.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

A Stored Procedure in Microsoft SQL Server is a powerful feature that it can be used to execute the Data Manipulation Language (DML) or Data Definition Language (DDL). Depending on business requirements, a single Stored Procedure can return differe…
In this blog post, we’ll look at how ClickHouse performs in a general analytical workload using the star schema benchmark test.
Starting up a Project
In this video, Percona Solution Engineer Dimitri Vanoverbeke discusses why you want to use at least three nodes in a database cluster. To discuss how Percona Consulting can help with your design and architecture needs for your database and infras…

800 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question