April 2nd, 2013
IMPROVEMENTS:
· PIG-3256: Upgrade jython to 2.5.3 (legal concern) (daijy)
· PIG-2988: start deploying pigunit maven artifact part of Pig release process (njw45 via rohini)
· PIG-3148: OutOfMemory exception while spilling stale DefaultDataBag. Extra option to gc() before spilling large bag. (knoguchi via rohini)
· PIG-3216: Groovy UDFs documentation has minor typos (herberts via rohini)
· PIG-3202: CUBE operator not documented in user docs (prasanth_j via billgraham)
BUG FIXES:
· PIG-3252: AvroStorage gives wrong schema for schemas with named records (mwagner via cheolsoo)
· PIG-3132: NPE when illustrating a relation with HCatLoader (daijy)
· PIG-3194: Changes to ObjectSerializer.java break compatibility with Hadoop 0.20.2 (prkommireddi via dvryaboy)
· PIG-3241: ConcurrentModificationException in POPartialAgg (dvryaboy)
· PIG-3144: Erroneous map entry alias resolution leading to "Duplicate schema alias" errors (jcoveney via cheolsoo)
· PIG-3212: Race Conditions in POSort and (Internal)SortedBag during Proactive Spill (kadeng via dvryaboy)
· PIG-3206: HBaseStorage does not work with Oozie pig action and secure HBase (rohini)
February 23rd, 2013
· This release includes hundreds of bug fixes and many new features, including the DateTime data type, RANK, CUBE and ROLLUP operators, Groovy UDFs, pluggable reducer estimation logic, additional UDF features, schema-based tuples and HCatalog DDL integration.
· New RANK, CUBE and ROLLUP operators
· New DateTime data type
· Support for Groovy UDFs
· Support for loading macros from jars
· Support for custom PigReducerEstimators
· Support for custom PigProgressNotificationListeners
· Support for schema-based Tuples for reduced memory footprint
· Support for passing environment variables to streaming jobs
· Support for invoking HCatalog DDL commands from Pig
· Support for .pigbootup file for defaults
· Improved support for working with Maps in Pig scripts
· Grunt improvements: history and clear
· New cleanupOnSuccess method in StoreFunc interface
· UDF timing utilities
· UDF lifecycle improvements
· UDFs for DateTime support
· Performance improvements to merge join
· Performance improvements to local mode
· Performance improvements to in memory aggregation
· Performance improvements to Spillable management
· Improvements to HBaseStorage and AvroStorage
· Penny has been removed
· 300+ bug fixes
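To illustrate the difference between the CUBE and ROLLUP operators listed above, here is a minimal Java sketch of the grouping keys each one expands a set of dimensions into. It is not Pig's implementation; the class name, method names and the sample dimensions "product" and "region" are made up for this example. The assumption is the usual OLAP meaning: CUBE produces every combination of kept and aggregated-away (null) dimensions, while ROLLUP produces only the hierarchical prefixes.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative sketch only: not Pig's implementation of CUBE or ROLLUP.
public class CubeRollupSketch {

    // CUBE: every subset of the dimensions is kept; the others become null.
    static List<List<String>> cube(List<String> dims) {
        List<List<String>> out = new ArrayList<>();
        int n = dims.size();
        for (int mask = 0; mask < (1 << n); mask++) {
            List<String> key = new ArrayList<>();
            for (int i = 0; i < n; i++) {
                // Bit i set means dimension i is aggregated away (null).
                key.add((mask & (1 << i)) == 0 ? dims.get(i) : null);
            }
            out.add(key);
        }
        return out;
    }

    // ROLLUP: only leading prefixes of the dimension list are kept.
    static List<List<String>> rollup(List<String> dims) {
        List<List<String>> out = new ArrayList<>();
        int n = dims.size();
        for (int keep = n; keep >= 0; keep--) {
            List<String> key = new ArrayList<>();
            for (int i = 0; i < n; i++) {
                key.add(i < keep ? dims.get(i) : null);
            }
            out.add(key);
        }
        return out;
    }

    public static void main(String[] args) {
        // Hypothetical dimensions, chosen only for this example.
        List<String> dims = Arrays.asList("product", "region");
        System.out.println("CUBE:   " + cube(dims));
        System.out.println("ROLLUP: " + rollup(dims));
    }
}
```

For n dimensions the sketch yields 2^n key combinations for CUBE and n + 1 for ROLLUP, which is why ROLLUP is the cheaper choice when the dimensions form a hierarchy.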
February 6th, 2013
IMPROVEMENTS:
· PIG-2907: Publish pig jars for Hadoop2/23 to maven (rohini)
· PIG-3019: Need a target in build.xml for source releases (gates)
· PIG-2794: Pig test: add utils to simplify testing on Windows (jgordon via gates)
· PIG-2908: Fix unit tests to work with jdk7 (rohini via dvryaboy)
· PIG-2852: Update documentation regarding parallel local mode execution (cheolsoo via jcoveney)
· PIG-2712: Pig does not call OutputCommitter.abortJob() on the underlying OutputFormat (rohini via gates)
· PIG-2727: PigStorage Source tagging does not need pig.splitCombination to be turned off (prkommireddi via dvryaboy)
· PIG-2711: e2e harness: cache benchmark results between test runs (thw via daijy)
· PIG-2680: TOBAG output schema reporting (andy schlaikjer via jcoveney)
· PIG-2650: Convenience mock Loader and Storer to simplify unit testing of Pig scripts (julien)
BUG FIXES:
· PIG-3107: bin and autocomplete are missing in src release (daijy)
· PIG-3106: Missing license header in several java file (daijy)
· PIG-3099: Pig unit test fixes for TestGrunt(1), TestStore(2), TestEmptyInputDir(3) (vikram.dixit via daijy)
· PIG-3035: With latest version of hadoop23 pig does not return the correct exception stack trace from backend (rohini)
· PIG-2953: "which" utility does not exist on Windows (daijy)
· PIG-2960: Increase the timeout for unit test (daijy)
· PIG-2958: Pig tests do not appear to have a logger attached (daijy)
· PIG-2942: DevTests, TestLoad has a false failure on Windows (jgordon via daijy)
· PIG-2943: DevTests, Refactor Windows checks to use new Util.WINDOWS method for code health (jgordon via dvryaboy)
· PIG-2801: grunt "sh" command should invoke the shell implicitly instead of calling exec directly with the command tokens (jgordon via daijy)
· PIG-2800: pig.additional.jars path separator should align with File.pathSeparator instead of being hard-coded to ":" (jgordon via azaroth)
· PIG-2798: pig streaming tests assume interpreters are auto-resolved (jgordon via daijy)
· PIG-2797: Tests should not create their own file URIs through string concatenation, should use Util.generateURI instead (jgordon via daijy)
· PIG-2796: Local temporary paths are not always valid HDFS path names (jgordon via daijy)
· PIG-2795: Fix test cases that generate pig scripts with "load " + pathStr to encode "\" in the path (jgordon via daijy)
· PIG-2940: HBaseStorage store fails in secure cluster (cheolsoo via daijy)
· PIG-2821: HBaseStorage should work with secure hbase (rohini via daijy)
· PIG-2890: Revert PIG-2578 (dvryaboy)
· PIG-2859: Fix few e2e test failures (rohini via daijy)
· PIG-2729: Macro expansion does not use pig.import.search.path - UnitTest borked (johannesch via daijy)
· PIG-2791: Pig does not work with Namenode Federation (rohini via daijy)
· PIG-2783: Fix Iterator_1 e2e test for Hadoop 23 (rohini via daijy)
· PIG-2761: With hadoop23 importing modules inside python script does not work (rohini via daijy)
· PIG-2759: Typo in document "Built In Functions" (daijy)
· PIG-2745: Pig e2e test RubyUDFs fails in MR mode when running from tarball (cheolsoo via daijy)
· PIG-2741: Python script throws an NameError: name 'Configuration' is not defined in case cache dir is not created (knoguchi via daijy)
· PIG-2669: Pig release should include pig-default.properties after rebuild (daijy)
· PIG-2739: PyList should map to Bag automatically in Jython (daijy)
· PIG-2730: TFileStorage getStatistics incorrectly throws an exception instead of returning null (traviscrawford via daijy)
· PIG-2717: Tuple field mangled during flattening (daijy)
· PIG-2721: Wrong output generated while loading bags as input (knoguchi via daijy)
· PIG-2912: Pig should clone JobConf while creating JobContextImpl and TaskAttemptContextImpl in Hadoop23 (rohini via daijy)
· PIG-2775: Register jar does not goes to classpath in some cases (daijy)
April 14th, 2010
· Added Zebra as a contrib project. See http://wiki.apache.org/pig/zebra
· Added UDFContext, which gives UDFs a way to pass information from the front end to the back end and gives UDFs access to JobConf in the back end. PIG-1085
· Added left outer join for fragment replicate join. PIG-1036
· Added ability to set job priority from Pig Latin. PIG-1025
· Enhanced multi-query to work with joins in some cases. PIG-983
· Reworked memory manager to significantly reduce GC Overhead and Out of Heap failures. PIG-975
· Added Accumulator interface for UDFs. PIG-979
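As a rough illustration of the idea behind the Accumulator interface mentioned above, the following self-contained Java sketch feeds a group's values to a UDF in chunks, so the whole group never has to be held in memory at once. The nested interface is a hypothetical stand-in written for this example: Pig's actual interface operates on Pig tuples and bags rather than on `List<Long>`, and the class names and chunk contents here are assumptions.

```java
import java.util.Arrays;
import java.util.List;

// Illustrative sketch only: a stand-in for the accumulate / getValue / cleanup
// life cycle, not Pig's own Accumulator interface.
public class AccumulatorSketch {

    // Hypothetical simplified interface; the real one takes Pig tuples.
    interface ChunkAccumulator<T> {
        void accumulate(List<Long> chunk); // called once per chunk of a group
        T getValue();                      // called after the last chunk
        void cleanup();                    // resets state before the next group
    }

    // A SUM-like function that keeps only a running total between chunks.
    static class SumAccumulator implements ChunkAccumulator<Long> {
        private long sum = 0;

        @Override
        public void accumulate(List<Long> chunk) {
            for (long v : chunk) {
                sum += v;
            }
        }

        @Override
        public Long getValue() {
            return sum;
        }

        @Override
        public void cleanup() {
            sum = 0;
        }
    }

    // Drives one group through the life cycle and returns its result.
    static long sumInChunks(List<List<Long>> chunks) {
        SumAccumulator acc = new SumAccumulator();
        for (List<Long> chunk : chunks) {
            acc.accumulate(chunk);
        }
        long result = acc.getValue();
        acc.cleanup();
        return result;
    }

    public static void main(String[] args) {
        // Two hypothetical chunks belonging to the same group.
        List<List<Long>> chunks = Arrays.asList(
                Arrays.asList(1L, 2L, 3L),
                Arrays.asList(4L, 5L));
        System.out.println(sumInChunks(chunks));
    }
}
```

The design point is that only the running state (here a single `long`) survives between chunks, which fits functions such as SUM or COUNT that do not need to see all values of a group together.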
September 1st, 2009
· The main focus of this release is multiquery support, which allows Pig to optimize multiple queries within the same script that share a computation.