设为首页 收藏本站
查看: 602|回复: 0

[经验分享] 升级hadoop0.20.2到hadoop-0.21.0

[复制链接]

尚未签到

发表于 2016-12-8 09:16:01 | 显示全部楼层 |阅读模式
  按照新的文档来 更新配置: http://hadoop.apache.org/common/docs/current/cluster_setup.html
  发现多了很多东西,新的文档也比以前的详细,好的多.
  不过此次只为了升级hadoop0.20.2到hadoop-0.21.0
  看看发布的版本:http://hadoop.apache.org/common/releases.html



  • 23
    August, 2010: release 0.21.0 available

  • 26
    February, 2010: release 0.20.2 available
  整整半年了,等的好辛苦,等得Hbase也很辛苦。 本次版本的更新,修复了N多的BUG ,看看吧:
  http://hadoop.apache.org/common/docs/r0.21.0/changes.html
  着实吓了一跳:



  • INCOMPATIBLE CHANGES
       (31)


    • HADOOP-4895
      .
      Remove deprecated methods DFSClient.getHints(..) and
      DFSClient.isDirectory(..).

      (szetszwo)

    • HADOOP-4941
      .
      Remove deprecated FileSystem methods: getBlockSize(Path f),
      getLength(Path f) and getReplication(Path src).

      (szetszwo)

    • HADOOP-4648
      .
      Remove obsolete, deprecated InMemoryFileSystem and
      ChecksumDistributedFileSystem.

      (cdouglas via szetszwo)

    • HADOOP-4940
      .
      Remove a deprecated method FileSystem.delete(Path f).

      (Enis
      Soztutar via szetszwo)

    • HADOOP-4010
      .
      Change semantics for LineRecordReader to read an additional
      line per split- rather than moving back one character in the stream- to
      work with splittable compression codecs.

      (Abdul Qadeer via cdouglas)

    • HADOOP-5094
      .
      Show hostname and separate live/dead datanodes in DFSAdmin
      report.

      (Jakob Homan via szetszwo)

    • HADOOP-4942
      .
      Remove deprecated FileSystem methods getName() and
      getNamed(String name, Configuration conf).

      (Jakob Homan via szetszwo)

    • HADOOP-5486
      .
      Removes the CLASSPATH string from the command line and instead
      exports it in the environment.

      (Amareshwari Sriramadasu via ddas)

    • HADOOP-2827
      .
      Remove deprecated NetUtils::getServerAddress.

      (cdouglas)

    • HADOOP-5681
      .
      Change examples RandomWriter and RandomTextWriter to
      use new mapreduce API.

      (Amareshwari Sriramadasu via sharad)

    • HADOOP-5680
      .
      Change org.apache.hadoop.examples.SleepJob to use new
      mapreduce api.

      (Amareshwari Sriramadasu via sharad)

    • HADOOP-5699
      .
      Change org.apache.hadoop.examples.PiEstimator to use
      new mapreduce api.

      (Amareshwari Sriramadasu via sharad)

    • HADOOP-5720
      .
      Introduces new task types - JOB_SETUP, JOB_CLEANUP
      and TASK_CLEANUP. Removes the isMap methods from TaskID/TaskAttemptID
      classes.

      (ddas)

    • HADOOP-5668
      .
      Change TotalOrderPartitioner to use new API.

      (Amareshwari
      Sriramadasu via cdouglas)

    • HADOOP-5738
      .
      Split "waiting_tasks" JobTracker metric into waiting maps and
      waiting reduces.

      (Sreekanth Ramakrishnan via cdouglas)

    • HADOOP-5679
      .
      Resolve findbugs warnings in core/streaming/pipes/examples.

      (Jothi
      Padmanabhan via sharad)

    • HADOOP-4359
      .
      Support for data access authorization checking on Datanodes.

      (Kan
      Zhang via rangadi)

    • HADOOP-5690
      .
      Change org.apache.hadoop.examples.DBCountPageView to use
      new mapreduce api.

      (Amareshwari Sriramadasu via sharad)

    • HADOOP-5694
      .
      Change org.apache.hadoop.examples.dancing to use new
      mapreduce api.

      (Amareshwari Sriramadasu via sharad)

    • HADOOP-5696
      .
      Change org.apache.hadoop.examples.Sort to use new
      mapreduce api.

      (Amareshwari Sriramadasu via sharad)

    • HADOOP-5698
      .
      Change org.apache.hadoop.examples.MultiFileWordCount to
      use new mapreduce api.

      (Amareshwari Sriramadasu via sharad)

    • HADOOP-5913
      .
      Provide ability to an administrator to stop and start
      job queues.

      (Rahul Kumar Singh and Hemanth Yamijala via yhemanth)
    • MAPREDUCE-711. Removed Distributed Cache from Common, to move
      it
      under Map/Reduce.

      (Vinod Kumar Vavilapalli via yhemanth)

    • HADOOP-6201
      .
      Change FileSystem::listStatus contract to throw
      FileNotFoundException if the directory does not exist, rather than
      letting
      this be implementation-specific.

      (Jakob Homan via cdouglas)

    • HADOOP-6230
      .
      Moved process tree and memory calculator related classes
      from Common to Map/Reduce.

      (Vinod Kumar Vavilapalli via yhemanth)

    • HADOOP-6203
      .
      FsShell rm/rmr error message indicates exceeding Trash quota
      and suggests using -skpTrash, when moving to trash fails.

      (Boris
      Shkolnik via suresh)

    • HADOOP-6303
      .
      Eclipse .classpath template has outdated jar files and is
      missing some new ones.

      (cos)

    • HADOOP-6396
      .
      Fix uninformative exception message when unable to parse
      umask.

      (jghoman)

    • HADOOP-6299
      .
      Reimplement the UserGroupInformation to use the OS
      specific and Kerberos JAAS login.

      (omalley)

    • HADOOP-6686
      .
      Remove redundant exception class name from the exception
      message for the exceptions thrown at RPC client.

      (suresh)

    • HADOOP-6701
      .
      Fix incorrect exit codes returned from chmod, chown and chgrp
      commands from FsShell.

      (Ravi Phulari via suresh)



  • NEW FEATURES
       (59)


    • HADOOP-6332
      .
      Large-scale Automated Test Framework.

      (sharad, Sreekanth
      Ramakrishnan, at all via cos)

    • HADOOP-4268
      .
      Change fsck to use ClientProtocol methods so that the
      corresponding permission requirement for running the ClientProtocol
      methods will be enforced.

      (szetszwo)

    • HADOOP-3953
      .
      Implement sticky bit for directories in HDFS.

      (Jakob Homan
      via szetszwo)

    • HADOOP-4368
      .
      Implement df in FsShell to show the status of a FileSystem.

      (Craig
      Macdonald via szetszwo)

    • HADOOP-3741
      .
      Add a web ui to the SecondaryNameNode for showing its status.

      (szetszwo)

    • HADOOP-5018
      .
      Add pipelined writers to Chukwa.

      (Ari Rabkin via cdouglas)

    • HADOOP-5052
      .
      Add an example computing exact digits of pi using the
      Bailey-Borwein-Plouffe algorithm. (Tsz Wo (Nicholas), SZE via cdouglas)

    • HADOOP-4927
      .
      Adds a generic wrapper around outputformat to allow creation of
      output on demand

      (Jothi Padmanabhan via ddas)

    • HADOOP-5144
      .
      Add a new DFSAdmin command for changing the setting of restore
      failed storage replicas in namenode.

      (Boris Shkolnik via szetszwo)

    • HADOOP-5258
      .
      Add a new DFSAdmin command to print a tree of the rack and
      datanode topology as seen by the namenode.

      (Jakob Homan via szetszwo)

    • HADOOP-4756
      .
      A command line tool to access JMX properties on NameNode
      and DataNode.

      (Boris Shkolnik via rangadi)

    • HADOOP-4539
      .
      Introduce backup node and checkpoint node.

      (shv)

    • HADOOP-5363
      .
      Add support for proxying connections to multiple clusters with
      different versions to hdfsproxy.

      (Zhiyong Zhang via cdouglas)

    • HADOOP-5528
      .
      Add a configurable hash partitioner operating on ranges of
      BinaryComparable keys.

      (Klaas Bosteels via shv)

    • HADOOP-5257
      .
      HDFS servers may start and stop external components through
      a plugin interface.

      (Carlos Valiente via dhruba)

    • HADOOP-5450
      .
      Add application-specific data types to streaming's typed bytes
      interface.

      (Klaas Bosteels via omalley)

    • HADOOP-5518
      .
      Add contrib/mrunit, a MapReduce unit test framework.

      (Aaron Kimball
      via cutting)

    • HADOOP-5469
      .
      Add /metrics servlet to daemons, providing metrics
      over HTTP as either text or JSON.

      (Philip Zeyliger via cutting)

    • HADOOP-5467
      .
      Introduce offline fsimage image viewer.

      (Jakob Homan via shv)

    • HADOOP-5752
      .
      Add a new hdfs image processor, Delimited, to oiv.

      (Jakob
      Homan via szetszwo)

    • HADOOP-5266
      .
      Adds the capability to do mark/reset of the reduce values
      iterator in the Context object API.

      (Jothi Padmanabhan via ddas)

    • HADOOP-5745
      .
      Allow setting the default value of maxRunningJobs for all
      pools.

      (dhruba via matei)

    • HADOOP-5643
      .
      Adds a way to decommission TaskTrackers while the JobTracker
      is running.

      (Amar Kamat via ddas)

    • HADOOP-4829
      .
      Allow FileSystem shutdown hook to be disabled.

      (Todd Lipcon via
      tomwhite)

    • HADOOP-5815
      .
      Sqoop: A database import tool for Hadoop.

      (Aaron Kimball via
      tomwhite)

    • HADOOP-4861
      .
      Add disk usage with human-readable size (-duh).

      (Todd Lipcon via
      tomwhite)

    • HADOOP-5844
      .
      Use mysqldump when connecting to local mysql instance in Sqoop.

      (Aaron
      Kimball via tomwhite)

    • HADOOP-5976
      .
      Add a new command, classpath, to the hadoop script.

      (Owen
      O'Malley and Gary Murry via szetszwo)

    • HADOOP-6120
      .
      Add support for Avro specific and reflect data.

      (sharad via cutting)

    • HADOOP-6226
      .
      Moves BoundedByteArrayOutputStream from the tfile package to
      the io package and makes it available to other users (MAPREDUCE-318).

      (Jothi
      Padmanabhan via ddas)

    • HADOOP-6105
      .
      Adds support for automatically handling deprecation of
      configuration keys.

      (V.V.Chaitanya Krishna via yhemanth)

    • HADOOP-6235
      .
      Adds new method to FileSystem for clients to get server
      defaults.

      (Kan Zhang via suresh)

    • HADOOP-6234
      .
      Add new option dfs.umaskmode to set umask in configuration
      to use octal or symbolic instead of decimal.

      (Jakob Homan via suresh)

    • HADOOP-5073
      .
      Add annotation mechanism for interface classification.

      (Jakob Homan
      via suresh)

    • HADOOP-4012
      .
      Provide splitting support for bzip2 compressed files.

      (Abdul
      Qadeer via cdouglas)

    • HADOOP-6246
      .
      Add backward compatibility support to use deprecated decimal
      umask from old configuration.

      (Jakob Homan via suresh)

    • HADOOP-4952
      .
      Add new improved file system interface FileContext for the
      application writer

      (Sanjay Radia via suresh)

    • HADOOP-6170
      .
      Add facility to tunnel Avro RPCs through Hadoop RPCs.
      This permits one to take advantage of both Avro's RPC versioning
      features and Hadoop's proven RPC scalability.

      (cutting)

    • HADOOP-6267
      .
      Permit building contrib modules located in external
      source trees.

      (Todd Lipcon via cutting)

    • HADOOP-6240
      .
      Add new FileContext rename operation that posix compliant
      that allows overwriting existing destination.

      (suresh)

    • HADOOP-6204
      .
      Implementing aspects development and fault injeciton
      framework for Hadoop

      (cos)

    • HADOOP-6313
      .
      Implement Syncable interface in FSDataOutputStream to expose
      flush APIs to application users.

      (Hairong Kuang via suresh)

    • HADOOP-6284
      .
      Add a new parameter, HADOOP_JAVA_PLATFORM_OPTS, to
      hadoop-config.sh so that it allows setting java command options for
      JAVA_PLATFORM.

      (Koji Noguchi via szetszwo)

    • HADOOP-6337
      .
      Updates FilterInitializer class to be more visible,
      and the init of the class is made to take a Configuration argument.

      (Jakob
      Homan via ddas)
    • Hadoop-6223. Add new file system interface AbstractFileSystem
      with
      implementation of some file systems that delegate to old FileSystem.

      (Sanjay
      Radia via suresh)

    • HADOOP-6433
      .
      Introduce asychronous deletion of files via a pool of
      threads. This can be used to delete files in the Distributed
      Cache.

      (Zheng Shao via dhruba)

    • HADOOP-6415
      .
      Adds a common token interface for both job token and
      delegation token.

      (Kan Zhang via ddas)

    • HADOOP-6408
      .
      Add a /conf servlet to dump running configuration.

      (Todd Lipcon via
      tomwhite)

    • HADOOP-6520
      .
      Adds APIs to read/write Token and secret keys. Also
      adds the automatic loading of tokens into UserGroupInformation
      upon login. The tokens are read from a file specified in the
      environment variable.

      (ddas)

    • HADOOP-6419
      .
      Adds SASL based authentication to RPC.

      (Kan Zhang via ddas)

    • HADOOP-6510
      .
      Adds a way for superusers to impersonate other users
      in a secure environment.

      (Jitendra Nath Pandey via ddas)

    • HADOOP-6421
      .
      Adds Symbolic links to FileContext, AbstractFileSystem.
      It also adds a limited implementation for the local file system
      (RawLocalFs) that allows local symlinks.

      (Eli Collins via Sanjay
      Radia)

    • HADOOP-6577
      .
      Add hidden configuration option "ipc.server.max.response.size"
      to change the default 1 MB, the maximum size when large IPC handler
      response buffer is reset.

      (suresh)

    • HADOOP-6568
      .
      Adds authorization for the default servlets.

      (Vinod Kumar
      Vavilapalli via ddas)

    • HADOOP-6586
      .
      Log authentication and authorization failures and successes
      for RPC

      (boryas)

    • HADOOP-6580
      .
      UGI should contain authentication method.

      (jnp via boryas)

    • HADOOP-6657
      .
      Add a capitalization method to StringUtils for MAPREDUCE-1545.

      (Luke
      Lu via Steve Loughran)

    • HADOOP-6692
      .
      Add FileContext#listStatus that returns an iterator.

      (hairong)

    • HADOOP-6869
      .
      Functionality to create file or folder on a remote daemon
      side

      (Vinay Thota via cos)



  • IMPROVEMENTS
       (198)


    • HADOOP-6798
      .
      Align Ivy version for all Hadoop subprojects.

      (cos)

    • HADOOP-6777
      .
      Implement a functionality for suspend and resume a process.

      (Vinay
      Thota via cos)

    • HADOOP-6772
      .
      Utilities for system tests specific.

      (Vinay Thota via cos)

    • HADOOP-6771
      .
      Herriot's artifact id for Maven deployment should be set to
      hadoop-core-instrumented

      (cos)

    • HADOOP-6752
      .
      Remote cluster control functionality needs JavaDocs
      improvement (Balaji Rajagopalan via cos).

    • HADOOP-4565
      .
      Added CombineFileInputFormat to use data locality information
      to create splits.

      (dhruba via zshao)

    • HADOOP-4936
      .
      Improvements to TestSafeMode.

      (shv)

    • HADOOP-4985
      .
      Remove unnecessary "throw IOException" declarations in
      FSDirectory related methods.

      (szetszwo)

    • HADOOP-5017
      .
      Change NameNode.namesystem declaration to private.

      (szetszwo)

    • HADOOP-4794
      .
      Add branch information from the source version control into
      the version information that is compiled into Hadoop.

      (cdouglas via
      omalley)

    • HADOOP-5070
      .
      Increment copyright year to 2009, remove assertions of ASF
      copyright to licensed files. (Tsz Wo (Nicholas), SZE via cdouglas)

    • HADOOP-5037
      .
      Deprecate static FSNamesystem.getFSNamesystem().

      (szetszwo)

    • HADOOP-5088
      .
      Include releaseaudit target as part of developer test-patch
      target.

      (Giridharan Kesavan via nigel)

    • HADOOP-2721
      .
      Uses setsid when creating new tasks so that subprocesses of
      this process will be within this new session (and this process will be
      the process leader for all the subprocesses). Killing the process
      leader,
      or the main Java task in Hadoop's case, kills the entire subtree of
      processes.

      (Ravi Gummadi via ddas)

    • HADOOP-5097
      .
      Remove static variable JspHelper.fsn, a static reference to
      a non-singleton FSNamesystem object.

      (szetszwo)

    • HADOOP-3327
      .
      Improves handling of READ_TIMEOUT during map output copying.

      (Amareshwari
      Sriramadasu via ddas)

    • HADOOP-5124
      .
      Choose datanodes randomly instead of starting from the first
      datanode for providing fairness.

      (hairong via szetszwo)

    • HADOOP-4930
      .
      Implement a Linux native executable that can be used to
      launch tasks as users.

      (Sreekanth Ramakrishnan via yhemanth)

    • HADOOP-5122
      .
      Fix format of fs.default.name value in libhdfs test conf.

      (Craig
      Macdonald via tomwhite)

    • HADOOP-5038
      .
      Direct daemon trace to debug log instead of stdout.

      (Jerome
      Boulon via cdouglas)

    • HADOOP-5101
      .
      Improve packaging by adding 'all-jars' target building core,
      tools, and example jars. Let findbugs depend on this rather than the
      'tar'
      target.

      (Giridharan Kesavan via cdouglas)

    • HADOOP-4868
      .
      Splits the hadoop script into three parts - bin/hadoop,
      bin/mapred and bin/hdfs.

      (Sharad Agarwal via ddas)

    • HADOOP-1722
      .
      Adds support for TypedBytes and RawBytes in Streaming.

      (Klaas
      Bosteels via ddas)

    • HADOOP-4220
      .
      Changes the JobTracker restart tests so that they take much
      less time.

      (Amar Kamat via ddas)

    • HADOOP-4885
      .
      Try to restore failed name-node storage directories at
      checkpoint time.

      (Boris Shkolnik via shv)

    • HADOOP-5209
      .
      Update year to 2009 for javadoc.

      (szetszwo)

    • HADOOP-5279
      .
      Remove unnecessary targets from test-patch.sh.

      (Giridharan Kesavan
      via nigel)

    • HADOOP-5120
      .
      Remove the use of FSNamesystem.getFSNamesystem() from
      UpgradeManagerNamenode and UpgradeObjectNamenode.

      (szetszwo)

    • HADOOP-5222
      .
      Add offset to datanode clienttrace.

      (Lei Xu via cdouglas)

    • HADOOP-5240
      .
      Skip re-building javadoc when it is already
      up-to-date.

      (Aaron Kimball via cutting)

    • HADOOP-5042
      .
      Add a cleanup stage to log rollover in Chukwa appender.

      (Jerome
      Boulon via cdouglas)

    • HADOOP-5264
      .
      Removes redundant configuration object from the TaskTracker.

      (Sharad
      Agarwal via ddas)

    • HADOOP-5232
      .
      Enable patch testing to occur on more than one host.

      (Giri Kesavan
      via nigel)

    • HADOOP-4546
      .
      Fix DF reporting for AIX.

      (Bill Habermaas via cdouglas)

    • HADOOP-5023
      .
      Add Tomcat support to HdfsProxy.

      (Zhiyong Zhang via cdouglas)

    • HADOOP-5317
      .
      Provide documentation for LazyOutput Feature.

      (Jothi Padmanabhan via
      johan)

    • HADOOP-5455
      .
      Document rpc metrics context to the extent dfs, mapred, and
      jvm contexts are documented.

      (Philip Zeyliger via cdouglas)

    • HADOOP-5358
      .
      Provide scripting functionality to the synthetic load
      generator.

      (Jakob Homan via hairong)

    • HADOOP-5442
      .
      Paginate jobhistory display and added some search
      capabilities.

      (Amar Kamat via acmurthy)

    • HADOOP-4842
      .
      Streaming now allows specifiying a command for the combiner.

      (Amareshwari
      Sriramadasu via ddas)

    • HADOOP-5196
      .
      avoiding unnecessary byte[] allocation in
      SequenceFile.CompressedBytes and SequenceFile.UncompressedBytes.

      (hong
      tang via mahadev)

    • HADOOP-4655
      .
      New method FileSystem.newInstance() that always returns
      a newly allocated FileSystem object.

      (dhruba)

    • HADOOP-4788
      .
      Set Fair scheduler to assign both a map and a reduce on each
      heartbeat by default.

      (matei)

    • HADOOP-5491
      .
      In contrib/index, better control memory usage.

      (Ning Li via
      cutting)

    • HADOOP-5423
      .
      Include option of preserving file metadata in
      SequenceFile::sort.

      (Michael Tamm via cdouglas)

    • HADOOP-5331
      .
      Add support for KFS appends.

      (Sriram Rao via cdouglas)

    • HADOOP-4365
      .
      Make Configuration::getProps protected in support of
      meaningful subclassing.

      (Steve Loughran via cdouglas)

    • HADOOP-2413
      .
      Remove the static variable FSNamesystem.fsNamesystemObject.

      (Konstantin
      Shvachko via szetszwo)

    • HADOOP-4584
      .
      Improve datanode block reports and associated file system
      scan to avoid interefering with normal datanode operations.

      (Suresh
      Srinivas via rangadi)

    • HADOOP-5502
      .
      Documentation for backup and checkpoint nodes.

      (Jakob Homan via shv)

    • HADOOP-5485
      .
      Mask actions in the fair scheduler's servlet UI based on
      value of webinterface.private.actions.

      (Vinod Kumar Vavilapalli via
      yhemanth)

    • HADOOP-5581
      .
      HDFS should throw FileNotFoundException when while opening
      a file that does not exist.

      (Brian Bockelman via rangadi)

    • HADOOP-5509
      .
      PendingReplicationBlocks does not start monitor in the
      constructor.

      (shv)

    • HADOOP-5494
      .
      Modify sorted map output merger to lazily read values,
      rather than buffering at least one record for each segment.

      (Devaraj
      Das
      via cdouglas)

    • HADOOP-5396
      .
      Provide ability to refresh queue ACLs in the JobTracker
      without having to restart the daemon.

      (Sreekanth Ramakrishnan and
      Vinod Kumar Vavilapalli via yhemanth)

    • HADOOP-4490
      .
      Provide ability to run tasks as job owners.

      (Sreekanth Ramakrishnan
      via yhemanth)

    • HADOOP-5697
      .
      Change org.apache.hadoop.examples.Grep to use new
      mapreduce api.

      (Amareshwari Sriramadasu via sharad)

    • HADOOP-5625
      .
      Add operation duration to clienttrace.

      (Lei Xu via cdouglas)

    • HADOOP-5705
      .
      Improve TotalOrderPartitioner efficiency by updating the trie
      construction.

      (Dick King via cdouglas)

    • HADOOP-5589
      .
      Eliminate source limit of 64 for map-side joins imposed by
      TupleWritable encoding.

      (Jingkei Ly via cdouglas)

    • HADOOP-5734
      .
      Correct block placement policy description in HDFS
      Design document.

      (Konstantin Boudnik via shv)

    • HADOOP-5657
      .
      Validate data in TestReduceFetch to improve merge test
      coverage.

      (cdouglas)

    • HADOOP-5613
      .
      Change S3Exception to checked exception.

      (Andrew Hitchcock via
      tomwhite)

    • HADOOP-5717
      .
      Create public enum class for the Framework counters in
      org.apache.hadoop.mapreduce.

      (Amareshwari Sriramadasu via sharad)

    • HADOOP-5217
      .
      Split AllTestDriver for core, hdfs and mapred.

      (sharad)

    • HADOOP-5364
      .
      Add certificate expiration warning to HsftpFileSystem and HDFS
      proxy.

      (Zhiyong Zhang via cdouglas)

    • HADOOP-5733
      .
      Add map/reduce slot capacity and blacklisted capacity to
      JobTracker metrics.

      (Sreekanth Ramakrishnan via cdouglas)

    • HADOOP-5596
      .
      Add EnumSetWritable.

      (He Yongqiang via szetszwo)

    • HADOOP-5727
      .
      Simplify hashcode for ID types.

      (Shevek via cdouglas)

    • HADOOP-5500
      .
      In DBOutputFormat, where field names are absent permit the
      number of fields to be sufficient to construct the select query.

      (Enis
      Soztutar
      via cdouglas)

    • HADOOP-5081
      .
      Split TestCLI into HDFS, Mapred and Core tests.

      (sharad)

    • HADOOP-5015
      .
      Separate block management code from FSNamesystem.

      (Suresh
      Srinivas via szetszwo)

    • HADOOP-5080
      .
      Add new test cases to TestMRCLI and TestHDFSCLI

      (V.Karthikeyan via
      nigel)

    • HADOOP-5135
      .
      Splits the tests into different directories based on the
      package. Four new test targets have been defined - run-test-core,
      run-test-mapred, run-test-hdfs and run-test-hdfs-with-mr.

      (Sharad
      Agarwal via ddas)

    • HADOOP-5771
      .
      Implements unit tests for LinuxTaskController.

      (Sreekanth
      Ramakrishnan and Vinod Kumar Vavilapalli via yhemanth)

    • HADOOP-5419
      .
      Provide a facility to query the Queue ACLs for the
      current user.

      (Rahul Kumar Singh via yhemanth)

    • HADOOP-5780
      .
      Improve per block message prited by "-metaSave" in HDFS.

      (Raghu
      Angadi)

    • HADOOP-5823
      .
      Added a new class DeprecatedUTF8 to help with removing
      UTF8 related javac warnings. These warnings are removed in
      FSEditLog.java as a use case.

      (Raghu Angadi)

    • HADOOP-5824
      .
      Deprecate DataTransferProtocol.OP_READ_METADATA and remove
      the corresponding unused codes.

      (Kan Zhang via szetszwo)

    • HADOOP-5721
      .
      Factor out EditLogFileInputStream and EditLogFileOutputStream
      into independent classes.

      (Luca Telloli & Flavio Junqueira via
      shv)

    • HADOOP-5838
      .
      Fix a few javac warnings in HDFS.

      (Raghu Angadi)

    • HADOOP-5854
      .
      Fix a few "Inconsistent Synchronization" warnings in HDFS.

      (Raghu
      Angadi)

    • HADOOP-5369
      .
      Small tweaks to reduce MapFile index size.

      (Ben Maurer
      via sharad)

    • HADOOP-5858
      .
      Eliminate UTF8 and fix warnings in test/hdfs-with-mr package.

      (shv)

    • HADOOP-5866
      .
      Move DeprecatedUTF8 from o.a.h.io to o.a.h.hdfs since it may
      not be used outside hdfs.

      (Raghu Angadi)

    • HADOOP-5857
      .
      Move normal java methods from hdfs .jsp files to .java files.

      (szetszwo)

    • HADOOP-5873
      .
      Remove deprecated methods randomDataNode() and
      getDatanodeByIndex(..) in FSNamesystem.

      (szetszwo)

    • HADOOP-5572
      .
      Improves the progress reporting for the sort phase for both
      maps and reduces.

      (Ravi Gummadi via ddas)

    • HADOOP-5839
      .
      Fix EC2 scripts to allow remote job submission.

      (Joydeep Sen Sarma
      via tomwhite)

    • HADOOP-5877
      .
      Fix javac warnings in TestHDFSServerPorts, TestCheckpoint,
      TestNameEditsConfig, TestStartup and TestStorageRestore.

      (Jakob Homan
      via shv)

    • HADOOP-5438
      .
      Provide a single FileSystem method to create or
      open-for-append to a file.

      (He Yongqiang via dhruba)

    • HADOOP-5472
      .
      Change DistCp to support globbing of input paths.

      (Dhruba
      Borthakur and Rodrigo Schmidt via szetszwo)

    • HADOOP-5175
      .
      Don't unpack libjars on classpath.

      (Todd Lipcon via tomwhite)

    • HADOOP-5620
      .
      Add an option to DistCp for preserving modification and access
      times.

      (Rodrigo Schmidt via szetszwo)

    • HADOOP-5664
      .
      Change map serialization so a lock is obtained only where
      contention is possible, rather than for each write.

      (cdouglas)

    • HADOOP-5896
      .
      Remove the dependency of GenericOptionsParser on
      Option.withArgPattern.

      (Giridharan Kesavan and Sharad Agarwal via
      sharad)

    • HADOOP-5784
      .
      Makes the number of heartbeats that should arrive a second
      at the JobTracker configurable.

      (Amareshwari Sriramadasu via ddas)

    • HADOOP-5955
      .
      Changes TestFileOuputFormat so that is uses LOCAL_MR
      instead of CLUSTER_MR.

      (Jothi Padmanabhan via das)

    • HADOOP-5948
      .
      Changes TestJavaSerialization to use LocalJobRunner
      instead of MiniMR/DFS cluster.

      (Jothi Padmanabhan via das)

    • HADOOP-2838
      .
      Add mapred.child.env to pass environment variables to
      tasktracker's child processes.

      (Amar Kamat via sharad)

    • HADOOP-5961
      .
      DataNode process understand generic hadoop command line
      options (like -Ddfs.property=value).

      (Raghu Angadi)

    • HADOOP-5938
      .
      Change org.apache.hadoop.mapred.jobcontrol to use new
      api.

      (Amareshwari Sriramadasu via sharad)

    • HADOOP-2141
      .
      Improves the speculative execution heuristic. The heuristic
      is currently based on the progress-rates of tasks and the expected time
      to complete. Also, statistics about trackers are collected, and
      speculative
      tasks are not given to the ones deduced to be slow.

      (Andy Konwinski
      and ddas)

    • HADOOP-5952
      .
      Change "-1 tests included" wording in test-patch.sh.

      (Gary Murry via
      szetszwo)

    • HADOOP-6106
      .
      Provides an option in ShellCommandExecutor to timeout
      commands that do not complete within a certain amount of time.

      (Sreekanth
      Ramakrishnan via yhemanth)

    • HADOOP-5925
      .
      EC2 scripts should exit on error.

      (tomwhite)

    • HADOOP-6109
      .
      Change Text to grow its internal buffer exponentially, rather
      than the max of the current length and the proposed length to improve
      performance reading large values.

      (thushara wijeratna via cdouglas)

    • HADOOP-2366
      .
      Support trimmed strings in Configuration.

      (Michele Catasta
      via szetszwo)

    • HADOOP-6099
      .
      The RPC module can be configured to not send period pings.
      The default behaviour of sending periodic pings remain unchanged.

      (dhruba)

    • HADOOP-6142
      .
      Update documentation and use of harchives for relative paths
      added in MAPREDUCE-739.

      (Mahadev Konar via cdouglas)

    • HADOOP-6148
      .
      Implement a fast, pure Java CRC32 calculator which outperforms
      java.util.zip.CRC32.

      (Todd Lipcon and Scott Carey via szetszwo)

    • HADOOP-6146
      .
      Upgrade to JetS3t version 0.7.1.

      (tomwhite)

    • HADOOP-6161
      .
      Add get/setEnum methods to Configuration.

      (cdouglas)

    • HADOOP-6160
      .
      Fix releaseaudit target to run on specific directories.

      (gkesavan)

    • HADOOP-6169
      .
      Removing deprecated method calls in TFile.

      (hong tang via
      mahadev)

    • HADOOP-6176
      .
      Add a couple package private methods to AccessTokenHandler
      for testing.

      (Kan Zhang via szetszwo)

    • HADOOP-6182
      .
      Fix ReleaseAudit warnings

      (Giridharan Kesavan and Lee Tucker
      via gkesavan)

    • HADOOP-6173
      .
      Change src/native/packageNativeHadoop.sh to package all
      native library files.

      (Hong Tang via szetszwo)

    • HADOOP-6184
      .
      Provide an API to dump Configuration in a JSON format.

      (V.V.Chaitanya
      Krishna via yhemanth)

    • HADOOP-6224
      .
      Add a method to WritableUtils performing a bounded read of an
      encoded String.

      (Jothi Padmanabhan via cdouglas)

    • HADOOP-6133
      .
      Add a caching layer to Configuration::getClassByName to
      alleviate a performance regression introduced in a compatibility layer.

      (Todd
      Lipcon via cdouglas)

    • HADOOP-6252
      .
      Provide a method to determine if a deprecated key is set in
      config file.

      (Jakob Homan via suresh)

    • HADOOP-5879
      .
      Read compression level and strategy from Configuration for
      gzip compression.

      (He Yongqiang via cdouglas)

    • HADOOP-6216
      .
      Support comments in host files.

      (Ravi Phulari and Dmytro
      Molkov via szetszwo)

    • HADOOP-6217
      .
      Update documentation for project split.

      (Corinne Chandel via
      omalley)

    • HADOOP-6268
      .
      Add ivy jar to .gitignore.

      (Todd Lipcon via cdouglas)

    • HADOOP-6270
      .
      Support deleteOnExit in FileContext.

      (Suresh Srinivas via
      szetszwo)

    • HADOOP-6233
      .
      Rename configuration keys towards API standardization and
      backward compatibility.

      (Jithendra Pandey via suresh)

    • HADOOP-6260
      .
      Add additional unit tests for FileContext util methods.
      (Gary Murry via suresh).

    • HADOOP-6309
      .
      Change build.xml to run tests with java asserts.

      (Eli
      Collins via szetszwo)

    • HADOOP-6326
      .
      Hundson runs should check for AspectJ warnings and report
      failure if any is present

      (cos)

    • HADOOP-6329
      .
      Add build-fi directory to the ignore lists.

      (szetszwo)

    • HADOOP-5107
      .
      Use Maven ant tasks to publish the subproject jars.

      (Giridharan
      Kesavan via omalley)

    • HADOOP-6343
      .
      Log unexpected throwable object caught in RPC.

      (Jitendra Nath
      Pandey via szetszwo)

    • HADOOP-6367
      .
      Removes Access Token implementation from common.

      (Kan Zhang via
      ddas)

    • HADOOP-6395
      .
      Upgrade some libraries to be consistent across common, hdfs,
      and mapreduce.

      (omalley)

    • HADOOP-6398
      .
      Build is broken after HADOOP-6395
      patch has been applied

      (cos)

    • HADOOP-6413
      .
      Move TestReflectionUtils to Common.

      (Todd Lipcon via tomwhite)

    • HADOOP-6283
      .
      Improve the exception messages thrown by
      FileUtil$HardLink.getLinkCount(..).

      (szetszwo)

    • HADOOP-6279
      .
      Add Runtime::maxMemory to JVM metrics.

      (Todd Lipcon via
      cdouglas)

    • HADOOP-6305
      .
      Unify build property names to facilitate cross-projects
      modifications

      (cos)

    • HADOOP-6312
      .
      Remove unnecessary debug logging in Configuration constructor.

      (Aaron
      Kimball via cdouglas)

    • HADOOP-6366
      .
      Reduce ivy console output to ovservable level

      (cos)

    • HADOOP-6400
      .
      Log errors getting Unix UGI.

      (Todd Lipcon via tomwhite)

    • HADOOP-6346
      .
      Add support for specifying unpack pattern regex to
      RunJar.unJar.

      (Todd Lipcon via tomwhite)

    • HADOOP-6422
      .
      Make RPC backend plugable, protocol-by-protocol, to
      ease evolution towards Avro.

      (cutting)

    • HADOOP-5958
      .
      Use JDK 1.6 File APIs in DF.java wherever possible.

      (Aaron Kimball
      via tomwhite)

    • HADOOP-6222
      .
      Core doesn't have TestCommonCLI facility.

      (cos)

    • HADOOP-6394
      .
      Add a helper class to simplify FileContext related tests and
      improve code reusability.

      (Jitendra Nath Pandey via suresh)

    • HADOOP-4656
      .
      Add a user to groups mapping service.

      (boryas, acmurthy)

    • HADOOP-6435
      .
      Make RPC.waitForProxy with timeout public.

      (Steve Loughran
      via tomwhite)

    • HADOOP-6472
      .
      add tokenCache option to GenericOptionsParser for passing
      file with secret keys to a map reduce job.

      (boryas)

    • HADOOP-3205
      .
      Read multiple chunks directly from FSInputChecker subclass
      into user buffers.

      (Todd Lipcon via tomwhite)

    • HADOOP-6479
      .
      TestUTF8 assertions could fail with better text.

      (Steve Loughran via
      tomwhite)

    • HADOOP-6155
      .
      Deprecate RecordIO anticipating Avro.

      (Tom White via cdouglas)

    • HADOOP-6492
      .
      Make some Avro serialization APIs public.

      (Aaron Kimball via
      cutting)

    • HADOOP-6497
      .
      Add an adapter for Avro's SeekableInput interface, so
      that Avro can read FileSystem data.

      (Aaron Kimball via cutting)

    • HADOOP-6495
      .
      Identifier should be serialized after the password is
      created In Token constructor

      (jnp via boryas)

    • HADOOP-6518
      .
      Makes the UGI honor the env var KRB5CCNAME.

      (Owen O'Malley via ddas)

    • HADOOP-6531
      .
      Enhance FileUtil with an API to delete all contents of a
      directory.

      (Amareshwari Sriramadasu via yhemanth)

    • HADOOP-6547
      .
      Move DelegationToken into Common, so that it can be used by
      MapReduce also.

      (devaraj via omalley)

    • HADOOP-6552
      .
      Puts renewTGT=true and useTicketCache=true for the keytab
      kerberos options.

      (ddas)

    • HADOOP-6534
      .
      Trim whitespace from directory lists initializing
      LocalDirAllocator.

      (Todd Lipcon via cdouglas)

    • HADOOP-6559
      .
      Makes the RPC client automatically re-login when the SASL
      connection setup fails. This is applicable only to keytab based logins.

      (Devaraj
      Das)

    • HADOOP-6551
      .
      Delegation token renewing and cancelling should provide
      meaningful exceptions when there are failures instead of returning
      false.

      (omalley)

    • HADOOP-6583
      .
      Captures authentication and authorization metrics.

      (ddas)

    • HADOOP-6543
      .
      Allows secure clients to talk to unsecure clusters.

      (Kan Zhang via
      ddas)

    • HADOOP-6579
      .
      Provide a mechanism for encoding/decoding Tokens from
      a url-safe string and change the commons-code library to 1.4.

      (omalley)

    • HADOOP-6596
      .
      Add a version field to the AbstractDelegationTokenIdentifier's
      serialized value.

      (omalley)

    • HADOOP-6573
      .
      Support for persistent delegation tokens.

      (Jitendra Pandey via shv)

    • HADOOP-6594
      .
      Provide a fetchdt tool via bin/hdfs.

      (jhoman via acmurthy)

    • HADOOP-6589
      .
      Provide better error messages when RPC authentication fails.

      (Kan
      Zhang via omalley)

    • HADOOP-6599
      Split existing RpcMetrics into RpcMetrics & RpcDetailedMetrics.

      (Suresh
      Srinivas via Sanjay Radia)

    • HADOOP-6537
      Declare more detailed exceptions in FileContext and
      AbstractFileSystem

      (Suresh Srinivas via Sanjay Radia)

    • HADOOP-6486
      .
      fix common classes to work with Avro 1.3 reflection.

      (cutting via
      tomwhite)

    • HADOOP-6591
      .
      HarFileSystem can handle paths with the whitespace characters.

      (Rodrigo
      Schmidt via dhruba)

    • HADOOP-6407
      .
      Have a way to automatically update Eclipse .classpath file
      when new libs are added to the classpath through Ivy.

      (tomwhite)

    • HADOOP-3659
      .
      Patch to allow hadoop native to compile on Mac OS X.

      (Colin Evans
      and Allen Wittenauer via tomwhite)

    • HADOOP-6471
      .
      StringBuffer -> StringBuilder - conversion of references
      as necessary.

      (Kay Kay via tomwhite)

    • HADOOP-6646
      .
      Move HarfileSystem out of Hadoop Common.

      (mahadev)

    • HADOOP-6566
      .
      Add methods supporting, enforcing narrower permissions on
      local daemon directories.

      (Arun Murthy and Luke Lu via cdouglas)

    • HADOOP-6705
      .
      Fix to work with 1.5 version of jiracli

      (Giridharan Kesavan)

    • HADOOP-6658
      .
      Exclude Private elements from generated Javadoc.

      (tomwhite)

    • HADOOP-6635
      .
      Install/deploy source jars to Maven repo.

      (Patrick Angeles via
      jghoman)

    • HADOOP-6717
      .
      Log levels in o.a.h.security.Groups too high

      (Todd Lipcon via
      jghoman)

    • HADOOP-6667
      .
      RPC.waitForProxy should retry through NoRouteToHostException.

      (Todd
      Lipcon via tomwhite)

    • HADOOP-6677
      .
      InterfaceAudience.LimitedPrivate should take a string not an
      enum.

      (tomwhite)

    • HADOOP-678
      .
      Remove FileContext#isFile, isDirectory, and exists.

      (Eli Collins via
      hairong)

    • HADOOP-6515
      .
      Make maximum number of http threads configurable.

      (Scott Chen via
      zshao)

    • HADOOP-6563
      .
      Add more symlink tests to cover intermediate symlinks
      in paths.

      (Eli Collins via suresh)

    • HADOOP-6585
      .
      Add FileStatus#isDirectory and isFile.

      (Eli Collins via
      tomwhite)

    • HADOOP-6738
      .
      Move cluster_setup.xml from MapReduce to Common.

      (Tom White via
      tomwhite)

    • HADOOP-6794
      .
      Move configuration and script files post split.

      (tomwhite)

    • HADOOP-6403
      .
      Deprecate EC2 bash scripts.

      (tomwhite)

    • HADOOP-6769
      .
      Add an API in FileSystem to get FileSystem instances based
      on users

      (ddas via boryas)

    • HADOOP-6813
      .
      Add a new newInstance method in FileSystem that takes
      a "user" as argument

      (ddas via boryas)

    • HADOOP-6668
      .
      Apply audience and stability annotations to classes in
      common.

      (tomwhite)

    • HADOOP-6821
      .
      Document changes to memory monitoring.

      (Hemanth Yamijala
      via tomwhite)



  • OPTIMIZATIONS
       (12)


    • HADOOP-5595
      .
      NameNode does not need to run a replicator to choose a
      random DataNode.

      (hairong)

    • HADOOP-5603
      .
      Improve NameNode's block placement performance.

      (hairong)

    • HADOOP-5638
      .
      More improvement on block placement performance.

      (hairong)

    • HADOOP-6180
      .
      NameNode slowed down when many files with same filename
      were moved to Trash.

      (Boris Shkolnik via hairong)

    • HADOOP-6166
      .
      Further improve the performance of the pure-Java CRC32
      implementation. (Tsz Wo (Nicholas), SZE via cdouglas)

    • HADOOP-6271
      .
      Add recursive and non recursive create and mkdir to
      FileContext.

      (Sanjay Radia via suresh)

    • HADOOP-6261
      .
      Add URI based tests for FileContext.
      (Ravi Pulari via suresh).

    • HADOOP-6307
      .
      Add a new SequenceFile.Reader constructor in order to support
      reading on un-closed file.

      (szetszwo)

    • HADOOP-6467
      .
      Improve the performance on HarFileSystem.listStatus(..).

      (mahadev
      via szetszwo)

    • HADOOP-6569
      .
      FsShell#cat should avoid calling unecessary getFileStatus
      before opening a file to read.

      (hairong)

    • HADOOP-6689
      .
      Add directory renaming test to existing FileContext tests.

      (Eli
      Collins via suresh)

    • HADOOP-6713
      .
      The RPC server Listener thread is a scalability bottleneck.

      (Dmytro
      Molkov via hairong)



  • BUG FIXES
       (231)
  不过,个人觉得hadoop的稳定性还是比hbase好,虽然hbase更新速度无比的快。
  算了,还是言归正传,升级:

  1、core-site.xml
不变
  2、hdfs.site.xml
可以不变也可以把dfs.name.dir和dfs.data.dir更新成这样:

<property>
<name>dfs.namenode.name.dir</name>
<value>/data0/cloud/namenode/</value>
</description>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>/data0/cloud/datanode/</value>
</property>
  不过,有个警告就是:

WARN org.apache.hadoop.hdfs.server.common.Util: Path /data0/cloud/namenode/ should be specified as a URI in configuration files. Please update hdfs configuration.

   3、mapred-site.xml


<!--
<property>
<name>mapred.job.tracker</name>
<value>name.uc.uuwatch.com:9001</value>
</property>
-->
<property>
<name>mapreduce.jobtracker.address</name>
<value>tracker.uc.uuwatch.com:9001</value>
</property>

   同步所有的配置文件到到所有的节点,然后启动hadoop
  ...bin/start-all.sh
  你会发现脚本也更新了,比之前的分类更好,控制更加细致,很好!
  不过,可惜的是namenode和secondnamenode不能启动,分析下日志会发现需要升级文件(原来文件格式也有一定的更改,优化了吗,呵呵):

ERROR org.apache.hadoop.hdfs.server.namenode.FSNamesystem: FSNamesystem initialization failed.
java.io.IOException:
File system image contains an old layout version -18.
An upgrade to version -24 is required.
Please restart NameNode with -upgrade option.

   4、OK 行了,执行名称节点的文件系统更新命令

  .../bin/hadoop namenode -upgrade
  好,到此成功启动节点。升级完毕!
  接下来,我启动hbase0.20.6试试了。
  特别提醒:备份你的hadoop数据,不要回不到原来的版本了。请查阅 bin/hadoop namenode 相关的命令以及数据节点的备份。
  简单点:可以拷贝你的原始数据到新的文件夹,更改配置,使用新文件夹的数据升级尝试,失败了也不怕。数据完好
  hbase0.20.6是基于hadoop0.20.2的  发现升级后 RPC对不住,文件系统对不住。
  更改hbade的依赖,改RPC 去叉 重新编译 部署
  另外:经过修改hbase可以启动 但是不能读写表,还没有修改完毕的原因。

运维网声明 1、欢迎大家加入本站运维交流群:群②:261659950 群⑤:202807635 群⑦870801961 群⑧679858003
2、本站所有主题由该帖子作者发表,该帖子作者与运维网享有帖子相关版权
3、所有作品的著作权均归原作者享有,请您和我们一样尊重他人的著作权等合法权益。如果您对作品感到满意,请购买正版
4、禁止制作、复制、发布和传播具有反动、淫秽、色情、暴力、凶杀等内容的信息,一经发现立即删除。若您因此触犯法律,一切后果自负,我们对此不承担任何责任
5、所有资源均系网友上传或者通过网络收集,我们仅提供一个展示、介绍、观摩学习的平台,我们不对其内容的准确性、可靠性、正当性、安全性、合法性等负责,亦不承担任何法律责任
6、所有作品仅供您个人学习、研究或欣赏,不得用于商业或者其他用途,否则,一切后果均由您自己承担,我们对此不承担任何法律责任
7、如涉及侵犯版权等问题,请您及时通知我们,我们将立即采取措施予以解决
8、联系人Email:admin@iyunv.com 网址:www.yunweiku.com

所有资源均系网友上传或者通过网络收集,我们仅提供一个展示、介绍、观摩学习的平台,我们不对其承担任何法律责任,如涉及侵犯版权等问题,请您及时通知我们,我们将立即处理,联系人Email:kefu@iyunv.com,QQ:1061981298 本贴地址:https://www.iyunv.com/thread-311283-1-1.html 上篇帖子: 手动安装Hadoop Cloudera CDH4.2版本 下篇帖子: hadoop验证map/reduce失败,ERROR security.UserGroupInformation: PriviledgedAc
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

扫码加入运维网微信交流群X

扫码加入运维网微信交流群

扫描二维码加入运维网微信交流群,最新一手资源尽在官方微信交流群!快快加入我们吧...

扫描微信二维码查看详情

客服E-mail:kefu@iyunv.com 客服QQ:1061981298


QQ群⑦:运维网交流群⑦ QQ群⑧:运维网交流群⑧ k8s群:运维网kubernetes交流群


提醒:禁止发布任何违反国家法律、法规的言论与图片等内容;本站内容均来自个人观点与网络等信息,非本站认同之观点.


本站大部分资源是网友从网上搜集分享而来,其版权均归原作者及其网站所有,我们尊重他人的合法权益,如有内容侵犯您的合法权益,请及时与我们联系进行核实删除!



合作伙伴: 青云cloud

快速回复 返回顶部 返回列表