Include only one element of schema in new schema?












The contents of T are:



one   33    xxx
one   111   xxx
one   111   xxy
two   7     yzy
two   9     klm
two   11    klm
two   365   klm
four  3434  iti


As the data shows, a single hash can have multiple values. My goal is to present the data like this:



one   33    xxx
one   111   xxy
two   7     yzy
two   9     klm
four  3434  iti


I only need one of the values for each hash; I don't care which one it is.



The following is my approach, but it is not working as expected.

I am running the job on Hadoop and it runs out of memory. If it did what I want, I would expect only 400 lines of output.



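-- X stands for the input path, as in the original script.
T = LOAD X AS (NAME:bytearray, VALUE:bytearray, HASH:bytearray);

-- Group the loaded relation T (not X); field names are case-sensitive.
NAME_HASH_GROUPED = GROUP T BY (NAME, HASH);

-- This still emits the whole bag of values for each (NAME, HASH) group.
NAME_VALUE_HASH = FOREACH NAME_HASH_GROUPED
GENERATE group, T.VALUE;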
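
To make the goal concrete, here is a minimal Python sketch (not Pig, and not part of my job) that models the output I am after on the sample rows above. The function name `one_value_per_key` and the `ROWS` list are made up for illustration. It keeps the first value seen for each (name, hash) pair, though any one value per pair would be acceptable.

```python
# Sample rows copied from the contents of T above: (name, value, hash).
ROWS = [
    ("one", "33", "xxx"),
    ("one", "111", "xxx"),
    ("one", "111", "xxy"),
    ("two", "7", "yzy"),
    ("two", "9", "klm"),
    ("two", "11", "klm"),
    ("two", "365", "klm"),
    ("four", "3434", "iti"),
]


def one_value_per_key(rows):
    """Keep a single value for each distinct (name, hash) pair.

    The first value encountered wins; this choice is arbitrary,
    since any one value per pair is acceptable.
    """
    kept = {}
    for name, value, hash_ in rows:
        # setdefault stores the value only if the key is not present yet.
        kept.setdefault((name, hash_), value)
    # Dicts preserve insertion order (Python 3.7+), so groups come out
    # in the order they first appeared in the input.
    return [(name, value, hash_) for (name, hash_), value in kept.items()]


if __name__ == "__main__":
    for name, value, hash_ in one_value_per_key(ROWS):
        print(name, value, hash_)
```

On the sample rows this yields five rows, one per (name, hash) pair, which is the shape of result I want the Pig script to produce.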









      apache-pig






      asked Dec 28 '18 at 5:36









      Don Code
