Metrics Database: Name of Classic MapReduce Jobs?

Question asked by dannyman on Dec 1, 2015
Latest reply on Dec 1, 2015 by dannyman
I'm working on a script to pull statistics out of the metrics database. Unfortunately, Classic MapReduce jobs show up with mangled names:

    mysql> select JOB_NAME from JOB where CREATED > date_sub(now(), interval 1 day) and time_finished is not NULL;
    +------------------------------------------------------------------+
    | JOB_NAME                                                         |
    +------------------------------------------------------------------+
    | [981488D796404B7C94BE2469292C1CF6/63D2E8B4BAED45DFB87D54A52AA31D |
    | [981488D796404B7C94BE2469292C1CF6/D451E735C72149E18EF89C5CF17AA8 |
    | [981488D796404B7C94BE2469292C1CF6/06E63BC3637A40949F670662F018D0 |
    | [981488D796404B7C94BE2469292C1CF6/6AB85C96499B4C3B82DF0B1C589D17 |
    | [525DAF677C704444A949F4F777BE3B8B/70D1436BC5834051877025B6D52347 |
    | [847121CCE1254AB7A04C9D42607710B0/C07808D2D1C846149CC0543128E9A4 |
    | [5170C6E5AA5E439A8A04D6338BAC9DA5/B0DB4F9637334AE88AA4A7D3ACC4D9 |
    | [5170C6E5AA5E439A8A04D6338BAC9DA5/A66141610C0E4E9EB2E2114C53CB97 |
    | distcp                                                           |
    | [5170C6E5AA5E439A8A04D6338BAC9DA5/96CFC436074A43BFA3BAD407BD0609 |
    | [5170C6E5AA5E439A8A04D6338BAC9DA5/1BCDFFE2FABD499390E77BB378B971 |
    | [2F0C8EAFED06418EB9B2D77D2E9BDFEB/D41E261A494E4FD2A2F964FC5D56C6 |
    | [2F0C8EAFED06418EB9B2D77D2E9BDFEB/95641449875A4B3798C11E7D1E9F7C |
    | [2F0C8EAFED06418EB9B2D77D2E9BDFEB/7AAFD103C6694963AA89F709DD3EB2 |
    | [2F0C8EAFED06418EB9B2D77D2E9BDFEB/D1DE8E0215B143AE80E913438FABE8 |
    | [2F0C8EAFED06418EB9B2D77D2E9BDFEB/538D02A57AD84B5798E9D58A9F3BD3 |
    | [2F0C8EAFED06418EB9B2D77D2E9BDFEB/CF06EA3D43B3437C96F1764D79DA69 |
    | [D06674919477407DA8E4E46338257B3F/447328A306BA462F92E1075379BDC8 |
    +------------------------------------------------------------------+
    18 rows in set (0.00 sec)
    
Over in the old jobtracker page, in the Name column, I see, for example:

    [D06674919477407DA8E4E46338257B3F/447328A306BA462F92E1075379BDC8D4] com.qf.pipeline.annotator.TwAnnotationShuffleJob/(1/4)

As best I can tell, the bracketed part is not encoded text but a pair of hex IDs. At 67 characters it already overflows the VARCHAR(64) of the JOB_NAME field in the JOB table, so the readable name is cut off before it is stored. I'm thinking that if I alter the table to make the field wider, I could strip the leading [.*/.*] from JOB_NAME and get the true name, but what is going on here, and is there a better way?
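
For illustration only, here is a minimal Python sketch of the parsing idea described above. It assumes the prefix always has the shape seen in the jobtracker example (two 32-character hex IDs separated by a slash, inside square brackets, followed by a space). The names `PREFIX_RE`, `strip_id_prefix`, and `is_truncated_prefix` are hypothetical helpers, not part of any product API, and the sketch only recovers a name when the full string is available, for instance after widening the column or when reading from another source.

```python
import re

# Assumed prefix shape, inferred from the jobtracker example:
# "[<32 hex chars>/<32 hex chars>] " followed by the readable job name.
PREFIX_RE = re.compile(r"^\[[0-9A-Fa-f]{32}/[0-9A-Fa-f]{32}\]\s*")

COLUMN_WIDTH = 64  # VARCHAR(64), as stated for JOB.JOB_NAME


def strip_id_prefix(job_name: str) -> str:
    """Return job_name without a leading [ID/ID] prefix, if one is present."""
    return PREFIX_RE.sub("", job_name, count=1)


def is_truncated_prefix(job_name: str) -> bool:
    """True if the value starts like a prefix but was cut off before the ']'."""
    return job_name.startswith("[") and PREFIX_RE.match(job_name) is None


# Full name as shown in the jobtracker page.
full = ("[D06674919477407DA8E4E46338257B3F/447328A306BA462F92E1075379BDC8D4] "
        "com.qf.pipeline.annotator.TwAnnotationShuffleJob/(1/4)")

# What a 64-character column would keep of that name.
stored = full[:COLUMN_WIDTH]

print(strip_id_prefix(full))        # readable name with the prefix removed
print(is_truncated_prefix(stored))  # the stored value is only a partial prefix
print(strip_id_prefix("distcp"))    # names without a prefix pass through
```

Rows flagged by `is_truncated_prefix` carry no readable name at all in a 64-character column, so stripping the prefix is only useful once the complete string can be stored or fetched.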
