add active_tasks for view builds using version stamps #3003

tonysun83 · 2020-07-14T05:25:19Z

Overview

Active Tasks requires TotalChanges and ChangesDone to show the progress
of long running tasks. This requires count_changes_since/2 to be
implemented. Unfortunately, that is not easily done with
foundationdb. This commit replaces TotalChanges with the
versionstamp + the number of docs as a progress indicator. This can
possibly break existing api that relys on TotalChanges. ChangesDone
will still exist, but instead of relying on the current changes seq
it is simply a reflection of how many documents were written by the
updater process.

The new responses look like this:

[
    {
        "node": "[email protected]",
        "pid": "<0.622.0>",
        "changes_done": 199,
        "current_version_stamp": "8131141649532-0-198",
        "database": "testdb",
        "db_version_stamp": "8131141649532-0-999",
        "design_document": "_design/example",
        "started_on": 1594703583,
        "type": "indexer",
        "updated_on": 1594703586
    }
]

[
    {
        "node": "[email protected]",
        "pid": "<0.1030.0>",
        "changes_done": 1000,
        "current_version_stamp": "8131194932916-0-999",
        "database": "testdb",
        "db_version_stamp": "8131194932916-0-999",
        "design_document": "_design/example",
        "started_on": 1594703636,
        "type": "indexer",
        "updated_on": 1594703665
    }
]

Note that this only shows one process. IIUC, we can have multiple workers indexing in parallel. I think this approach also takes care of this with the ChangesDone only reflecting how many documents have been indexed for that range.

Testing recommendations

Tests need to be added. If this approach is accepted, I will add more precise tests. Currently ran manual tests.

Related Issues or Pull Requests

Checklist

Code is written and works correctly
Changes are covered by tests
Any new configurable parameters are documented in rel/overlay/etc/default.ini
A PR for documentation changes has been made in https://github.com/apache/couchdb-documentation

garrensmith

A good start. I'm not familiar with how active_tasks works but I'm not sure this is the best approach. All the information you need for view progress is in the job info. So when a query for _active_tasks is received I think you could just read the job queue and get all the information you need and then respond. Or is there more to this?

src/couch_views/src/couch_views_indexer.erl

garrensmith · 2020-07-14T08:46:14Z

src/couch_views/src/couch_views_indexer.erl

+ {type, indexer},
+ {database, Mrst#mrst.db_name},
+ {design_document, Mrst#mrst.idx_name},
+ {changes_done, 0},


Will changes always be 0? What happens if an index is half-built and the indexer is starting up again?

I didn't have a good solution to this. Mainly that the indexer process has to die first which means the active_task tuple is automatically removed from memory. I was thinking of writing a counter to fdb, but that opened up a new can of worms. The idea here is that is that each process will just own a portion of what they've indexed and not care about what's been done previously. In other words, if you have an index that built 900 documents out of a 1000 document db and it had to restart, it'll start from 0 and just get up to 100 for ChangesDone and finish.

Historically, changes_done refers to how many changes have been processed by this view update. It's for judging how much work is left before the view is up to date. One way to think of this would be if you had a 10,000,000 docs processed for a view, and then went to process another 1,000,000 docs to bring the view up to date, you wouldn't want to start at 99% finished as that'd be misleading to operators.

src/couch_views/src/couch_views_indexer.erl

nickva · 2020-07-14T14:57:27Z

@tonysun83

Agree with @garrensmith, for reporting the job on all the nodes, it might be easier to think of something like this:

Add an <<"active_task_status">> section to each job's data section, already converted to json format.
In the http handler we would query all the active jobs and just emit those json objects. All active jobs are conveniently located under a single ?ACTIVITY subspace and can be queried with get_active_since/3 (it was done that way on purpose to make it simple to query all active jobs in order to report _active_tasks easier 😉 ).
To get all the possible job types can use get_types.
Since we'd be calling those outside of couch_jobs we could add a new function to couch_jobs, something like get_active_jobs perhaps.

davisp

This looks fine to me. I'm not sure whether we should try and compare the version stamps to give a rough approximation of the progress completed. I think for now it would be fine and if/when fdb gets a count_rows_betwee_keys we can replace it with something more accurate. I'm not sure if I want to require that or merely suggest so will wait for feedback on my feedback before approving.

src/couch_views/src/couch_views_indexer.erl

src/chttpd/src/chttpd_misc.erl

garrensmith

That is looking much better. Nice work. Could you check if we have any elixir active tasks tests and make sure they are run with make check. It might be worth adding a few eunit tests around the changes_doc etc.

src/chttpd/src/chttpd_misc.erl

garrensmith · 2020-07-20T07:51:10Z

src/couch_views/src/couch_views_indexer.erl

+
+
+active_tasks_info(ChangesDone, DbName, DDocId, LastSeq, DBSeq) ->
+ VS = case LastSeq of


I don't like storing all this active task info in the job. All of this can be generated when CouchDB receive's a _active_tasks request. If we keep adding more and more information to the job info means we running the risk of exceeding the FDB value limit. Rather just add the current_version_stamp and db_version_stamp to the job options. Is indexer_pid actually useful? What would a user do with that?

The reason I had thought of it is that for replication tasks, for example, we we'd have to know to introspect a few other places specific to replicator to add "source", "target", etc. So active tasks would have to start knowing about how to extract fields from various applications. Perhaps we'd want a simple registration like we have for indices in fabric2_index: https://github.com/apache/couchdb/blob/prototype/fdb-layer/src/fabric/src/fabric2_index.erl#L51-L53 - modules (applications) register a callback for _active_tasks handler and call it when building the response? Or, perhaps we can do that as round 2 if we see the values getting close to 100KB size.

@garrensmith @nickva how about leaving just changes_done to the job_data and then computing the rest on the fly? It seems we should be adding an entirely new fabric_active_tasks module which allows the extractions to reside in there.

That sounds good.

src/couch_views/src/couch_views_indexer.erl

src/couch_jobs/src/couch_jobs_fdb.erl

src/chttpd/src/chttpd_misc.erl

src/fabric/src/fabric2_fdb.erl

src/couch_jobs/src/couch_jobs_fdb.erl

src/couch_jobs/src/couch_jobs.erl

src/couch_jobs/src/couch_jobs_fdb.erl

src/fabric/src/fabric2_active_tasks.erl

src/couch_views/src/couch_views_active_tasks.erl

nickva · 2020-07-22T15:45:38Z

@tonysun83 looks better! Good improvement. I made a few comments-in line.

One part that I am not sure if it is worth starting two transactions per task item just to compute version stamps and maybe a few other fields. That seems quite expensive if, say, someone is polling status on a cluster with hundreds of tasks, given that jobs already have the dbs open and all that data available to start with. The reason we decided to do that was to ensure we are below 100KB for JobData, I think?

We'd have to make a convincing case that our active_jobs status section in each JobData would have a chance of pushing it over the 100KB value limit. I tried encoding a large-ish object similar to the one we are adding for the task status:

S = #{
    <<"database">> => << <<"x">> || _ <- lists:seq(1,256) >>,
    <<"changes_done">> => 999999999999999999999999999999,
    <<"design_document">> => << <<"d">> || _ <- lists:seq(1,512) >>,
    <<"current_version_stamp">> => <<"1111111111111111111111111111111111111111">>,
    <<"db_version_stamp">> => <<"222222222222222222222222222222222222222222222">>
}.

>> size(iolist_to_binary(jiffy:encode(S))).
984

So we'd be adding at most another 1KB (in the worst case) to get closer to the 100KB limit if we save the status in the job data that doesn't seem that bad perhaps and it would simplify the design quite a bit more.

davisp · 2020-07-22T16:25:01Z

I'm actually counting three possible transactions between opening the database if its not in cache, one for the reading the design doc, then one for the other two bits of info.

Can someone point me to the discussion where we decided to move away from storing active task info in the job state? I'm not seeing how this is a better approach or why the job state approach is unworkable.

tonysun83 · 2020-07-22T16:34:33Z

@davisp : Here was @garrensmith 's concern #3003 (comment). But I think I found a compromise and pushed (now the populate_tasks function is way simpler without the reads). We really just need to add two new items to the JobData info: <<"changes_done">> and <<"db_seq">>

davisp · 2020-07-22T16:45:32Z

@tonysun83 The history of this PR is super confusing to me. The last update appears to have reverted to storing all of the info back in the job state which means that this new populate_active_tasks is just "format_job_as_active_task". Which is fine I suppose. Though it seems like it'd be a lot more straightforward for the jobs to just store what they want their active_task output to look like in the first place?

Adding a whole registration/callback system for this seems much more complex than the original plan to just spit out the job data as JSON.

tonysun83 · 2020-07-22T17:02:47Z

@davisp: agreed that it's gotten way more complex. I think @nickva thought that the registration callback would help with the sizing issue, and as a side benefit, add flexibility for future active_task types (I know replication is next). I'm leaning toward the simpler method like we did previously without the callback but would like @nickva to chime in here since he would know what we would need for replication.

nickva · 2020-07-22T17:24:41Z

@tonysun83: Agree with @davisp, it's not worth having the registration system just for this.

We could still hide the <<"active_task_info">> field name with a simple library function like

fabric2_active_tasks:update_active_task_info(JobData, ActiveTaskInfo) ->
     JobData#{?ACTIVE_TASK_INFO => ActiveTaskInfo}.

And use the original design #3003 (comment)

I think @nickva thought that the registration callback would help with the sizing issue

The registration would have helped with encapsulation, if we had a size issue and had to compute the section on the fly to avoid the http handle hard-coding knowledge about the replication and indexing apps. But I think we determined computing on the fly was not as optimal, and it wasn't adding that much more to the job data size. If we ever do need to compute it on the fly, we could also come back add the complexity later.

src/couch_views/src/couch_views_indexer.erl

src/couch_jobs/src/couch_jobs.erl

src/fabric/src/fabric2_active_tasks.erl

src/couch_views/src/couch_views_indexer.erl

tonysun83 · 2020-07-23T18:46:06Z

@nickva addressed your latest comments and added a test. Will add more tests later and then rebase with a proper commit history.

src/fabric/src/fabric2_active_tasks.erl

nickva · 2020-07-23T18:55:43Z

src/couch_jobs/src/couch_jobs.erl

+
+-spec get_types(jtx()) -> [job_type()] | {error, any()}.
+get_types(Tx) ->
+ couch_jobs_fdb:tx(couch_jobs_fdb:get_jtx(Tx), fun(JTx) ->


Minor nit: I think it's a 5 space indent instead of 4

src/fabric/src/fabric2_active_tasks.erl

src/couch_views/src/couch_views_indexer.erl

nickva · 2020-07-23T19:07:48Z

@tonysun83 looks great! awesome work. Just a few minor nits, whitespace and indents and such

garrensmith

This is looking really good. I've left a note about the extra active task info. I'm still not sure why we need to have all of that in the job data.

garrensmith · 2020-07-24T05:46:36Z

src/chttpd/src/chttpd_misc.erl

- [{[{node,Node} | Task]} || Task <- Tasks]
- end, Replies),
- send_json(Req, lists:sort(Response));
+ ActiveTasks = fabric2_active_tasks:get_active_tasks(),


Awesome. This is great.

src/couch_views/src/couch_views_util.erl

garrensmith

Nice work. This looks great.

nickva · 2020-07-24T15:48:29Z

src/fabric/src/fabric2_active_tasks.erl

+
+
+update_active_task_info(JobData, ActiveTaskInfo) ->
+ JobData#{?ACTIVE_TASK_INFO => ActiveTaskInfo}.


Tiny nit: 5 indent instead of 4, add a newline at the end of the file as well, just for so GH diff doesn't show the red warning :-P

oh, I thought I changed this. weird. thx

nickva

+1 Nice work, Tony!

Made a small comment about an indent. Run the new files through emillio, make sure it's happy in case we missed some formatting issues.

Then don't forget to squash some commits, maybe group them into 2 or 3, something like below or whatever looks better to you:

couch_jobs API additions
fabric2 + chttpd part
couch_views bits

We expose get_types in couch_jobs and also add get_active_jobs_ids to get the active job ids given a certain type.

Instead of relying on couch_task_status, we use fabric2_active_tasks to construct active_task info via couch_jobs.

Active Tasks requires TotalChanges and ChangesDone to show the progress of long running tasks. This requires count_changes_since to be implemented. Unfortunately, that is not easily done via with foundationdb. This commit replaces TotalChanges with the versionstamp + the number of docs as a progress indicator. This can possibly break existing api that relys on TotalChanges. ChangesDone will still exist, but instead of relying on the current changes seq it is simply a reflection of how many documents were written by the updater process.

tonysun83 requested a review from davisp July 14, 2020 05:25

garrensmith requested changes Jul 14, 2020

View reviewed changes

davisp reviewed Jul 14, 2020

View reviewed changes

src/couch_views/src/couch_views_indexer.erl Outdated Show resolved Hide resolved

tonysun83 force-pushed the add_active_tasks_fdb branch from 5b9c90b to c9c8767 Compare July 19, 2020 23:48

tonysun83 commented Jul 19, 2020

View reviewed changes

src/chttpd/src/chttpd_misc.erl Outdated Show resolved Hide resolved

tonysun83 force-pushed the add_active_tasks_fdb branch from c9c8767 to 59e1acb Compare July 20, 2020 00:16

garrensmith requested changes Jul 20, 2020

View reviewed changes

nickva reviewed Jul 20, 2020

View reviewed changes

src/couch_jobs/src/couch_jobs_fdb.erl Outdated Show resolved Hide resolved

nickva reviewed Jul 20, 2020

View reviewed changes

src/chttpd/src/chttpd_misc.erl Outdated Show resolved Hide resolved

nickva reviewed Jul 21, 2020

View reviewed changes

src/fabric/src/fabric2_fdb.erl Outdated Show resolved Hide resolved

nickva reviewed Jul 21, 2020

View reviewed changes

src/couch_jobs/src/couch_jobs_fdb.erl Outdated Show resolved Hide resolved

nickva reviewed Jul 21, 2020

View reviewed changes

src/couch_jobs/src/couch_jobs.erl Outdated Show resolved Hide resolved

tonysun83 force-pushed the add_active_tasks_fdb branch from 366a3c6 to c009825 Compare July 22, 2020 01:42

nickva reviewed Jul 22, 2020

View reviewed changes

src/couch_jobs/src/couch_jobs_fdb.erl Outdated Show resolved Hide resolved

nickva reviewed Jul 22, 2020

View reviewed changes

src/couch_jobs/src/couch_jobs_fdb.erl Outdated Show resolved Hide resolved

nickva reviewed Jul 22, 2020

View reviewed changes

src/fabric/src/fabric2_active_tasks.erl Outdated Show resolved Hide resolved

nickva reviewed Jul 22, 2020

View reviewed changes

src/couch_views/src/couch_views_active_tasks.erl Outdated Show resolved Hide resolved

tonysun83 force-pushed the add_active_tasks_fdb branch from 19856c5 to 0e72743 Compare July 22, 2020 19:44