
adding paging to get repositories from the database #1793

Closed
wants to merge 6 commits

Conversation

@thiago commented Sep 13, 2016

Resolves #1776.
Related PRs: #1777, #1783

@lu4nation commented

Running it here, this solution solves the 999 limit for me.

@josmo commented Sep 14, 2016

Thanks @thiago, LGTM and works for me as well. I do think we should consider moving away from an "IN" clause and using the temp inner join like what's mentioned in https://www.xaprb.com/blog/2006/06/28/why-large-in-clauses-are-problematic/; it would probably speed things up for orgs that have a large number of repos :)
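For reference, a minimal sketch of the two query shapes under discussion, using database/sql directly; the repos table and the tmp_names temp table are illustrative assumptions, not the actual drone store code:

package store

import (
	"database/sql"
	"fmt"
	"strings"
)

// queryWithIn is the current approach: one "?" placeholder per
// repository name inside a single IN clause.
func queryWithIn(db *sql.DB, names []string) (*sql.Rows, error) {
	placeholders := make([]string, len(names))
	args := make([]interface{}, len(names))
	for i, name := range names {
		placeholders[i] = "?"
		args[i] = name
	}
	query := fmt.Sprintf(
		"SELECT * FROM repos WHERE repo_full_name IN (%s)", // "repos" table name is assumed
		strings.Join(placeholders, ","),
	)
	return db.Query(query, args...)
}

// queryWithJoin is the alternative from the linked post: load the
// names into a temporary table once, then inner join against it.
// "tmp_names" is a hypothetical table for illustration.
func queryWithJoin(db *sql.DB, names []string) (*sql.Rows, error) {
	if _, err := db.Exec("CREATE TEMPORARY TABLE tmp_names (name TEXT)"); err != nil {
		return nil, err
	}
	for _, name := range names {
		if _, err := db.Exec("INSERT INTO tmp_names (name) VALUES (?)", name); err != nil {
			return nil, err
		}
	}
	return db.Query("SELECT repos.* FROM repos INNER JOIN tmp_names ON repos.repo_full_name = tmp_names.name")
}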

@@ -8,6 +8,7 @@ type Feed struct {
 	Name     string `json:"name" meddler:"repo_name"`
 	FullName string `json:"full_name" meddler:"repo_full_name"`
 
+	Id int `json:"id,omitempty" meddler:"build_id,zeroisnull"`


Perhaps we sort on Created instead, since we already have the field available?

@thiago (Author) replied:

Yes, I don't know why I did it! 😊

@bradrydzewski commented Sep 14, 2016

> I do think we should consider moving away from an "IN" clause and using the temp inner join like what's mentioned in

To be fair, this is from 2006, which may not account for 10 years of MySQL or Postgres optimizations.

> it would probably speed things up for orgs that have a large number of repos

Is there any indication that this is currently a performance bottleneck? From my testing with pretty large datasets on a relatively small database server, the query still serves hundreds of feed queries per second with 999 items in the IN statement. I can't think of any real-world install that would execute hundreds of feed queries per second.

You can see my performance testing and results at #1234 (comment)

@bradrydzewski commented

Also relevant, from the comments section of that post: if you are having perf issues, try running ANALYZE on the database in Postgres or SQLite (not sure about MySQL).

> My DBA had a really hard time believing my solution in the first email in this thread should be faster. So after digging some, he ran an ANALYZE on the tables in question.
>
> This helps the PostgreSQL planner out to create more optimal query plans. It turns out that after analyze, the original query that uses an "IN" takes 1ms.
>
> So we went from:
> IN Clause: ~4500ms
> Inner Join: ~300ms
> Analyze with IN Clause: ~1ms
>
> The lesson is: use explain, use analyze, and consult your DBAs. They are your friends.
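A minimal sketch of running that maintenance step from Go; the function name is illustrative, and MySQL would need ANALYZE TABLE <name> per table instead:

package store

import (
	"database/sql"
	"log"
)

// analyze refreshes the planner statistics so queries such as the
// feed's IN clause get an optimal plan. The bare ANALYZE statement
// works on PostgreSQL and SQLite.
func analyze(db *sql.DB) {
	if _, err := db.Exec("ANALYZE"); err != nil {
		log.Println("analyze failed:", err)
	}
}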

@@ -5,6 +5,7 @@ import (
 
 	"github.com/drone/drone/model"
 	"github.com/franela/goblin"
+	"fmt"


import grouping and sorting :)
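For reference, the conventionally grouped and sorted form (per gofmt/goimports style) would put the standard library first, separated from third-party imports:

import (
	"fmt"

	"github.com/drone/drone/model"
	"github.com/franela/goblin"
)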

@bradrydzewski commented Sep 14, 2016

The multiple float64 allocations could be avoided with some slight refactoring, as mentioned in #1793 (comment).

This code correctly calculates the number of pages:

const limit = 999

total := len(listof)
if total == 0 {
	return feed
}

pages := total / limit
if total%limit != 0 {
	pages++
}
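For context, a minimal sketch of how that page count could drive the batched queries; queryPage is a hypothetical helper, not code from the PR:

func getFeedPaged(listof []*model.RepoLite) ([]*model.Feed, error) {
	const limit = 999

	var feed []*model.Feed
	total := len(listof)
	if total == 0 {
		return feed, nil
	}

	pages := total / limit
	if total%limit != 0 {
		pages++
	}

	// query one batch of at most limit repositories per page;
	// queryPage is hypothetical and stands in for the real query.
	for page := 0; page < pages; page++ {
		start := page * limit
		end := start + limit
		if end > total {
			end = total
		}
		batch, err := queryPage(listof[start:end])
		if err != nil {
			return nil, err
		}
		feed = append(feed, batch...)
	}
	return feed, nil
}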

To be honest, I find the changes made to toList a bit complex and difficult to follow. At first glance it looks buggy, but I can't tell. I would prefer a simpler approach, as noted in #1793 (comment).

I also think it would be easier to unit test ...

func toList(listof []*model.RepoLite, start, limit int) ([]string, []interface{}) { ... }

// could be tested with
list := []*model.RepoLite{
	{ ... },
	{ ... },
	{ ... },
	{ ... },
}
placeholders, parameters := toList(list, 0, 2)
g.Assert(len(placeholders)).Equal(2)
g.Assert(len(parameters)).Equal(2)
g.Assert(list[0].FullName).Equal(parameters[0])
g.Assert(list[1].FullName).Equal(parameters[1])
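A minimal sketch of what such a toList could look like, assuming model.RepoLite exposes the FullName field used in the assertions above; this is illustrative, not the PR's implementation:

// toList returns one "?" placeholder and one query parameter per
// repository in the window [start, start+limit). FullName is the
// field shown in the test assertions above.
func toList(listof []*model.RepoLite, start, limit int) ([]string, []interface{}) {
	end := start + limit
	if end > len(listof) {
		end = len(listof)
	}
	if start > end {
		start = end
	}
	placeholders := make([]string, 0, end-start)
	parameters := make([]interface{}, 0, end-start)
	for _, repo := range listof[start:end] {
		placeholders = append(placeholders, "?")
		parameters = append(parameters, repo.FullName)
	}
	return placeholders, parameters
}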

{FullName: "bradrydzewski/drone"},
{FullName: "drone/drone"},
}
_repo := []*model.Repo{


Perhaps repoList instead of _repo, since it is an array. When I see _repo I think it is a single repo.

@thiago (Author) commented Sep 15, 2016

Thank you @bradrydzewski for the details in the comments and for your patience. I think I addressed all the points; let me know if not.

I decided to create a new branch so that the changes against the master branch are cleaner.

If you confirm, I'll send a new pull request and close this.

@thiago (Author) commented Sep 15, 2016

I have not found another way to test the paging in the GetRepoListOf function without using a loop, since the limit is not configurable.

Perhaps it makes sense to make the limit a global setting like DATABASE_DRIVER and DATABASE_CONFIG, but I don't know exactly how to do this. I think it's better solved in another pull request.
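A minimal sketch of what such a setting could look like, following the DATABASE_DRIVER / DATABASE_CONFIG pattern; the DATABASE_QUERY_LIMIT name is hypothetical:

package store

import (
	"os"
	"strconv"
)

// queryLimit reads a hypothetical DATABASE_QUERY_LIMIT environment
// variable, falling back to 999 (the limit discussed above) when it
// is unset or invalid.
func queryLimit() int {
	if s := os.Getenv("DATABASE_QUERY_LIMIT"); s != "" {
		if n, err := strconv.Atoi(s); err == nil && n > 0 {
			return n
		}
	}
	return 999
}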

@thiago (Author) commented Sep 16, 2016

Did I do something wrong? Sorry for my anxiety...

@bradrydzewski commented Sep 16, 2016

EDIT: retracting comment. Looks like you made the requested changes in a different branch. Do you want to open a new PR from that branch? Thanks! :)

@thiago (Author) commented Sep 16, 2016

Yes, after going over the previous reviews I thought it best to start a new branch to make the changes against the master branch clearer.

But I'd like you to validate it before I submit a new PR. Is that possible? Thanks! =D

@josmo mentioned this pull request Feb 5, 2017
@Adron commented Feb 22, 2017

Just checking in on this PR. I was speaking with @josmo a few days ago about its status and ways to test and verify that the paging and everything is working. Is there anything I could prospectively do to help push the priority up on this PR? (It's one we'd get great use out of for some customers, since they're in the gazillion-repo range.) // @thiago @bradrydzewski

Thanks all! 👍

@bradrydzewski commented Feb 23, 2017

> Is there anything I could prospectively do to help push the priority up on this PR

You just did, thanks for the reminder ... just merged #1783 and #1929.

@harness locked and limited the conversation to collaborators Feb 23, 2017