Bug 1428000 - Migrate: only annotate affected files #34

stasm · 2018-01-04T12:39:06Z

https://bugzilla.mozilla.org/show_bug.cgi?id=1428000

Pike

I'm a tad unsure on the abspath vs path change. Can you add more detail on that in the commit message? I think I know why it's OK in blame (cwd=root vs not), but not in blame -> migration.

The other question I have is whether we should first collect all the migration contexts, then gather all files, and run blame once?

Pike · 2018-01-04T13:34:57Z

tools/migrate/migrate-l10n.py

+        # Annotate legacy localization files used as sources by this migration
+        # to preserve attribution of translations.
+        files = ctx.localization_resources.keys()
+        blame = Blame(localization_dir).main(files)


I think we should pass the hglib client to Blame directly. That way, we only have one hg process open instead of ... n + 1 minus whatever gets garbage-collected?

Either in this patch or in a follow-up, I think.

Pike · 2018-01-04T13:48:12Z

tools/migrate/blame.py

@@ -1,8 +1,10 @@
 import argparse
 import json
+from os.path import join


PS: I prefer to have os.path.join explicit rather than importing join into the global namespace. Just import os is good enough. At least that's what more pythonic people have told me to do ;-)

stasm · 2018-01-04T14:03:06Z

Thanks for the review!

I'm a tad unsure on the abspath vs path change. Can you add more detail on that in the commit message? I think I know why it's OK in blame (cwd=root vs not), but not in blame -> migration.

In blame data, path is the path relative to CWD and abspath is the path relative to the root of the repo. With CWD==root, they're the same and I removed abspath to reduce the number of different paths used in the code.

The other question I have is whether we should first collect all the migration contexts, then gather all files, and run blame once?

Annotating now takes under a second which I think is good enough. The code is cleaner now when annotating happens for each migration context separately. Also, batching blames would only help us in the scenario when we run multiple migrations touching the same files... and it would likely shave off a fraction of a second per locale or so. Let's keep this simple for now and revisit as a potential perf optimization if we hit another bottleneck.

In `hg annotate` data, `path` is the path relative to CWD and `abspath` is the path relative to the root of the repo. With CWD==root, they're the same and we can remove `abspath` to reduce the number of different types of paths used in the code.

Pike

Thanks, now I get it.

Also OK to only refactor the loop for a single blame if we get to need it. I also wondered that there might be situations where it's slower, aka, you have two distinct migrations with different blame authors, then you check empty changesets in one from the other. We'll see, likely hardly noticable impact.

- Implement Fluent Syntax 0.5. - Add support for terms. - Add support for `#`, `##` and `###` comments. - Remove support for tags. - Add support for `=` after the identifier in message and term defintions. - Forbid newlines in string expressions. - Allow trailing comma in call expression argument lists. In fluent 0.6.x the new Syntax 0.5 is supported alongside the old Syntax 0.4. This should make migrations easier. `FluentParser` will correctly parse Syntax 0.4 comments (prefixed with `//`), sections and message definitions without the `=` after the identifier. The one exception are tags which are no longer supported. Please use attributed defined on terms instead. `FluentSerializer` always serializes using the new Syntax 0.5. - Expose `FluentSerializer.serialize_expression`. - Fix Bug 1428000 - Migrate: only annotate affected files (#34)

Bug 1428000 - Migrate: only annotate affected files

c9869a2

Pike reviewed Jan 4, 2018

View reviewed changes

Address review comments

65bcac2

In `hg annotate` data, `path` is the path relative to CWD and `abspath` is the path relative to the root of the repo. With CWD==root, they're the same and we can remove `abspath` to reduce the number of different types of paths used in the code.

Pike approved these changes Jan 4, 2018

View reviewed changes

stasm merged commit 44b1203 into projectfluent:master Jan 4, 2018

stasm deleted the annotate-only-affected branch January 4, 2018 14:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bug 1428000 - Migrate: only annotate affected files #34

Bug 1428000 - Migrate: only annotate affected files #34

Uh oh!

stasm commented Jan 4, 2018

Uh oh!

Pike left a comment

Uh oh!

Pike Jan 4, 2018

Uh oh!

Pike Jan 4, 2018

Uh oh!

stasm commented Jan 4, 2018 •

edited

Loading

Uh oh!

Pike left a comment

Uh oh!

Uh oh!

Bug 1428000 - Migrate: only annotate affected files #34

Bug 1428000 - Migrate: only annotate affected files #34

Uh oh!

Conversation

stasm commented Jan 4, 2018

Uh oh!

Pike left a comment

Choose a reason for hiding this comment

Uh oh!

Pike Jan 4, 2018

Choose a reason for hiding this comment

Uh oh!

Pike Jan 4, 2018

Choose a reason for hiding this comment

Uh oh!

stasm commented Jan 4, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Pike left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

stasm commented Jan 4, 2018 •

edited

Loading