Differential Binarization model #2095

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Draft

mehtamansi29 wants to merge 9 commits into keras-team:master from mehtamansi29:diffbin

+1,104 −0

Collaborator

mehtamansi29 commented Feb 12, 2025 •

edited

Loading

Differential Binarization model.


          ImageText detector preprocessor for Differential Binarization model

ed97271

sineeli reviewed

View reviewed changes

keras_hub/src/models/image_text_detector_preprocessor.py Outdated

+                          output = self.image_converter(
+                              {
+                                  "images": x,
+                                  "bounding_boxes": y,

Collaborator

sineeli Feb 12, 2025

I don't think we need any bounding boxes for this task.

mehtamansi29 added 3 commits

March 11, 2025 17:43


          db_utils functions and testfile

d97f362


          Diffbin utils function and test file

de3aaae


          diffbin utils function and testfile

9a3cf2a

sachinprasadhs added the WIP label

sachinprasadhs mentioned this pull request

Adding Differential Binarization model from PaddleOCR to Keras3 #1739

Closed

mehtamansi29 added 5 commits

May 12, 2025 21:40


          diffbin preprocessing function

93ad1ba


          diffbin postprocessing function


          diffbin postprocessing function_1

f1c3734


          diffbin postprocessing function_2

d3c74c9


          diffbin postprocessing function_3

aafef9e

sachinprasadhs reviewed

View reviewed changes

Collaborator

sachinprasadhs left a comment

Took high level pass and left some comments.
Also,
Make al the file names in follow the same format like other files, for db_utils and losses.py

keras_hub/src/models/diffbin/db_utils.py

		return (self.x, self.y)


		def shrink_polygan(polygon, offset):

Collaborator

sachinprasadhs May 14, 2025

shrink_polygon

keras_hub/src/models/diffbin/db_utils.py

Comment on lines +4 to +23

+              class Point:
+                  def __init__(self, x, y):
+                      self.x = x
+                      self.y = y
+                  def __add__(self, other):
+                      return Point(self.x + other.x, self.y + other.y)
+                  def __sub__(self, other):
+                      return Point(self.x - other.x, self.y - other.y)
+                  def __neg__(self):
+                      return Point(-self.x, -self.y)
+                  def cross(self, other):
+                      return self.x * other.y - self.y * other.x
+                  def to_tuple(self):
+                      return (self.x, self.y)

Collaborator

sachinprasadhs May 14, 2025

Is there any other way you can try to achive this better without using class?

keras_hub/src/models/diffbin/db_utils.py



		# cv2.fillpoly function with keras.ops
		def fill_poly_keras(vertices, image_shape):

Collaborator

sachinprasadhs May 14, 2025

you can just mention fill_poly and remove the cv2 mention here and in he description and description of what this function does.

keras_hub/src/models/diffbin/db_utils.py

		return int(height) if height >= 0.1 else 0


		# project point to line

Collaborator

sachinprasadhs May 14, 2025

Remove all these comments, instead make the description better

keras_hub/src/models/diffbin/db_utils.py

+                          high = mid
+                  height = (low + high) / 2
+                  height = (low + high) / 2

Collaborator

sachinprasadhs May 14, 2025

duplicate line

keras_hub/src/models/diffbin/db_utils.py

+              # get line of height
+              def get_line_height(poly):
+                  return binary_search_smallest_width(poly)

Collaborator

sachinprasadhs May 14, 2025

You can avoid this funtion and instead call binary_search_smallest_width directly

keras_hub/src/models/diffbin/db_utils.py

Comment on lines +26 to +27

		"""
		Shrinks a polygon inward by moving each point toward the center.

Collaborator

sachinprasadhs May 14, 2025

All the docstring text should immediately follow """, change it to, and this style should be applied everywhere
"""Shrinks a polygon inward by moving each point toward the center.

sachinprasadhs reviewed

View reviewed changes

keras_hub/src/models/image_text_detector_preprocessor.py

Comment on lines +45 to +56

+                  @preprocessing_function
+                  def generate_postprocess(self,x):
+                      '''
+                      Generates postprocess function to convert probability map of
+                      model output to polygon
+                      '''
+                      probability_maps,threshold_maps = x["probability_maps"], x["threshold_maps"]
+                      binary_maps = 1.0 / (1.0 + keras.ops.exp(-50.0 * (probability_maps - threshold_maps)))
+                      outputs = keras.layers.Concatenate(axis=-1)(
+                          [probability_maps, threshold_maps, binary_maps])
+                      return outputs

Collaborator

sachinprasadhs May 15, 2025

why do we need this? we don't have .generate function for diff_bin right?

sachinprasadhs reviewed

View reviewed changes

keras_hub/src/models/image_text_detector_preprocessor.py

Comment on lines +17 to +18

		target_size=(640, 640),
		shrink_ratio=0.3,

Collaborator

sachinprasadhs May 15, 2025 •

edited

Loading

target_size will be image_size and will be handled in ImageConverter, not in this.
Where are we using this shrink_ratio here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

WIP