DOC: added string processing comparison with R #16502

natethedrummer · 2017-05-25T19:06:31Z

Added string processing section to comparison_with_r DOC. Used info from http://blog.dominodatalab.com/using-r-and-python-for-common-sas-functions.

This completes the issue.

added string comparison functions section in documentation comparison_with_r.rst

jreback · 2017-05-25T20:59:31Z

doc/source/comparison_with_r.rst

@@ -530,6 +530,103 @@ For more details and examples see :ref:`categorical introduction <categorical>`
 :ref:`differences to R's factor <categorical.rfactor>`.




Add a section tag here, like _compare_with_r.string (actually, if you can add them to some of the sub-sections too, that would be great). You put it right after the sub-section label.

If you aren't familiar with Sphinx, you need to start the line with `..`; see http://www.sphinx-doc.org/en/stable/markup/inline.html#cross-referencing-arbitrary-locations

jreback · 2017-05-25T20:59:50Z

doc/source/comparison_with_r.rst

+``nchar`` includes leading and trailing blanks.  Use ``nchar`` and ``trimws`` 
+to exclude leading and trailing blanks. 
+
+.. code-block:: none


Is there an R highlighter?

http://pygments.org/docs/lexers/#lexers-for-the-r-s-languages

r or rconsole should work. Probably rconsole if you're showing output.

yeah I realize that these produce the same output, so we don't actually show the output (I think) elsewhere, so maybe that is ok (though obviously the code formatting would be nice)

jreback · 2017-05-25T21:00:16Z

doc/source/comparison_with_r.rst

+``len`` includes leading and trailing blanks.  Use ``len`` and ``strip`` 
+to exclude leading and trailing blanks.
+
+.. code-block:: none


Change this to `.. ipython:: python` (and do the same for all running pandas code).
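
For reference, a minimal sketch of the behavior this hunk describes, assuming plain Python `str` methods and the four color values from the PR's R example. The variable names are illustrative and not taken from the PR; pandas exposes the same operations on a column through the `.str` accessor (`.str.len()`, `.str.strip()`).

```python
# Illustrative values mirroring the R data frame in this PR.
colors = ["red", " blue", "green ", " yellow "]

# len counts every character, including leading and trailing blanks.
raw_lengths = [len(c) for c in colors]

# strip removes leading and trailing whitespace before the count.
stripped_lengths = [len(c.strip()) for c in colors]

print(raw_lengths)       # [3, 5, 6, 8]
print(stripped_lengths)  # [3, 4, 5, 6]
```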

jreback · 2017-05-25T21:00:36Z

doc/source/comparison_with_r.rst

+
+   df <- data.frame(color = c('red', ' blue', 'green ', ' yellow '))
+   nchar(as.character(df$color))
+   nchar(trimws(as.character(df$color)))


would be nice to show output here (IOW run the R code and show the output too if you can)

codecov · 2017-05-25T23:47:15Z

Codecov Report

Merging #16502 into master will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master   #16502   +/-   ##
=======================================
  Coverage   90.43%   90.43%           
=======================================
  Files         161      161           
  Lines       51045    51045           
=======================================
  Hits        46161    46161           
  Misses       4884     4884

Flag	Coverage Δ
#multiple	`88.27% <ø> (ø)`	⬆️
#single	`40.16% <ø> (ø)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e81f3cc...bf8ae7a. Read the comment docs.

jreback · 2017-07-19T10:35:33Z

@gfyoung can you review

gfyoung · 2017-07-19T15:10:06Z

doc/source/comparison_with_r.rst

+``find`` function.  ``find`` searches for the first position of the 
+substring.  If the substring is found, the function returns its 
+position.  Keep in mind that Python indexes are zero based whereas 
+R indexes are 1 based.


Simpler: "Keep in mind that Python 0-indexes, whereas R 1-indexes"
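
A short sketch of the indexing difference discussed here, assuming plain Python `str.find`. The sample string and variable names are hypothetical; the `+ 1` shift only illustrates how a 0-based position maps to a 1-based one.

```python
text = "green"

# find returns the 0-based position of the first match.
pos = text.find("ee")

# The same match counted the 1-based way, as R would report it.
one_based = pos + 1

# find returns -1 when the substring is absent.
missing = text.find("z")

print(pos, one_based, missing)  # 2 3 -1
```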

gfyoung · 2017-07-19T15:10:57Z

doc/source/comparison_with_r.rst

+
+In Python, you can use ``[]`` notation to extract a substring 
+from a string by position locations.  Keep in mind that Python 
+indexes are zero-based.


Slightly simpler: "Keep in mind that Python 0-indexes"
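
A minimal sketch of substring extraction with `[]`, assuming a plain Python string. The sample word is hypothetical; the comparison with R's `substr` in the comments is there only to show the 0-based, end-exclusive convention.

```python
word = "yellow"

# Slices are 0-based and exclude the end position.
first_three = word[0:3]

# Characters at 0-based positions 2, 3 and 4.
middle = word[2:5]

# In R, substr(word, 3, 5) selects the same characters, because R
# counts from 1 and includes the end position.
print(first_three, middle)  # yel llo
```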

gfyoung · 2017-07-19T15:12:52Z

doc/source/comparison_with_r.rst

+In addition, Python's ``title`` function changes the string to 
+proper case.
+
+.. code-block:: none


Same comment as above.
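
A small sketch of the case conversions this hunk refers to, assuming plain Python `str` methods and a hypothetical sample phrase. pandas offers the same methods on a column through `.str.upper()`, `.str.lower()` and `.str.title()`.

```python
phrase = "the quick brown fox"

upper = phrase.upper()
lower = upper.lower()

# title capitalizes the first letter of each word ("proper case").
proper = phrase.title()

# Caveat: title treats an apostrophe as a word boundary.
contraction = "don't stop".title()

print(upper)        # THE QUICK BROWN FOX
print(proper)       # The Quick Brown Fox
print(contraction)  # Don'T Stop
```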

jreback · 2017-09-10T14:50:16Z

can you rebase / update

jreback · 2017-09-23T20:11:32Z

can you update according to comments

jreback · 2017-11-10T20:18:45Z

closing as stale. if you'd like to continue working, pls ping.

added string comparison functions

bf8ae7a

added string comparison functions section in documentation comparison_with_r.rst

jreback requested changes May 25, 2017

jreback added the Docs label May 25, 2017

gfyoung added this to the 0.21.0 milestone Jul 19, 2017

gfyoung reviewed Jul 19, 2017

jreback removed this from the 0.21.0 milestone Sep 23, 2017

jreback closed this Nov 10, 2017
