Split ruby.h #2991

shyouhei · 2020-03-30T06:08:38Z

Turn this:

into this:

shyouhei · 2020-03-30T07:37:27Z

Confirmed that edge rails can properly be bundle installed with it.

ko1 · 2020-03-30T07:50:47Z

could you explain the PR more? Motivation etc.

shyouhei · 2020-03-30T11:51:59Z

This is almost the same as #2711, except it applies the same thing against ruby.h instead of internal.h.

ruby.h@master includes literally thousands of lines of macro definitions, and #includes intern.h, which has another thousands of lines of code. I have problems maintaining these files. I want them to be split into lots of small files.
By splitting into small files, it is now possible to convert macros into inline functions. Macros are difficult to read, hard to maintain, impossible to debug. Converting them into inline functions should at least give a degree of relief from them.
As a result of de-macro transformation, granular type checking is now possible. For instance ruby/spec#762 was found during developing this branch.

ioquatix · 2020-03-31T21:19:04Z

I generally like this idea.

I wonder if we should use ruby3 or ruby/3 (or even just ruby). I guess all are fine but I feel like it's more common to use ruby3 if version number is expected. It also means that user could have "ruby2", "ruby3" and "ruby4" all installed and in theory include all headers simultaneously. Is there any example we can follow? Don't mean to bikeshed.

Also, if we are considering this, is it also open to consider moving all source ruby/*.c into ruby/src/? Because the root directory of Ruby is a bit of a mess.

I tried to move coroutine code into it's own area, using more fine grained files (partly because they are different arch). However, maybe we can rethink the layout of the repo.

shyouhei · 2020-04-01T03:12:19Z

Yes moving *.c files into src/ -like locations is in my TODO list. So far it is postponed because the layout of the repo as a whole is a big story. It should also impact all the core devs (their stash/local branches get ruined).

Re: directory name. It's okay to take ruby3 instead of ruby/3. I don't think side-by-side installation of headers from different versions works, though (for instance 2.6 and 2.7 provide slightly different feature sets so "ruby2" is impractical).

ioquatix · 2020-04-01T03:31:11Z

Yes moving *.c files into src/ -like locations is in my TODO list. So far it is postponed because the layout of the repo as a whole is a big story. It should also impact all the core devs (their stash/local branches get ruined).

Maybe requires more discussion, but I'd support (before Ruby 3):

Better organisation of source files. Maybe src/$module/$thing.c where $thing is logical grouping. We should try to make some organisation such that we have logical Init_$thing and so on, and try to ensure we prefer rb_$module_$thing_function or something similar.
Expand tabs across all code.

Re: directory name. It's okay to take ruby3 instead of ruby/3. I don't think side-by-side installation of headers from different versions works, though (for instance 2.6 and 2.7 provide slightly different feature sets so "ruby2" is impractical).

What does Python do? Can you have Python 3.1, 3.3 installed at the same time? Maybe we should adopt, e.g.

ruby3.1/*.h
ruby3 -> ruby3.1 (symlink to latest)

We should try to find what is best for users, OS distributions.

shyouhei · 2020-04-01T03:48:30Z

I started to think we should move to our bug tracker. The discussion (is fruitful so far, but) started to divert from this particular pull request.

ioquatix · 2020-04-01T03:54:51Z

I am happy for you to move the discussion. Maybe good to have a concrete proposal. We can discuss on slack? I am happy to help edit the proposal.

shyouhei · 2020-04-01T04:01:25Z

Expand tabs across all code.

Heh, you just realized that I silently removed all the tabs in the headers I touched in this pull request 😄

ioquatix · 2020-04-01T04:04:49Z

The way I think will best preserve history:

Expand tabs in one commit.
Move files in another commit.

That way, git blame and other tools should be able to go through rename/whitespace expansion easily.

ioquatix · 2020-04-01T04:09:42Z

Can you tell me if compile performance is affected?

shyouhei · 2020-04-01T04:55:21Z

Well yes, as I mentioned in 7b8969e I see some slowdown in compilation. For instance GitHub Actions reports it compiled current master in 2m 23s. The same thing took 5m 19s for this branch.

ioquatix · 2020-04-01T05:14:18Z

Can you get back speed using precompiled headers?

shyouhei · 2020-04-01T05:33:52Z

I guess so... not tried yet.

ioquatix · 2020-04-03T08:44:53Z

Did you check if incremental build is faster or slower?

shyouhei · 2020-04-06T00:58:25Z

@ioquatix incremental builds are also slower than before. This is because right now, source codes directly includeruby/ruby.h so that they pull everything. Becasue ruby/ruby.h is a public API we cannot make it lightweight. We need to decouple things in each .c files.

ko1 · 2020-04-06T01:39:32Z

I'm not sure the path 3/ is suitable (change them for 4?).

shyouhei · 2020-04-06T01:42:11Z

@ko1 or leave 3 as-is and make new 4. Both are OK.

shyouhei · 2020-04-06T01:49:58Z

My intention to create a subdirectory is to be explicit that files under the new one are implementation details. Those files should not be considered as public APIs. Extension libraries are not expected to do something line #include <ruby/3/intern/select/posix.h>. Instead they should stick to #include "ruby/ruby.h".

... on mswin. According to 3ea6beb, it is a wrong idea to define HAVE_SYS_TIME_H in case of mingw. MakeMakefile#have_header do not know such restriction. It is up to the programmer to properly avoid such situation. :FIXME: I suspect this is rather a bug of have_header.

I am going to modify C codes. Must make sure that does not break things. This changeset adds many CI that basically just make binaries with slightly distinct options each other.

The ruby/ruby.h, our main public header, is the biggest header file except autogenerated ones under enc/unicode. It has roughly 2,500 LOC. It then includes ruby/intern.h, which is ~1,000 LOC. Also included are ruby/defines.h (~500 LOC), ruby/win32.h (~700 LOC), etc. It's too big! Nobody can understand what is going on. We cannot eliminate the contents for backward compatibility, but at least we can split it into many, small parts. I hope this improves understanding of our public APIs. This changeset is a pure refactoring that does not add or remove any single LOC, except for obvious header include guards.

Just a cosmetic update.

This header file improves consistency of macro definitions. Let's use it throughout the project.

This macro is worth defining becuase it elminates literally thousands of lines of copy&paste.

When I made internal/compilers.h I was afraid of global namespace pollutions. Later it turned out that our public header defines __has_attribute anyways. Let's stop worrying. Publicize what we already have internally. Doing so, however, is not a simple copy&paste business. We only had to consider C99 for internal headers. But for public ones, we need to make sure all other C/C++ versions work well.

Minor updates, like adding empty lines for readability.

NORETURN() was missing from the included files, so fixed it.

Reduced use of ruby/backward/2/attributes.h when only one macro from that header was used in the file. Direct use new syntax.

The file was necessary because ruby internals tend to confuse VALUE and void*. That was a bad habit of 20th century. Let's not be loose.

The header is emppty now.

It seems gcc takes more time to compile ruby than before. That also impacts on JIT. Extended timeouts for them.

Don't bother complex preprocessor macros. ST2FIX is exactly what is needed here.

Some experiments revealed that in case of GCC, there are chances for this function to remain not inlined. That impacts runtime performance negatively. Let us force inline the function. It was designed to be inlined, then constant-folded. Calculating ------------------------------------- before after Optcarrot Lan_Master.nes 37.090 39.064 fps Comparison: Optcarrot Lan_Master.nes after: 39.1 fps before: 37.1 fps - 1.05x slower

Not the case of Clang but when compiled using GCC, RB_FL_ABLE shines so brightly on top of perf report. This shall be inlined. Calculating ------------------------------------- before after Optcarrot Lan_Master.nes 39.685 43.147 fps Comparison: Optcarrot Lan_Master.nes after: 43.1 fps before: 39.7 fps - 1.09x slower

shyouhei · 2020-04-07T05:39:05Z

Now I think the branch is ready to be merged.

eregon · 2020-04-08T22:14:52Z

Interesting PR.

Is there any intention to change the set of symbols available through #include "ruby.h"? If so I would like to be part of the discussion as TruffleRuby implements a large part of the C-API and reuses MRI headers pretty much as-is: https://github.com/oracle/truffleruby/tree/08789dca1d40158bc214d9f3a148c79b4c112baa/lib/cext/include/ruby

(this PR will likely cause me merge hell for any modification in ruby.h, but I'll have to do with it)

eregon · 2020-04-08T22:17:50Z

By splitting into small files, it is now possible to convert macros into inline functions. Macros are difficult to read, hard to maintain, impossible to debug. Converting them into inline functions should at least give a degree of relief from them.

Note that this needs to be done quite carefully for compatibility, because C extensions might expect it's a macro (e.g., if they #undef it, or use have_macro), or might expect it's a function (e.g., if assigning to a function pointer variable, or checking with have_func, etc).

shyouhei · 2020-04-10T01:06:20Z

Thank you for reviewing!

By splitting into small files, it is now possible to convert macros into inline functions. Macros are difficult to read, hard to maintain, impossible to debug. Converting them into inline functions should at least give a degree of relief from them.

Note that this needs to be done quite carefully for compatibility, because C extensions might expect it's a macro (e.g., if they #undef it, or use have_macro), or might expect it's a function (e.g., if assigning to a function pointer variable, or checking with have_func, etc).

No functions were deleted. have_func etc must not fail.
All the functions introduced are inline functions, so taking address of them makes no to little sense.
For each inline functions that were formerly macros, I made no-op macros (like this one cb70531#diff-64df151b2239dc342d11589751f3bc47R34). #ifdef or have_macro should work.

ioquatix · 2020-04-13T03:51:03Z

There has been a 30% drop in performance after this was merged. Can you investigate?

master:
Running 2s test @ http://127.0.0.1:9294/
  8 threads and 8 connections
  Thread Stats   Avg      Stdev     Max   +/- Stdev
    Latency   186.71us  417.74us  10.27ms   98.43%
    Req/Sec     6.74k     0.89k    9.00k    57.32%
  110095 requests in 2.10s, 3.99MB read
Requests/sec:  52436.50
Transfer/sec:      1.90MB

2.7.1
Running 2s test @ http://127.0.0.1:9294/
  8 threads and 8 connections
  Thread Stats   Avg      Stdev     Max   +/- Stdev
    Latency   318.78us    1.10ms  20.60ms   95.24%
    Req/Sec     9.08k     2.61k   17.76k    70.12%
  148277 requests in 2.10s, 5.37MB read
Requests/sec:  70576.54
Transfer/sec:      2.56MB

shyouhei · 2020-04-13T09:01:44Z

@ioquatix Can you tell us your environment (OS, compiler, flags passed to configure)?

ioquatix · 2020-04-13T09:12:25Z

It seems to affect optcarrot too.

https://benchmark-driver.github.io/benchmarks/optcarrot/commits.html

My setup has no specific configure flags.

OS: Linux (same used in both tests)
Compiler: Clang (same used in both tests)
Flags: Nothing specific.

shyouhei · 2020-04-13T09:41:43Z

The optcarrot benchmark drop is because it doesn't specify cppflags=-DNDEBUG at configure, @k0kubun confirmed. It seems your situation could be the same. Can you try passing that flag to configure, to see if it fixes the situation?

ioquatix · 2020-04-13T09:44:29Z

I will try it now and report back.

ioquatix · 2020-04-13T09:57:15Z

Ruby flags:

koyoko% make install
	BASERUBY = /home/samuel/.rubies/ruby-2.7.1/bin/ruby --disable=gems
	CC = gcc
	LD = ld
	LDSHARED = gcc -shared
	CFLAGS = -O3 -ggdb3 -Wall -Wextra -Werror=deprecated-declarations -Werror=duplicated-cond -Werror=implicit-function-declaration -Werror=implicit-int -Werror=misleading-indentation -Werror=pointer-arith -Werror=write-strings -Wimplicit-fallthrough=0 -Wmissing-noreturn -Wno-cast-function-type -Wno-constant-logical-operand -Wno-long-long -Wno-missing-field-initializers -Wno-overlength-strings -Wno-packed-bitfield-compat -Wno-parentheses-equality -Wno-self-assign -Wno-tautological-compare -Wno-unused-parameter -Wno-unused-value -Wsuggest-attribute=format -Wsuggest-attribute=noreturn -Wunused-variable -std=gnu99 
	XCFLAGS = -D_FORTIFY_SOURCE=2 -fstack-protector-strong -fno-strict-overflow -DRUBY_DEVEL=1 -fvisibility=hidden -fexcess-precision=standard -DRUBY_EXPORT -fPIE -DCANONICALIZATION_FOR_MATHN -I. -I.ext/include/x86_64-linux -I../include -I.. -I../enc/unicode/12.1.0
	CPPFLAGS =  -DNDEBUG 
	DLDFLAGS = -Wl,--compress-debug-sections=zlib -fstack-protector-strong -pie  
	SOLIBS = -lz -lpthread -lrt -lrt -lgmp -ldl -lcrypt -lm 
	LANG = en_NZ.utf8
	LC_ALL = 
	LC_CTYPE = 
	MFLAGS =

I would say, that probably fixed the issue:

Running 2s test @ http://127.0.0.1:9294/
  8 threads and 8 connections
  Thread Stats   Avg      Stdev     Max   +/- Stdev
    Latency   328.03us    1.14ms  16.96ms   95.53%
    Req/Sec     8.66k     2.80k   17.24k    65.85%
  141326 requests in 2.10s, 5.12MB read
Requests/sec:  67313.64
Transfer/sec:      2.44MB

ioquatix · 2020-04-13T12:25:47Z

I also got some strange reports:

compiling ../debug.c
../complex.c:18: warning: "NDEBUG" redefined
   18 | #define NDEBUG
      |

k0kubun · 2020-04-18T08:46:48Z

include/ruby/3/fl_type.h

+static inline VALUE
+RB_FL_TEST_RAW(VALUE obj, VALUE flags)
+{
+    RUBY3_ASSERT_OR_ASSUME(RB_FL_ABLE(obj));


I guess this is a new assertion introduced in this PR, and I'd like to share this helped to find 04e5695. Thank you!

eregon · 2020-05-06T10:50:08Z

@shyouhei I suspect you might like this somewhat related change in TruffleRuby: oracle/truffleruby@d83ddc3
It's splitting the 3448-lines long C file in 36 files of ~100 lines each 😃

Now it looks like there is a small MRI in https://github.com/oracle/truffleruby/tree/master/src/main/c/cext except most C functions just call back to Ruby and not the other way around.

shyouhei · 2020-05-08T07:54:53Z

Yes, very interesting. Thank you. Let me take a closer look at them. I guess @ko1 might also be interested in it. His current approach (ruby methods are written in ruby, who call C codes via _builtin) seems like the opposite of them.

shyouhei mentioned this pull request Mar 31, 2020

Test #2991 #2993

Closed

shyouhei force-pushed the shyouhei:ruby.h branch from 514b73a to a33b313 Mar 31, 2020

shyouhei force-pushed the shyouhei:ruby.h branch 2 times, most recently from fe56ce8 to 3256ded Apr 6, 2020

shyouhei added 7 commits Mar 2, 2020

add a bunch of compile checks

1445bd5

I am going to modify C codes. Must make sure that does not break things. This changeset adds many CI that basically just make binaries with slightly distinct options each other.

follow public header's style [ci skip]

b8f79be

Just a cosmetic update.

use ruby/3/config.h

61f21ad

This header file improves consistency of macro definitions. Let's use it throughout the project.

implement RUBY3_SYMBOL_EXPORT_BEGIN

99add25

This macro is worth defining becuase it elminates literally thousands of lines of copy&paste.

shyouhei added 11 commits Mar 27, 2020

include/ruby/3/intern/enumerator.h rework

576e3bf

Minor updates, like adding empty lines for readability.

include/ruby/3/variable.h rework

cc2aa97

NORETURN() was missing from the included files, so fixed it.

other minor attribute updates

d275880

Reduced use of ruby/backward/2/attributes.h when only one macro from that header was used in the file. Direct use new syntax.

eliminate include/ruby/backward/2/looser_macros.h

17868a7

The file was necessary because ruby internals tend to confuse VALUE and void*. That was a bad habit of 20th century. Let's not be loose.

delete internal/stdbool.h

a2ae029

The header is emppty now.

.github: extend MinGW timeouts

663cc88

It seems gcc takes more time to compile ruby than before. That also impacts on JIT. Extended timeouts for them.

add NEWS entry for ruby.h split [ci skip]

6954fc6

update dependencies

e12cda1

spec: use ST2FIX

0b692b3

Don't bother complex preprocessor macros. ST2FIX is exactly what is needed here.

shyouhei force-pushed the shyouhei:ruby.h branch from bbb3118 to 12f1e15 Apr 7, 2020

shyouhei deleted the shyouhei:ruby.h branch Apr 8, 2020

k0kubun reviewed Apr 18, 2020

View changes

shyouhei mentioned this pull request May 21, 2020

Use RUBY_DEBUG instead of NDEBUG #3124

Merged

ruby / ruby

Split ruby.h #2991

Split ruby.h #2991

shyouhei commented Mar 30, 2020

shyouhei commented Mar 30, 2020

ko1 commented Mar 30, 2020

shyouhei commented Mar 30, 2020

ioquatix commented Mar 31, 2020 •

edited

shyouhei commented Apr 1, 2020

ioquatix commented Apr 1, 2020

shyouhei commented Apr 1, 2020

ioquatix commented Apr 1, 2020

shyouhei commented Apr 1, 2020

ioquatix commented Apr 1, 2020

ioquatix commented Apr 1, 2020

shyouhei commented Apr 1, 2020

ioquatix commented Apr 1, 2020

shyouhei commented Apr 1, 2020

ioquatix commented Apr 3, 2020

shyouhei commented Apr 6, 2020

ko1 commented Apr 6, 2020

shyouhei commented Apr 6, 2020

shyouhei commented Apr 6, 2020

shyouhei commented Apr 7, 2020

eregon commented Apr 8, 2020

eregon commented Apr 8, 2020 •

edited

shyouhei commented Apr 10, 2020

ioquatix commented Apr 13, 2020 •

edited

shyouhei commented Apr 13, 2020

ioquatix commented Apr 13, 2020

shyouhei commented Apr 13, 2020

ioquatix commented Apr 13, 2020

ioquatix commented Apr 13, 2020

ioquatix commented Apr 13, 2020

This comment has been minimized.

eregon commented May 6, 2020

shyouhei commented May 8, 2020

ruby / ruby

Split ruby.h #2991

Split ruby.h #2991

Conversation

shyouhei commented Mar 30, 2020

shyouhei commented Mar 30, 2020

ko1 commented Mar 30, 2020

shyouhei commented Mar 30, 2020

ioquatix commented Mar 31, 2020 • edited

shyouhei commented Apr 1, 2020

ioquatix commented Apr 1, 2020

shyouhei commented Apr 1, 2020

ioquatix commented Apr 1, 2020

shyouhei commented Apr 1, 2020

ioquatix commented Apr 1, 2020

ioquatix commented Apr 1, 2020

shyouhei commented Apr 1, 2020

ioquatix commented Apr 1, 2020

shyouhei commented Apr 1, 2020

ioquatix commented Apr 3, 2020

shyouhei commented Apr 6, 2020

ko1 commented Apr 6, 2020

shyouhei commented Apr 6, 2020

shyouhei commented Apr 6, 2020

shyouhei commented Apr 7, 2020

eregon commented Apr 8, 2020

eregon commented Apr 8, 2020 • edited

shyouhei commented Apr 10, 2020

ioquatix commented Apr 13, 2020 • edited

shyouhei commented Apr 13, 2020

ioquatix commented Apr 13, 2020

shyouhei commented Apr 13, 2020

ioquatix commented Apr 13, 2020

ioquatix commented Apr 13, 2020

ioquatix commented Apr 13, 2020

This comment has been minimized.

k0kubun Apr 18, 2020 Member

eregon commented May 6, 2020

shyouhei commented May 8, 2020

ioquatix commented Mar 31, 2020 •

edited

eregon commented Apr 8, 2020 •

edited

ioquatix commented Apr 13, 2020 •

edited

k0kubun Apr 18, 2020
Member