I've been doing a *lot* of digging around in the L System tonight and I've come to the conclusion that the 512 bytes of palette ram MUST be stored internally in the TC0090LVC and is NOT stored in the 4 x 32Kb SRAMs. I did this by replacing the L System's 4 x 32Kb SRAMs with 4 x 32Kb NVRAMs (DS1230Y), writing lots of different, short test programs to write various values through the mapped memory ranges and then dumping the contents of the NVRAMs after each test. I know now exactly how the pages are mapped in the 4 x 32Kb rams. Each 32Kb ram holds every fourth byte of data as seen in a linear map.
E.g. byte @ offset 0 in IC1 holds byte mapped to $c000
byte @ offset 0 in IC2 holds byte mapped to $c001
byte @ offset 0 in IC3 holds byte mapped to $c002
byte @ offset 0 in IC4 holds byte mapped to $c003
byte @ offset 1 in IC1 holds byte mapped to $c004
byte @ offset 1 in IC2 holds byte mapped to $c005
byte @ offset 1 in IC3 holds byte mapped to $c006
byte @ offset 1 in IC4 holds byte mapped to $c007
and so on
Each 128 bytes of sram is split into 32 x 4 bytes slices, each slice represents 4 bytes from a page in an 'every fourth byte' configuration. (e.g. IC1 holds bytes mapped to $c000/$c004/$c008/$c00c, IC2 holds bytes mapped to $c001/$c005/$c009/$c00d, etc). This pattern is repeated every 128 bytes throughout each ram.
Hence each 32Kb sram holds one quarter of each of the 32 available 4Kb pages. So when you combine all 4 x 32Kb srams you get the full 32 x 4Kb pages.
32768 / 128 = 256 slices
Each slice holds 4 bytes of each of the 32 pages
256 * 4 = 1024 bytes
4 rams in total
1024 * 4 = 4096 bytes (the size of each page)
Nowhere during my extensive testing did I see any evidence of palette ram data in the NVRAMs, even though palette ram was banked into CPU addressable space in all tests that I did. This leads me to believe that the palette ram MUST be internal to the CPU package and banking it in merely redirects palette ram reads/writes to the internal CPU palette ram rather than the external srams. This kind of makes sense as palette ram is usually the last thing that tile/sprite data passes through before being output to the edge connector/video connector and because the RGB data is output directly from the TC0090LVC to the edge connector (via 3 transistors/6 resistors) it make sense to have the palette ram internal to the CPU for speed.
Kind of makes me want to get a TC0090LVC decapped to prove that I'm right....