asked on

using regexp_instr to compare values, need a little help in Oracle Sql

I used Regexp_substr to good effect recently (with EE tutoring), now I need a little help with regexp_instr.

Here's the neat trick that EE helped me with:
regexp_substr(REPLACE(p_mtg_time_in,' ',''),'([MWTRFS]+[0-9]+,?)+')

this is a scheduling application at a school, with class periods 1 - 10, meeting on Mon - Sat. the regexp_substr took these values to convert:
MTWR1256,FBYARRANG produces MTWR1256 (stripping out BYARRANG and also the "F" because it's not followed by 123456789 (or 10).

Now I want to compare values and say "Ok" or "Bad !". Here are a bunch of examples:
A B C
F12      F1 good
F12      F2 good

A B C
MW6,BY ARRANGEMENT      M6 good
MW6,BY ARRANGEMENT      W6 good
MW6,BY ARRANGEMENT R6 bad

A B C
MWF2,R34      M2 good
MWF2,R34      W2 good
MWF2,R34      R3 good
MWF2,R34      R4 good
MWF2,R34      M3 bad
MWF2,R34      W3 bad

A B C
T10,R9      T10 good
T10,R9      T5 BAD
T10,R9      T6 BAD
T10,R9      R3 BAD
T10,R9      R4 BAD
T10,R9      R9 good
T10,R9      R10 BAD - 10 is not preceded by R !
T10,R9      T9 BAD - 9 is not preceded by T since R breaks the pattern !

column B is always the day (M,T,W,R,F,S) followed by the period (1 - 10)

so here are some rules
if column B is found in Column A, it's good (F1 = F1)

you can find other letters before the number (MWF2,R34 | M2 good !)

you can find other numbers after the letter (F12 |       F2 good !)

but MWF2,R34 |      M3 bad M3 isn't found ! only R3 and R4 would be good

MWF23,R34 |      M3 ok
M2 ok
M4 bad ! because of the intervening R
R2 bad ! there's no 2 after R

R3 ok
R4 ok

F4 bad ! because of the intervening R
F3 ok
F2 ok

also, the comma is irrelevant.

SOLUTION

slightwv (䄆 Netminder)

membership

This solution is only available to members.

To access this solution, you must be a member of Experts Exchange.

Start Free Trial

Sean Stuber

that's quite a bit trickier than the first, but still doable

  SELECT a,
         b,
         MAX(
             CASE
                 WHEN REGEXP_REPLACE(
                          REGEXP_SUBSTR(
                              a,
                              '[^,]+',
                              1,
                              COLUMN_VALUE
                          ),
                          '[^' || b || ']'
                      ) = b
                 THEN
                     'good'
                 ELSE
                     'bad'
             END
         )
             c
    FROM (SELECT RTRIM(REGEXP_SUBSTR(REPLACE(a, ' ', ''), '([MWTRFS]+[0-9]+,?)+'), ',') a, b, c
            FROM yourtable) x,
         TABLE(
                 SELECT COLLECT(LEVEL)
                   FROM DUAL
             CONNECT BY LEVEL <= LENGTH(x.a) - LENGTH(REPLACE(x.a, ',')) + 1
         )
GROUP BY a, b

Open in new window

or, if the max number of csv values in A is 2 then try this...

SELECT a,
       b,
       GREATEST(
           CASE
               WHEN REGEXP_REPLACE(
                        REGEXP_SUBSTR(
                            a,
                            '[^,]+',
                            1,
                            2
                        ),
                        '[^' || b || ']'
                    ) = b
               THEN
                   'good'
               ELSE
                   'bad'
           END,
           CASE
               WHEN REGEXP_REPLACE(
                        REGEXP_SUBSTR(
                            a,
                            '[^,]+',
                            1,
                            1
                        ),
                        '[^' || b || ']'
                    ) = b
               THEN
                   'good'
               ELSE
                   'bad'
           END
       )
           c
  FROM (SELECT RTRIM(REGEXP_SUBSTR(REPLACE(a, ' ', ''), '([MWTRFS]+[0-9]+,?)+'), ',') a, b, c
          FROM yourtable)