asp.net - Creating a view on a non-relational database

Question

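I have a problem and I hope someone can help me. I'm working on a poorly designed database, and I have no control over changing anything in it. I have a "Books" table, and each book can have one or more authors. Unfortunately, the database is not fully relational (please don't ask me why, I've been asking the same question from the start). The "Books" table has fields named "Author_ID" and "Author_Name", so when a book is written by 2 or 3 authors, their IDs and names are concatenated in the same record, separated by asterisks. Here is a demo: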

ID_BOOK | ID_AUTHOR |       NAME AUTHOR       |  Adress        |  Country        |
----------------------------------------------------------------------------------
001     |01         | AuthorU                 | AdrU           | CtryU           |
----------------------------------------------------------------------------------
002     |02*03*04   | AuthorX*AuthorY*AuthorZ | AdrX*NULL*AdrZ | NULL*NULL*CtryZ |
----------------------------------------------------------------------------------

I need to create a view on this table that gives me this result:

ID_BOOK | ID_AUTHOR |       NAME AUTHOR       | Adress         | Country         |
----------------------------------------------------------------------------------
001     |01         | AuthorU                 | AdrU           | CtryU           |
----------------------------------------------------------------------------------
002     |02         | AuthorX                 | AdrX           | NULL            |
----------------------------------------------------------------------------------
002     |03         | AuthorY                 | NULL           | NULL            |
----------------------------------------------------------------------------------
002     |04         | AuthorZ                 | AdrZ           | CtryZ           |
----------------------------------------------------------------------------------

I'll keep trying, and I hope someone can help me with at least a few hints. Thank you all very much.

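After applying the solutions you gave, I ran into this problem. I'm trying to solve it and hope you can help. When the SQL query runs, the CLOB fields come out misaligned when some of them contain NULL values. The result should look like the one above, but I get this instead: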

ID_BOOK | ID_AUTHOR |       NAME AUTHOR       | Adress         | Country         |
----------------------------------------------------------------------------------
001     |01         | AuthorU                 | AdrU           | CtryU           |
----------------------------------------------------------------------------------
002     |02         | AuthorX                 | AdrX           | CtryZ           |
----------------------------------------------------------------------------------
002     |03         | AuthorY                 | AdrZ           | NULL            |
----------------------------------------------------------------------------------
002     |04         | AuthorZ                 | NULL           | NULL            |
----------------------------------------------------------------------------------

Why are the NULL values put at the end? Thank you.

score 2 · Accepted Answer

In 11g you can use recursive subquery factoring for this:

with data (id_book, id_author, name, item_author, item_name, i)
 as (select id_book, id_author, name,
            regexp_substr(id_author, '[^\*]+', 1, 1) item_author, 
            regexp_substr(name, '[^\*]+', 1, 1) item_name,
            2 i 
       from books
     union all
     select id_book, id_author, name,
            regexp_substr(id_author, '[^\*]+', 1, i) item_author, 
            regexp_substr(name, '[^\*]+', 1, i) item_name, 
            i+1
       from data
      where regexp_substr(id_author, '[^\*]+', 1, i) is not null)
select id_book, item_author, item_name
  from data;

Fiddle

score 1 · Accepted Answer

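I answered a similar question here a few weeks ago. That answer explains the general approach (I hope), so I'll skip the explanation here. This query does the trick; it uses REGEXP_SUBSTR and takes advantage of its "occurrence" parameter to pick out the individual author IDs and names: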

SELECT
 ID_Book,
 REGEXP_SUBSTR(ID_Author, '[^*]+', 1, Counter) AS AuthID,
 REGEXP_SUBSTR(Name_Author, '[^*]+', 1, Counter) AS AuthName
FROM Books
CROSS JOIN (
  SELECT LEVEL Counter
    FROM DUAL
    CONNECT BY LEVEL <= (      
      SELECT MAX(REGEXP_COUNT(ID_Author, '[^*]+'))
      FROM Books))
WHERE REGEXP_SUBSTR(Name_Author, '[^*]+', 1, Counter) IS NOT NULL
ORDER BY 1, 2

There's a Fiddle with your data plus one extra row here.
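A side note on the "occurrence" parameter, sketched in Python rather than Oracle SQL so it can be run anywhere. The pattern [^*]+ matches only non-empty runs of characters, so an empty field between two asterisks is never counted as an occurrence. The sketch assumes (this is not confirmed in the question) that missing values are stored as empty strings between delimiters, for example AdrX**AdrZ. Under that assumption later values shift forward and the gaps end up last, which would be consistent with the misaligned result reported in the question. The helper names are hypothetical.

```python
import re

def nth_token_regex(s, n):
    """Rough analogue of REGEXP_SUBSTR(s, '[^*]+', 1, n): the n-th non-empty run."""
    matches = re.findall(r"[^*]+", s)
    return matches[n - 1] if n <= len(matches) else None

def nth_field_positional(s, n):
    """n-th field by position; empty fields keep their slot and map to None."""
    fields = s.split("*")
    if n > len(fields):
        return None
    return fields[n - 1] or None

# Assumed storage of the three values AdrX, NULL, AdrZ (hypothetical sample).
address = "AdrX**AdrZ"

by_regex = [nth_token_regex(address, n) for n in (1, 2, 3)]
by_position = [nth_field_positional(address, n) for n in (1, 2, 3)]

print(by_regex)      # ['AdrX', 'AdrZ', None]
print(by_position)   # ['AdrX', None, 'AdrZ']
```

If the data really looks like this, a positional split is the behaviour to aim for, since it keeps each value lined up with its author.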

Addendum: the OP has Oracle 9, not 11, so the regular expression functions won't work. Here is how to do the same task without regular expressions...

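Without REGEXP_COUNT, the best way to count the authors is to count the asterisks and add one. To count the asterisks, take the length of the string and subtract its length with all the asterisks removed: LENGTH(ID_Author) - LENGTH(REPLACE(ID_Author, '*')).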
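The counting trick can be checked outside the database. This is a minimal Python sketch of the same arithmetic, not Oracle code; the function name author_count is made up for illustration.

```python
def author_count(id_author):
    """Number of delimiters plus one, mirroring
    LENGTH(s) - LENGTH(REPLACE(s, '*')) + 1 from the text above."""
    without_asterisks = id_author.replace("*", "")
    return len(id_author) - len(without_asterisks) + 1

# The two ID_AUTHOR values from the question's sample data.
counts = {s: author_count(s) for s in ("01", "02*03*04")}
print(counts)  # {'01': 1, '02*03*04': 3}
```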

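Without REGEXP_SUBSTR, you need to use INSTR to find the positions of the asterisks and then SUBSTR to pull out the author IDs and names. This is a bit more involved. Consider these author columns from your original post:

AuthorU
AuthorX*AuthorY*AuthorZ

AuthorX is between the start of the string and the first asterisk.
AuthorY is surrounded by asterisks.
AuthorZ is between the last asterisk and the end of the string.
AuthorU is all alone, not surrounded by anything.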

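So the opening piece (WITH AuthorInfo AS... below) adds an asterisk to the beginning and the end, so that every author name (and ID) is surrounded by asterisks. It also gets the author count for each row. For the sample data in your original post, the opening piece produces this: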

ID_Book  AuthCount  ID_Author   Name_Author
-------  ---------  ----------  -------------------------
001              1  *01*        *AuthorU*
002              3  *02*03*04*  *AuthorX*AuthorY*AuthorZ*
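To make the "surround everything with asterisks" idea concrete, here is a small Python sketch of the extraction step. It is only an illustration of the logic, not Oracle code: instr is a simplified stand-in for Oracle's INSTR(s, sub, 1, occurrence), and nth_item is a hypothetical helper name.

```python
def instr(s, sub, occurrence):
    """1-based position of the given occurrence of sub in s, or 0 if absent
    (simplified stand-in for Oracle's INSTR(s, sub, 1, occurrence))."""
    pos = -1
    for _ in range(occurrence):
        pos = s.find(sub, pos + 1)
        if pos == -1:
            return 0
    return pos + 1

def nth_item(raw, n):
    """n-th item of an asterisk-delimited string, after wrapping it in asterisks."""
    wrapped = "*" + raw + "*"
    start = instr(wrapped, "*", n)       # asterisk just before the item
    end = instr(wrapped, "*", n + 1)     # asterisk just after the item
    if start == 0 or end == 0:
        return None                      # fewer than n items
    return wrapped[start:end - 1]        # text strictly between the two asterisks

print([nth_item("02*03*04", n) for n in (1, 2, 3, 4)])  # ['02', '03', '04', None]
print(nth_item("01", 1))                                # 01
```

Because the lookup is positional, an empty field between two asterisks comes back as an empty string in this sketch instead of shifting the later items forward.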

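Then comes the join with the "Counter" table and the SUBSTR machinations that pull out the individual names and IDs. The final query looks like this: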

WITH AuthorInfo AS (
  SELECT
    ID_Book,
    LENGTH(ID_Author) -
        LENGTH(REPLACE(ID_Author, '*')) + 1 AS AuthCount,
    '*' || ID_Author || '*' AS ID_Author,
    '*' || Name_Author || '*' AS Name_Author
  FROM Books
)
SELECT
  ID_Book,
  SUBSTR(ID_Author,
    INSTR(ID_Author, '*', 1, Counter) + 1,
    INSTR(ID_Author, '*', 1, Counter+1) - INSTR(ID_Author, '*', 1, Counter) - 1) AS AuthID,
  SUBSTR(Name_Author,
    INSTR(Name_Author, '*', 1, Counter) + 1,
    INSTR(Name_Author, '*', 1, Counter+1) - INSTR(Name_Author, '*', 1, Counter) - 1) AS AuthName
FROM AuthorInfo
CROSS JOIN (
  SELECT LEVEL Counter
    FROM DUAL
    CONNECT BY LEVEL <= (SELECT MAX(AuthCount) FROM AuthorInfo))
WHERE AuthCount >= Counter
ORDER BY ID_Book, Counter

There's a Fiddle here.

score 0 · Accepted Answer

Also:

SELECT distinct id_book
     , trim(regexp_substr(id_author, '[^*]+', 1, LEVEL)) id_author
     , trim(regexp_substr(author_name, '[^*]+', 1, LEVEL)) author_name
 FROM yourtable
CONNECT BY LEVEL <= regexp_count(id_author, '[^*]+')
ORDER BY id_book, id_author
/

ID_BOOK    ID_AUTHOR    AUTHOR_NAME
------------------------------------
001        01           AuthorU
002        02           AuthorX
002        03           AuthorY
002        04           AuthorZ
003        123          Jane Austen
003        456          David Foster Wallace
003        789          Richard Wright

Without regular expressions:

SELECT str, SUBSTR(str, substr_start_pos, substr_end_pos) final_str
  FROM
 (
  SELECT str, substr_start_pos
       , (CASE WHEN substr_end_pos <= 0 THEN (Instr(str, '*', 1)-1) 
            ELSE substr_end_pos END) substr_end_pos
    FROM
   (
   SELECT distinct '02*03*04' AS str
       , (Instr('02*03*04', '*', LEVEL)+1) substr_start_pos
       , (Instr('02*03*04', '*', LEVEL)-1) substr_end_pos           
    FROM dual
   CONNECT BY LEVEL <= length('02*03*04')
   )
  ORDER BY substr_start_pos
  )
 /

STR         FINAL_STR
---------------------
02*03*04    02
02*03*04    03
02*03*04    04

score 0 · Accepted Answer

If you have an authors table, you can do this:

select b.id_book, a.id_author, a.NameAuthor
from books b left outer join
     authors a
     on '*'||NameAuthor||'*' like '%*'||a.author||'*%'
