我有一个包含下一列的表格:
First Name,
Last Name,
Age
让我们假设我们有
- 2 人年龄 = 25
- 6 人年龄 = 26
- 10 人年龄 = 27
我想获得记录集,每个年龄最多可以有 N 条记录。(记录可能是随机的)
你能建议吗?
例如如果 N = 3 那么我们将有
2 records with age = 25
3 records with age = 26
3 records with age = 27
我有一个包含下一列的表格:
First Name,
Last Name,
Age
让我们假设我们有
我想获得记录集,每个年龄最多可以有 N 条记录。(记录可能是随机的)
你能建议吗?
例如如果 N = 3 那么我们将有
2 records with age = 25
3 records with age = 26
3 records with age = 27
我会这样使用 ROW_NUMBER 函数:
DECLARE @TopN INT;
SET @TopN = 3;
SELECT ...
FROM
(
SELECT ...,
RowNum = ROW_NUMBER() OVER(PARTITION BY t.Age ORDER BY t.LastName, t.FirstName)
FROM MySchema.MyTable AS t
) src
WHERE src.RowNum <= @TopN
如果您安装了AdventureWorks 数据库(我使用 AdventureWorks2008),那么您可以使用此脚本进行测试:
-- Because Person.Person table doesn't has an `Age` column
-- I create a new table (dbo.Person) having following columns:
-- BusinessEntityID, LastName, FirstName and Age columns
SELECT p.BusinessEntityID, p.LastName, p.FirstName,
1 + ABS(CHECKSUM(NEWID())) % 100 AS Age
INTO dbo.Persons
FROM Person.Person p;
GO
/*
ALTER TABLE dbo.Persons
ADD CONSTRAINT PK_Persons_BusinessEntityID
PRIMARY KEY (BusinessEntityID)
*/
DECLARE @TopN INT;
SET @TopN = 3;
SELECT src.BusinessEntityID, src.LastName, src.FirstName, src.Age, src.RowNum
FROM
(
SELECT p.BusinessEntityID, p.LastName, p.FirstName, p.Age,
RowNum = ROW_NUMBER() OVER(PARTITION BY p.Age ORDER BY p.LastName, p.FirstName)
FROM dbo.Persons AS p
) src
WHERE src.RowNum <= @TopN
ORDER BY src.Age, src.LastName, src.FirstName;
-- DROP TABLE dbo.Persons
结果:
BusinessEntityID LastName FirstName Age RowNum
---------------- --------- ---------- --- ------
...
10905 Allen Kaitlyn 30 1
15052 Alonso Gina 30 2
5505 Alonso Jessie 30 3
20216 Alexander Alyssa 31 1
3789 Allen Wyatt 31 2
2798 Alonso Alfredo 31 3
16850 Adams Gabriel 32 1
4747 Adams Ian 32 2
7761 Alexander Jacqueline 32 3
...
您可以使用该ROW_NUMBER()
函数来模拟此行为:
SELECT t.*
FROM (SELECT t.*, ROW_NUMBER() OVER (PARTITIN BY age ORDER BY 1) as rk
FROM some_table
) t
WHERE rk <= 3;