data:image/s3,"s3://crabby-images/71fa1/71fa1ab29bb0ced438a5e8a6fc20c5215db4cac7" alt="image"
Retrieval Queries
- Basic form of the Retrieval Queries:
data:image/s3,"s3://crabby-images/60cf4/60cf4a85d26d3d7a18fbeb382321f8266f849097" alt="image"
-
is a list of attribute names whose values are to be retrieved by the query
-
is a list of the relation names to process query
-
is a conditional (Boolean) expression that identifies the tuples to be retrieved by the query
Simple Query : Single Table
- Retrieve the birthdate and address of employees whose sex is ‘male’ and whose salary is greater than 30,000.
data:image/s3,"s3://crabby-images/4b32a/4b32a71e93a807232e4bb3966e94015361570ba3" alt="image"
- SELECT-clause specifies PROJECT($\pi$);
- WHERE-clause specifies SELECT ($\sigma$)
- (참고: SQL의 SELECT는 relational algebra의 $\sigma$와 다름; $\pi$ 를 의미)
- This SQL expression is equivalent to:
- $\pi$ bdate, address ($\sigma$ (salary > 30000) AND (sex = ‘male’) EMP)
Optional WHERE / Use of *
- ‘WHERE’ is optional; That means “no condition” is applied; Thus, all tuples of the table are retrieved.
data:image/s3,"s3://crabby-images/19789/19789c3dcf45883c7456b164835ef0e5b293e7cc" alt="image"
- To retrieve all the attribute values of the selected tuples, the symbol * is used, which means all the attributes
data:image/s3,"s3://crabby-images/c74be/c74be18829df55462bf1fed0aaf159117a7eaff1" alt="image"
DISTINCT
- SQL does not remove duplicate tuples in query result: Why?
- Duplicates removal is expensive; (Sort and elimination cost!)
- Users want to see duplicate tuples in the result of query.
- When aggregate function (ie., sum, avg) is applied, we need to keep duplicated tuples.
- To remove duplicates in a query result, DISTINCT is used.
- Q1 may have duplicate values, but Q2 does not have any duplicates.
data:image/s3,"s3://crabby-images/9c359/9c359d0246de74b2fa3cc4d0da1818dbfe06844f" alt="image"
- What do think about the following query Q3?
data:image/s3,"s3://crabby-images/4758d/4758dff450bec00ca9cb60f1ea1919f890793870" alt="image"
Multiple Table Query
- Retrieve SSN and age of employees who work for ‘research’ dept.
data:image/s3,"s3://crabby-images/be1d2/be1d2487120ac098ba603e051f6dfb6be270e6f6" alt="image"
- (DNO = DNUMBER) condition specifies the JOIN;
- This SQL expression is equivalent to :
data:image/s3,"s3://crabby-images/865c4/865c476b36d2f43e7c55417d59db8350388f7afb" alt="image"
- For project name ‘product X’, list the project number, controlling department number, and the department manager’s salary
data:image/s3,"s3://crabby-images/9aa51/9aa51d96db8f6e9019f1e5e427a0675f8dc4e045" alt="image"
- Notice that there are two join conditions
- (DNUM = DNUMBER) relates a project to its controlling department
- (Mgr-SSN = SSN) relates the controlling department to the employee who manages that department
Rename Operation
- We can rename for both relation and attributes, taking the form;
- The AS clause can appear in both the SELECT and FROM clause.
data:image/s3,"s3://crabby-images/42a58/42a58bf56c3d4ace3957591fea8fe3fe1d3d28fb" alt="image"
Use of Tuple Variables
- We can use tuple variables to refer any tables in SQL query;
- Retrieve name and dept. phone of employees who work for ‘research’ dept.
data:image/s3,"s3://crabby-images/4d51e/4d51eab57eb008dd5d69fd13bbd26ab249282751" alt="image"
- The alternative table names e and d are called “tuple variables”
- This SQL expression is equivalent to :
data:image/s3,"s3://crabby-images/9ef23/9ef23116f7063dca0268bb2d86d6a6206f5158d9" alt="image"
- Tuple variables are very useful for comparing two tuples in the same relation.
- In this case, two tuple variables are given to the same relation.
- For each employee, retrieve the employee’s SSN, and the name of his (or her) direct supervisor.
data:image/s3,"s3://crabby-images/2bb30/2bb30dad77baeb8a84ce8070dca4b82677170486" alt="image"
- The alternative table names e and s are called “tuple variables”
- We can think of e and s as two different copies of EMPLOYEE;
- e represents employees in role of ’workers’
- s represents employees in role of ‘bosses
- Retrieve SSN and names of all employees whose salary greater than Smith’s salary.
data:image/s3,"s3://crabby-images/de0bc/de0bc85a23dcebf1b342f8ef6048c2ae9fd51b6a" alt="image"
- We can also think of e and b as two different copies of EMPLOYEE;
- e represents in the role of employees
- s represents in the role of employee(s) with name = ‘smith’
- Retrieve the names of two employees (i.e., married) who share the same address.
data:image/s3,"s3://crabby-images/f653d/f653d8c4e5adc20760b74befdaefa7594f709aa4" alt="image"
- The second condition (e1.name < e2.name) means that the name of first employee precedes the name of second employee alphabetically.
- If this condition is omitted, tuple variables e1 and e2 both could refer to the same tuple. Thus, the query result produces each employee name paired with itself.
- Question: If use “<>” instead of “< “,’ what will happen?
Substring Pattern Matching
- SQL provides LIKE comparison operator for pattern matching. We describe patterns by using two special characters.
- Percent(%): The % matches any “substring”.
- Underscore(_): The _ matches any “character”.
- Patterns are case sensitive; that is, upper case characters do not match lower case characters, or vice versa; Some examples are;
- ‘Blue%’ matches any string beginning with ‘Blue’.
- ‘%idge% matches any string containing “idge” as substring.
- Examples: Blueridge, Ridgeriver, ‘Rockbridge, Ridgeway, . .
- ‘_ _ _’ matches any string of exactly 3 characters.
- ‘_ _ _%’ matches any string of at least 3 characters.
- Retrieve all employees whose address is in Houston, Texas
data:image/s3,"s3://crabby-images/73c04/73c049d316dacdb693ff7bbe5392f37553cc10e8" alt="image"
- Retrieve all employees who were born during the 1970s.
data:image/s3,"s3://crabby-images/1bd77/1bd775f2f0504e58272443944c0f9562081047cd" alt="image"
- Note that NOT LIKE also can be used in a similar way
Arithmetic Operations
- We can use arithmetic operations in query; The standard operators +, -, *, and / can be applied to attributes with numeric data types;
- Show resulting salaries for every employee working for the research department is given 10% raise.
data:image/s3,"s3://crabby-images/def0e/def0e4da81da2eb52df35819cdba79fec15893fc" alt="image"
- SQL provides BETWEEN operator to simplify comparison operators.
- For example, retrieve all employees whose salary is between 30,000 and 40,000. (that is, salary >= AND salary <= 40000)
data:image/s3,"s3://crabby-images/0bfc2/0bfc2b4d1cb24070f4a3ac6fd656a70f5ae3622a" alt="image"
Set Operators
- UNION, INTERSECT, EXCEPT
- Each of the above operations automatically removes duplicated tuples;
- To retain all duplicated tuples, use the corresponding operations UNION ALL, INTERSECT ALL, EXCEPT ALL.
- Suppose a tuple t occurs m times in R and n times in S;
- R UNION ALL S : (m + n) times occurs.
- R INTERSECT ALL S : Min(m, n) times occurs.
- R EXCEPT ALL S : Max(0, m – n) times occurs
- Retrieve SSN and name of employees with age > 40 or sex = ‘male’.
data:image/s3,"s3://crabby-images/80c5a/80c5a68d1d72631f573a1db66535507518ddc9c3" alt="image"
data:image/s3,"s3://crabby-images/3d131/3d13141eb0a71fec5933d858a877e8175fcaeabc" alt="image"
- Retrieve SSN of employees with any no dependents.
data:image/s3,"s3://crabby-images/513e9/513e9bd338c63fc1e46448d78bf96e668492fb86" alt="image"
- Retrieve all project numbers for projects that involve an employee whose name is ‘Bob’ as a worker, and as a manager of the department that controls the project.
data:image/s3,"s3://crabby-images/ae3c9/ae3c98e330a904fbf914de15ec3f7bc8d390fde7" alt="image"
NULL Values
- NULL represents missing values; It has 3 different meanings :
- (1) Unknown (exists, but unknown):
- Someone has ‘blood-type’, but currently is not known.
- (2) Unavailable (exists, but is withheld):
- Someone has ‘phone-number’ but does not want it to be listed.
- (3) No applicable (undefined or non-existent):
- ‘spouse’ may be NULL for unmarried someone.
- It is not possible to distinguish these 3 meanings: For example, Suppose many persons have “NULL value for phone-number”; How can we interpret this?
- Let X have the value NULL;
- (1) Any arithmetic operation with NULL returns “NULL”.
- X + 5, X – 5, X * 5, . . .
- (2) Any comparison with NULL returns “UNKOWN” :
- X < 5, X = 5, X = NULL, X <> 5, . . .
- Note that “UNKOWN” is another truth value like TRUE and FALSE.
- SQL provides IS NULL (or IS NOT NULL) for dealing NULL values;
- “X IS NULL” returns “TRUE” if X has NULL value; It returns “FALSE” otherwise.
- SQL uses a three-valued logic: TRUE, FALSE, UNKNOWN
data:image/s3,"s3://crabby-images/e633c/e633c193e97194bbd2cf7039456daf44e8149584" alt="image"
- To understand how 3-valued logic works:
- Let T = 1, F = 0, U = ½,
- AND = MIN, OR = MAX, NOT(X) = 1 - X
- Example:
- T AND (F OR NOT(U)) = MIN(1, MAX(0, (1 - ½ )))
- = MIN(1, MAX(0, ½ ) = MIN(1, ½ ) = ½ (= U).
- Result of WHERE condition is treated as FALSE if it evaluates to UNKNOWN.
- Tuples that are evaluated to only TRUE are selected.
- Thus, tuples with either FALSE or UNKNOWN values are not selected from the result.
data:image/s3,"s3://crabby-images/023bf/023bff9be7242ce25052a350fe72ff63edfc8c02" alt="image"
- Is this tuple selected? Answer: NO!
data:image/s3,"s3://crabby-images/5a10f/5a10f0fe8ac6c08135d672380cd51e7d17048508" alt="image"
- In this case, we need to use IS NULL to check whether the salary attribute has NULL or not.
data:image/s3,"s3://crabby-images/b7a58/b7a58445c412d896cf50700fe3d9287ae099ef26" alt="image"
- Now, all tuples are selected!
Use of JOIN Operators
- SQL permits users to specify join operators in FROM clause; This is easier to understand than mixing together select and join conditions in WHERE;
data:image/s3,"s3://crabby-images/a33cf/a33cf112abbd8e92397eb6f591dd8074e79ba037" alt="image"
Nested Queries (= Subqueries)
- A query that is part of another query is called a subquery (or nested query);
- Subquery can have many nested subquries as we want;
- There are many other ways that subqury can be used;
- (1) Subquery can return constant value(s), and this value can be compared with another value in WHERE clause;
- (2) Subquery can return relation (a set of tuples) that can be used in various ways in WHERE clause;
- (3) Subquery can have their relations appear in FROM clause;
- A nested query can be specified within the WHERE-clause of another query, called the outer query
- Single query :
data:image/s3,"s3://crabby-images/ec18e/ec18e31d2f95ed31e1d5aa8634e3dab9f89e1826" alt="image"
- Nested Query : Having executed nested query returns ‘research’ number; Then, outer query returns SSN(s) only if its DNO is equal to result of nested query.
data:image/s3,"s3://crabby-images/010b0/010b068aaf88f5a47da2097696fa832554f94271" alt="image"
- Question: What if there are more than one research dept numbers?
Nested Queries : IN
- A nested query can be compared with outer query by using IN operator;
- (v IN V ) compares a value v (in outer query) with the value(s) V (in nested query), and returns “true” if v is included in V. Otherwise, return “false”;
data:image/s3,"s3://crabby-images/37f2e/37f2ec28408edf12b389e2cbfe0971c79f4fc5a1" alt="image"
- SQL allows use of (sub)tuples of values in comparisons by placing them within parenthesis;
- Get SSNs of employees who work for the same (project, hours) on some project that employee with SSN = ‘123456789’ works on;
data:image/s3,"s3://crabby-images/912df/912df81477432af65db90613f81a8ddead9f2787" alt="image"
- Get names of employees who work for the same (project, hours) on some project that employee with SSN = ‘123456789’ works on;
data:image/s3,"s3://crabby-images/9e425/9e42504ed7a3a8dbb6f32fcfce0cc3296c435810" alt="image"
Nested Queries : NOT IN
- (v NOT IN V ) compares a value v (in outer query) with the value(s) V (in nested query), and returns “true” if v is not included in V. Otherwise, return “false”;
- Retrieve names of employees with no dependents.
data:image/s3,"s3://crabby-images/95832/95832ee934462fa7093db12139e905b5742e7acc" alt="image"
- Retrieve names of employees whose names are neither Smith nor Jones.
data:image/s3,"s3://crabby-images/f34d3/f34d34ffbbc2042372aecff3c5474515a38c13e1" alt="image"
Nested Queries : Use of Tuple Variables
- Nested Queries also can use tuple variables;
- Retrieve names of employees who have a dependent with the same sex as the employees.
data:image/s3,"s3://crabby-images/4b356/4b356a9948e5bb71e08d046da63bead078c8e43b" alt="image"
Nested Queries : SOME (= ANY)
- SOME (= ANY) can be used with {<, >, =, <=, >=, <>} operators:
- For example, (v > SOME V ) returns “true” if the value v is greater than some of the values in a set V.
- Get SSNs of employees whose salary is greater than salary of some employees in department 5:
data:image/s3,"s3://crabby-images/eed6d/eed6dddeb54eca5845284d310d5ac086cf22655a" alt="image"
Example : Meaning of SOME
- Example
- (5 < SOME {0, 5, 7} ) = true ) = false
- (5 <> SOME {0, 5, 7}) = true
- (5 < SOME {1, 3, 5}
- (5 = SOME {0, 5, 7} ) = true
- 참조: (= SOME)은 IN 과 동일한 표현임. 반면에, (<> SOME)은 NOT IN 과 동일하지 않음.
Nested Queries : ALL
- ALL can be used with {<, >, =, <=, >=, <>} operators:
- For example, (v > ALL V ) returns “true” if the value v is greater than all the values in a set V
- Get SSNs of employees whose salary is greater than salary of all the employees in department 5:
data:image/s3,"s3://crabby-images/a2a2f/a2a2f6efdb74039c03c2a11c0ea054223134d2e5" alt="image"
Example : Meaning of ALL
- Example
- (5 < ALL {0, 5, 6} ) = false) = true
- (5 <> ALL {4, 6, 8} ) = true
- (5 > ALL {0, 3, 4}
- (5 = ALL {4, 5, 7} ) = false
- 참조: (<> ALL)은 NOT IN 과 동일한 표현임. 반면에, (= ALL)은 IN 과 동일하지 않음.
Nested Queries : EXISTS
- EXISTS(Q) returns “true” if there is at least one tuple (= not empty) in the result of a nested query Q. Otherwise, it returns “false”
- Retrieve SSNs of employees who have dependent(s).
data:image/s3,"s3://crabby-images/9c907/9c907a3086c826e2d1e7dbadf8b447052a8106bd" alt="image"
- Retrieve names of employees who have a dependent with the same sex as the employees
data:image/s3,"s3://crabby-images/e1839/e18398407417086a7abfa6a6dca2cf447c2b5e64" alt="image"
Nested Queries : NOT EXISTS
- NOT EXISTS(Q) returns “true” if there no tuple (= empty) in the result of a nested query Q. Otherwise, it returns “false”.
- Retrieve SSNs of employees who have no dependent.
data:image/s3,"s3://crabby-images/6ba47/6ba4759bcef82295bb6dfb07a92257f02762132d" alt="image"
- Retrieve names of employees whose sex are not the same with any dependent or with no dependent.
data:image/s3,"s3://crabby-images/04f2a/04f2a8a4be8944512bf911c85a5585f2cb222e42" alt="image"
- Retrieve names of employees whose salary are greater than salary of all the employees with age < 25.
data:image/s3,"s3://crabby-images/b0872/b0872b29d072dd6b78be5381f6dc20cba1430f43" alt="image"
- Question: Suppose that age of all employees are >= 25; Then, what is the query result?
Division Query : NOT EXISTS and EXCEPT
- Retrieve name of each employee who work on all the projects controlled by department 5.
data:image/s3,"s3://crabby-images/5820b/5820bb1a12845b1981cb03e48cf7b2ed99434144" alt="image"
Aggregate Functions
댓글남기기