Overview

Dataset statistics

Number of variables: 3
Number of observations: 23
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 707.0 B
Average record size in memory: 30.7 B

Variable types

Categorical: 1
Text: 1
Numeric: 1

Dataset

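Description: This file lists the fees for the test items (air-component tests and noise and vibration measurements) posted on the website of the Jeollanam-do Institute of Health and Environment (전라남도 보건환경연구원).
Author: Jeollanam-do (전라남도)
URL: https://www.data.go.kr/data/15118191/fileData.do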

Alerts

시험항목 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 03:35:15.726603
Analysis finished: 2023-12-12 03:35:16.043121
Duration: 0.32 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
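
A report of this kind can be regenerated roughly as follows. This is a minimal sketch, not the exact procedure used for this report: the function name `build_report`, the file names, the `cp949` encoding, and the report title are all assumptions. Only `pandas.read_csv`, `ProfileReport`, and `ProfileReport.to_file` are library calls.

```python
def build_report(csv_path, html_path="report.html", encoding="cp949"):
    """Profile a CSV file and write the HTML report to html_path."""
    # Imported inside the function so this module loads even where
    # pandas or ydata-profiling is not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the CSV is CP949-encoded; pass encoding="utf-8" if it is not.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Hypothetical title; any string works.
    report = ProfileReport(df, title="Test fee profile")
    report.to_file(html_path)
    return report


# Usage (hypothetical file name):
# build_report("fees.csv")
```

Timings and memory figures will differ between runs and library versions, so only the statistics themselves should be expected to match.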

Variables

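구 분 (category)
Categorical

Distinct: 5
Distinct (%): 21.7%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Value | Count
가스성분시험 (gas component test) | 16
입자상물질 성분시험 (particulate matter component test) | 4
유류 시험 (oil test) | 1
환경소음 (environmental noise) | 1
진동측정 (vibration measurement) | 1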

Length

Max length: 10
Median length: 6
Mean length: 6.4782609
Min length: 4
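
The length figures can be checked from the category counts reported under Common Values. The sketch below assumes length means the number of characters per observation, spaces included; the variable names are illustrative.

```python
from statistics import mean, median

# Category values and their counts, taken from the Common Values table.
counts = {
    "가스성분시험": 16,
    "입자상물질 성분시험": 4,
    "유류 시험": 1,
    "환경소음": 1,
    "진동측정": 1,
}

# One length per observation (23 in total); a space counts as a character.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 7),
    "min": min(lengths),
}
print(length_stats)
```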

Unique

Unique: 3
Unique (%): 13.0%
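
Distinct and Unique measure different things here: five different values occur, but only three of them occur exactly once. A short sketch, assuming that reading of the two terms and using the category counts from this report:

```python
from collections import Counter

# The 23 observations of the categorical column, rebuilt from the reported counts.
observations = (
    ["가스성분시험"] * 16
    + ["입자상물질 성분시험"] * 4
    + ["유류 시험", "환경소음", "진동측정"]
)

value_counts = Counter(observations)
n = len(observations)

distinct = len(value_counts)                                 # different values
unique = sum(1 for c in value_counts.values() if c == 1)     # values seen once

distinct_pct = round(100 * distinct / n, 1)
unique_pct = round(100 * unique / n, 1)
print(distinct, distinct_pct, unique, unique_pct)
```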

Sample

1st row: 가스성분시험
2nd row: 가스성분시험
3rd row: 가스성분시험
4th row: 가스성분시험
5th row: 가스성분시험

Common Values

Value | Count | Frequency (%)
가스성분시험 | 16 | 69.6%
입자상물질 성분시험 | 4 | 17.4%
유류 시험 | 1 | 4.3%
환경소음 | 1 | 4.3%
진동측정 | 1 | 4.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words (category values split on whitespace)
Value | Count | Frequency (%)
가스성분시험 | 16 | 57.1%
입자상물질 | 4 | 14.3%
성분시험 | 4 | 14.3%
유류 | 1 | 3.6%
시험 | 1 | 3.6%
환경소음 | 1 | 3.6%
진동측정 | 1 | 3.6%
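
The counts in this word-level list sum to 28 rather than 23, which is consistent with each category value being split on whitespace before counting. The sketch below reproduces the percentages under that assumption; it does not claim to be the profiler's own tokenizer.

```python
from collections import Counter

# Category values and their counts, taken from the Common Values table.
counts = {
    "가스성분시험": 16,
    "입자상물질 성분시험": 4,
    "유류 시험": 1,
    "환경소음": 1,
    "진동측정": 1,
}

# Split every value on whitespace and weight each word by the value's count.
word_counts = Counter()
for value, n in counts.items():
    for word in value.split():
        word_counts[word] += n

total_words = sum(word_counts.values())
word_share = {w: round(100 * c / total_words, 1) for w, c in word_counts.items()}
print(total_words, word_share)
```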

시험항목 (test item)
Text

UNIQUE

Distinct: 23
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

Max length: 14
Median length: 10
Mean length: 4.3043478
Min length: 2

Characters and Unicode

Total characters: 99
Distinct characters: 56
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
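
As an illustration of these character properties, Python's standard `unicodedata` module returns the general category of each code point. The sketch below classifies four 시험항목 values that appear in this report's sample rows; it covers only those four values, not the whole column. The category codes map to the names used in this report as follows: `Lo` is Other Letter, `Zs` is Space Separator, `Ps` is Open Punctuation, and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter

# Four values of the text column taken from the sample rows of this report.
samples = [
    "염 소",
    "유류 중의 황 함유량 분석",
    "기록측정(소음측정)",
    "기록측정(진동측정)",
]

# General category of every character across the four values.
category_counts = Counter(
    unicodedata.category(ch) for text in samples for ch in text
)

# Characters that fall in the Hangul Syllables block (U+AC00 to U+D7A3).
hangul_chars = sum(
    1 for text in samples for ch in text if "\uac00" <= ch <= "\ud7a3"
)
print(dict(category_counts), hangul_chars)
```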

Unique

Unique: 23
Unique (%): 100.0%

Sample

1st row: 암모니아 (ammonia)
2nd row: 일산화탄소 (carbon monoxide)
3rd row: 염화수소 (hydrogen chloride)
4th row: 염 소 (chlorine)
5th row: 황산화물 (sulfur oxides)

Words
Value | Count | Frequency (%)
암모니아 | 1 | 3.6%
일산화탄소 | 1 | 3.6%
기록측정(소음측정 | 1 | 3.6%
분석 | 1 | 3.6%
함유량 | 1 | 3.6%
(not recoverable) | 1 | 3.6%
중의 | 1 | 3.6%
유류 | 1 | 3.6%
매연 | 1 | 3.6%
중금속 | 1 | 3.6%
Other values (18) | 18 | 64.3%

Most occurring characters

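Value | Count | Frequency (%)
(not recoverable) | 9 | 9.1%
(not recoverable) | 6 | 6.1%
(space) | 5 | 5.1%
(not recoverable) | 4 | 4.0%
(not recoverable) | 4 | 4.0%
(not recoverable) | 4 | 4.0%
(not recoverable) | 4 | 4.0%
(not recoverable) | 3 | 3.0%
(not recoverable) | 2 | 2.0%
(not recoverable) | 2 | 2.0%
Other values (46) | 56 | 56.6%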

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 90 | 90.9%
Space Separator | 5 | 5.1%
Open Punctuation | 2 | 2.0%
Close Punctuation | 2 | 2.0%

Most frequent character per category

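Other Letter
Value | Count | Frequency (%)
(not recoverable) | 9 | 10.0%
(not recoverable) | 6 | 6.7%
(not recoverable) | 4 | 4.4%
(not recoverable) | 4 | 4.4%
(not recoverable) | 4 | 4.4%
(not recoverable) | 4 | 4.4%
(not recoverable) | 3 | 3.3%
(not recoverable) | 2 | 2.2%
(not recoverable) | 2 | 2.2%
(not recoverable) | 2 | 2.2%
Other values (43) | 50 | 55.6%

Space Separator
Value | Count | Frequency (%)
(space) | 5 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 2 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 2 | 100.0%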

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 90 | 90.9%
Common | 9 | 9.1%

Most frequent character per script

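Hangul
Value | Count | Frequency (%)
(not recoverable) | 9 | 10.0%
(not recoverable) | 6 | 6.7%
(not recoverable) | 4 | 4.4%
(not recoverable) | 4 | 4.4%
(not recoverable) | 4 | 4.4%
(not recoverable) | 4 | 4.4%
(not recoverable) | 3 | 3.3%
(not recoverable) | 2 | 2.2%
(not recoverable) | 2 | 2.2%
(not recoverable) | 2 | 2.2%
Other values (43) | 50 | 55.6%

Common
Value | Count | Frequency (%)
(space) | 5 | 55.6%
( | 2 | 22.2%
) | 2 | 22.2%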

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 90 | 90.9%
ASCII | 9 | 9.1%

Most frequent character per block

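Hangul
Value | Count | Frequency (%)
(not recoverable) | 9 | 10.0%
(not recoverable) | 6 | 6.7%
(not recoverable) | 4 | 4.4%
(not recoverable) | 4 | 4.4%
(not recoverable) | 4 | 4.4%
(not recoverable) | 4 | 4.4%
(not recoverable) | 3 | 3.3%
(not recoverable) | 2 | 2.2%
(not recoverable) | 2 | 2.2%
(not recoverable) | 2 | 2.2%
Other values (43) | 50 | 55.6%

ASCII
Value | Count | Frequency (%)
(space) | 5 | 55.6%
( | 2 | 22.2%
) | 2 | 22.2%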

수수료 (fee)
Real number (ℝ)

Distinct: 21
Distinct (%): 91.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19269.565
Minimum: 3400
Maximum: 47300
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 339.0 B

Quantile statistics

Minimum: 3400
5-th percentile: 3610
Q1: 16900
median: 18300
Q3: 22700
95-th percentile: 32700
Maximum: 47300
Range: 43900
Interquartile range (IQR): 5800
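
These quantiles are consistent with linear interpolation between order statistics. The sketch below checks that using only the standard library. The 23 fee values are assembled from this report's value tables (the ten smallest and ten largest values, plus 18400 from the sample rows), which is an assumption about the underlying data, though the values do sum to the reported 443200. `method="inclusive"` is the standard-library option that interpolates at position p × (n − 1).

```python
from statistics import quantiles

# Fee values reconstructed from the value tables in this report (assumption).
fees = sorted([
    3400, 3400, 5500, 6000, 12000, 16700, 17100, 17200,
    17400, 17400, 17700, 18300, 18400, 19100, 20700, 22000,
    22300, 23100, 25500, 29700, 30000, 33000, 47300,
])

# Quartiles by linear interpolation between neighbouring sorted values.
q1, q2, q3 = quantiles(fees, n=4, method="inclusive")

# Cutting into 20 groups yields the 5th percentile first and the 95th last.
twentieths = quantiles(fees, n=20, method="inclusive")
p05, p95 = twentieths[0], twentieths[-1]

fee_range = max(fees) - min(fees)
iqr = q3 - q1
print(p05, q1, q2, q3, p95, fee_range, iqr)
```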

Descriptive statistics

Standard deviation: 10043.833
Coefficient of variation (CV): 0.52122778
Kurtosis: 1.6885529
Mean: 19269.565
Median Absolute Deviation (MAD): 4000
Skewness: 0.68554571
Sum: 443200
Variance: 1.0087858 × 10^8
Monotonicity: Not monotonic
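
Several of these figures follow directly from one another: the variance is the square of the standard deviation, the CV is the standard deviation divided by the mean, and the MAD is the median of the absolute deviations from the median. A minimal check, assuming the 23 fee values reconstructed from this report's value tables and the sample (n − 1) form of the variance:

```python
from statistics import mean, median, stdev, variance

# Fee values reconstructed from the value tables in this report (assumption).
fees = [
    3400, 3400, 5500, 6000, 12000, 16700, 17100, 17200,
    17400, 17400, 17700, 18300, 18400, 19100, 20700, 22000,
    22300, 23100, 25500, 29700, 30000, 33000, 47300,
]

total = sum(fees)
avg = mean(fees)
sd = stdev(fees)        # sample standard deviation (divides by n - 1)
var = variance(fees)    # sample variance
cv = sd / avg           # coefficient of variation

# Median absolute deviation: median distance from the median.
center = median(fees)
mad = median(abs(x - center) for x in fees)

print(total, round(avg, 3), round(sd, 3), round(cv, 8), mad, f"{var:.7e}")
```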
[Figure: Histogram with fixed size bins (bins=21)]

Value | Count | Frequency (%)
17400 | 2 | 8.7%
3400 | 2 | 8.7%
6000 | 1 | 4.3%
33000 | 1 | 4.3%
5500 | 1 | 4.3%
20700 | 1 | 4.3%
22000 | 1 | 4.3%
12000 | 1 | 4.3%
29700 | 1 | 4.3%
25500 | 1 | 4.3%
Other values (11) | 11 | 47.8%

Minimum 10 values
Value | Count | Frequency (%)
3400 | 2 | 8.7%
5500 | 1 | 4.3%
6000 | 1 | 4.3%
12000 | 1 | 4.3%
16700 | 1 | 4.3%
17100 | 1 | 4.3%
17200 | 1 | 4.3%
17400 | 2 | 8.7%
17700 | 1 | 4.3%
18300 | 1 | 4.3%

Maximum 10 values
Value | Count | Frequency (%)
47300 | 1 | 4.3%
33000 | 1 | 4.3%
30000 | 1 | 4.3%
29700 | 1 | 4.3%
25500 | 1 | 4.3%
23100 | 1 | 4.3%
22300 | 1 | 4.3%
22000 | 1 | 4.3%
20700 | 1 | 4.3%
19100 | 1 | 4.3%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]
Variable | 구 분 | 시험항목 | 수수료
구 분 | 1.000 | 1.000 | 0.532
시험항목 | 1.000 | 1.000 | 1.000
수수료 | 0.532 | 1.000 | 1.000

[Figure: correlation heatmap]
Variable | 수수료 | 구 분
수수료 | 1.000 | 0.217
구 분 | 0.217 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

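First rows
Index | 구 분 | 시험항목 | 수수료
0 | 가스성분시험 (gas component test) | 암모니아 (ammonia) | 17400
1 | 가스성분시험 (gas component test) | 일산화탄소 (carbon monoxide) | 6000
2 | 가스성분시험 (gas component test) | 염화수소 (hydrogen chloride) | 17400
3 | 가스성분시험 (gas component test) | 염 소 (chlorine) | 17200
4 | 가스성분시험 (gas component test) | 황산화물 (sulfur oxides) | 18300
5 | 가스성분시험 (gas component test) | 질소산화물 (nitrogen oxides) | 18400
6 | 가스성분시험 (gas component test) | 이황화탄소 (carbon disulfide) | 17700
7 | 가스성분시험 (gas component test) | 포름알데히드 (formaldehyde) | 23100
8 | 가스성분시험 (gas component test) | 황화수소 (hydrogen sulfide) | 16700
9 | 가스성분시험 (gas component test) | 불소 (fluorine) | 17100

Last rows
Index | 구 분 | 시험항목 | 수수료
13 | 가스성분시험 (gas component test) | 페놀 (phenol) | 19100
14 | 가스성분시험 (gas component test) | 비소 (arsenic) | 25500
15 | 가스성분시험 (gas component test) | 수은 (mercury) | 29700
16 | 입자상물질 성분시험 (particulate matter component test) | 먼지 (dust) | 12000
17 | 입자상물질 성분시험 (particulate matter component test) | 비산먼지 (fugitive dust) | 22000
18 | 입자상물질 성분시험 (particulate matter component test) | 중금속 (heavy metals) | 20700
19 | 입자상물질 성분시험 (particulate matter component test) | 매연 (smoke) | 5500
20 | 유류 시험 (oil test) | 유류 중의 황 함유량 분석 (analysis of sulfur content in oil) | 33000
21 | 환경소음 (environmental noise) | 기록측정(소음측정) (recorded measurement, noise) | 3400
22 | 진동측정 (vibration measurement) | 기록측정(진동측정) (recorded measurement, vibration) | 3400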