Overview

Dataset statistics

Number of variables: 6
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 KiB
Average record size in memory: 55.4 B

Variable types

Numeric: 3
Text: 2
DateTime: 1

Dataset

Description: Buildings subject to asbestos surveys in Gochang-gun, Jeollabuk-do. Provides the building name, address, asbestos area, and survey date for each building.
URL: https://www.data.go.kr/data/3074513/fileData.do

Alerts

연면적(제곱미터) (total floor area, m²) is highly correlated overall with 석면(자재)면적(제곱미터) (asbestos material area, m²) [High correlation]
석면(자재)면적(제곱미터) is highly correlated overall with 연면적(제곱미터) [High correlation]
연번 (serial number) has unique values [Unique]
건물명 (building name) has unique values [Unique]
석면(자재)면적(제곱미터) has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 12:48:29.812997
Analysis finished: 2023-12-12 12:48:31.296340
Duration: 1.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
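
A report of this kind can be regenerated roughly as follows. This is a minimal sketch, not the invocation used for this report: the CSV path, the `cp949` encoding, the report title, and the helper name `build_profile` are all assumptions, since the report does not record how the file was loaded. Only `ProfileReport` and `to_file` come from ydata-profiling itself.

```python
from pathlib import Path


def build_profile(csv_path: str, out_path: str = "report.html") -> Path:
    """Profile a CSV file and write the HTML report to out_path."""
    # Imported lazily so this module can be loaded where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is CP949-encoded; change the encoding if it is not.
    df = pd.read_csv(csv_path, encoding="cp949", parse_dates=["조사일시"])
    report = ProfileReport(df, title="Gochang-gun asbestos survey buildings")
    report.to_file(out_path)
    return Path(out_path)
```

Parsing 조사일시 as a date is what would make it show up as a DateTime variable rather than Text.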

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15.5
Minimum: 1
Maximum: 30
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 1
5th percentile: 2.45
Q1: 8.25
Median: 15.5
Q3: 22.75
95th percentile: 28.55
Maximum: 30
Range: 29
Interquartile range (IQR): 14.5
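
The quantiles above are consistent with linear interpolation between neighbouring sorted values. The sketch below assumes 연번 is simply the integers 1 to 30, which the value counts and the sum of 465 suggest but the report does not state outright. The helper `percentile` is illustrative and is not the profiler's own code.

```python
def percentile(sorted_values, p):
    """Percentile for fraction p (0..1), interpolating linearly between ranks."""
    pos = p * (len(sorted_values) - 1)
    lower = int(pos)
    frac = pos - lower
    if lower + 1 >= len(sorted_values):
        return float(sorted_values[-1])
    step = sorted_values[lower + 1] - sorted_values[lower]
    return sorted_values[lower] + frac * step


serial = list(range(1, 31))  # assumed contents of 연번
q05, q1, q3, q95 = (percentile(serial, p) for p in (0.05, 0.25, 0.75, 0.95))
iqr = q3 - q1
value_range = serial[-1] - serial[0]
print(round(q05, 2), q1, q3, round(q95, 2), iqr, value_range)
```

Under that assumption the sketch gives Q1 = 8.25, Q3 = 22.75, and an IQR of 14.5, matching the reported figures.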

Descriptive statistics

Standard deviation: 8.8034084
Coefficient of variation (CV): 0.56796183
Kurtosis: -1.2
Mean: 15.5
Median Absolute Deviation (MAD): 7.5
Skewness: 0
Sum: 465
Variance: 77.5
Monotonicity: Strictly increasing
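
Most of these figures can be rechecked with the Python standard library alone. As a hedged sketch, the code below again assumes 연번 is the integers 1 to 30. It uses the sample variance (n - 1 denominator), which is what reproduces the reported 77.5.

```python
import statistics

serial = list(range(1, 31))  # assumed contents of 연번

total = sum(serial)
mean = statistics.mean(serial)
variance = statistics.variance(serial)  # sample variance, n - 1 denominator
std = statistics.stdev(serial)
cv = std / mean                         # coefficient of variation
median = statistics.median(serial)
# Median absolute deviation: median distance from the median.
mad = statistics.median(abs(x - median) for x in serial)

print(total, mean, variance, round(std, 7), round(cv, 8), mad)
```

The CV is just the standard deviation divided by the mean, so it carries no unit and can be compared across the numeric variables in this report.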
Histogram with fixed size bins (bins=30)
Common values
Value | Count | Frequency (%)
1 | 1 | 3.3%
17 | 1 | 3.3%
30 | 1 | 3.3%
29 | 1 | 3.3%
28 | 1 | 3.3%
27 | 1 | 3.3%
26 | 1 | 3.3%
25 | 1 | 3.3%
24 | 1 | 3.3%
23 | 1 | 3.3%
Other values (20) | 20 | 66.7%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 3.3%
2 | 1 | 3.3%
3 | 1 | 3.3%
4 | 1 | 3.3%
5 | 1 | 3.3%
6 | 1 | 3.3%
7 | 1 | 3.3%
8 | 1 | 3.3%
9 | 1 | 3.3%
10 | 1 | 3.3%

Maximum 10 values
Value | Count | Frequency (%)
30 | 1 | 3.3%
29 | 1 | 3.3%
28 | 1 | 3.3%
27 | 1 | 3.3%
26 | 1 | 3.3%
25 | 1 | 3.3%
24 | 1 | 3.3%
23 | 1 | 3.3%
22 | 1 | 3.3%
21 | 1 | 3.3%

건물명
Text

UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 21
Median length: 12.5
Mean length: 10.1
Min length: 4

Characters and Unicode

Total characters: 303
Distinct characters: 107
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
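
As a small illustration of that idea, the sketch below counts characters by Unicode general category using the standard `unicodedata` module. The input string is the second sample row of 건물명 from this report. The helper name `category_counts` is illustrative only.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of text by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# Second sample row of 건물명, copied from this report.
counts = category_counts("고창부안축협 축사(주7동)")
print(counts)
```

The two-letter codes map onto the category names used in this report: `Lo` is Other Letter (Hangul syllables), `Zs` is Space Separator, `Nd` is Decimal Number, `Ps` is Open Punctuation, and `Pe` is Close Punctuation.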

Unique

Unique: 30
Unique (%): 100.0%

Sample

1st row: 강호 우성쇼핑
2nd row: 고창부안축협 축사(주7동)
3rd row: 고창부안축협 축사(주6동)
4th row: 농어촌폐기물종합처리장 관리동
5th row: 문수산 터널관리동

Most occurring words
Value | Count | Frequency (%)
창고 | 4 | 7.3%
선운산농협 | 4 | 7.3%
고창부안축협 | 3 | 5.5%
고창처리장 | 3 | 5.5%
고창군청 | 2 | 3.6%
흥덕농협 | 2 | 3.6%
본관 | 1 | 1.8%
갑평길 | 1 | 1.8%
창고1 | 1 | 1.8%
공음면 | 1 | 1.8%
Other values (33) | 33 | 60.0%

Most occurring characters

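Value | Count | Frequency (%)
(space) | 25 | 8.3%
[character missing] | 18 | 5.9%
[character missing] | 18 | 5.9%
[character missing] | 11 | 3.6%
[character missing] | 9 | 3.0%
[character missing] | 9 | 3.0%
[character missing] | 8 | 2.6%
[character missing] | 8 | 2.6%
[character missing] | 8 | 2.6%
( | 6 | 2.0%
Other values (97) | 183 | 60.4%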

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 259 | 85.5%
Space Separator | 25 | 8.3%
Decimal Number | 7 | 2.3%
Open Punctuation | 6 | 2.0%
Close Punctuation | 6 | 2.0%

Most frequent character per category

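Other Letter
Value | Count | Frequency (%)
[character missing] | 18 | 6.9%
[character missing] | 18 | 6.9%
[character missing] | 11 | 4.2%
[character missing] | 9 | 3.5%
[character missing] | 9 | 3.5%
[character missing] | 8 | 3.1%
[character missing] | 8 | 3.1%
[character missing] | 8 | 3.1%
[character missing] | 5 | 1.9%
[character missing] | 5 | 1.9%
Other values (88) | 160 | 61.8%

Decimal Number
Value | Count | Frequency (%)
7 | 2 | 28.6%
2 | 1 | 14.3%
1 | 1 | 14.3%
6 | 1 | 14.3%
3 | 1 | 14.3%
5 | 1 | 14.3%

Space Separator
Value | Count | Frequency (%)
(space) | 25 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 6 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 6 | 100.0%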

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 259 | 85.5%
Common | 44 | 14.5%

Most frequent character per script

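Hangul
Value | Count | Frequency (%)
[character missing] | 18 | 6.9%
[character missing] | 18 | 6.9%
[character missing] | 11 | 4.2%
[character missing] | 9 | 3.5%
[character missing] | 9 | 3.5%
[character missing] | 8 | 3.1%
[character missing] | 8 | 3.1%
[character missing] | 8 | 3.1%
[character missing] | 5 | 1.9%
[character missing] | 5 | 1.9%
Other values (88) | 160 | 61.8%

Common
Value | Count | Frequency (%)
(space) | 25 | 56.8%
( | 6 | 13.6%
) | 6 | 13.6%
7 | 2 | 4.5%
2 | 1 | 2.3%
1 | 1 | 2.3%
6 | 1 | 2.3%
3 | 1 | 2.3%
5 | 1 | 2.3%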

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 259 | 85.5%
ASCII | 44 | 14.5%

Most frequent character per block

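ASCII
Value | Count | Frequency (%)
(space) | 25 | 56.8%
( | 6 | 13.6%
) | 6 | 13.6%
7 | 2 | 4.5%
2 | 1 | 2.3%
1 | 1 | 2.3%
6 | 1 | 2.3%
3 | 1 | 2.3%
5 | 1 | 2.3%

Hangul
Value | Count | Frequency (%)
[character missing] | 18 | 6.9%
[character missing] | 18 | 6.9%
[character missing] | 11 | 4.2%
[character missing] | 9 | 3.5%
[character missing] | 9 | 3.5%
[character missing] | 8 | 3.1%
[character missing] | 8 | 3.1%
[character missing] | 8 | 3.1%
[character missing] | 5 | 1.9%
[character missing] | 5 | 1.9%
Other values (88) | 160 | 61.8%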

소재지
Text

Distinct: 27
Distinct (%): 90.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 25
Median length: 23
Mean length: 21.4
Min length: 19

Characters and Unicode

Total characters: 642
Distinct characters: 80
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 25
Unique (%): 83.3%

Sample

1st row: 전라북도 고창군 고창읍 중앙로 232
2nd row: 전라북도 고창군 흥덕면 부안로 423
3rd row: 전라북도 고창군 흥덕면 부안로 423
4th row: 전라북도 고창군 아산면 인천강변로 201-95
5th row: 전라북도 고창군 고수면 가협길 26-66

Most occurring words
Value | Count | Frequency (%)
전라북도 | 30 | 20.0%
고창군 | 30 | 20.0%
고창읍 | 10 | 6.7%
흥덕면 | 4 | 2.7%
공음면 | 3 | 2.0%
중거리당산로 | 3 | 2.0%
아산면 | 3 | 2.0%
중앙로 | 3 | 2.0%
부안로 | 3 | 2.0%
459-20 | 3 | 2.0%
Other values (50) | 58 | 38.7%

Most occurring characters

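Value | Count | Frequency (%)
(space) | 120 | 18.7%
[character missing] | 42 | 6.5%
[character missing] | 40 | 6.2%
[character missing] | 31 | 4.8%
[character missing] | 30 | 4.7%
[character missing] | 30 | 4.7%
[character missing] | 30 | 4.7%
[character missing] | 30 | 4.7%
[character missing] | 23 | 3.6%
[character missing] | 20 | 3.1%
Other values (70) | 246 | 38.3%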

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 416 | 64.8%
Space Separator | 120 | 18.7%
Decimal Number | 96 | 15.0%
Dash Punctuation | 10 | 1.6%

Most frequent character per category

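Other Letter
Value | Count | Frequency (%)
[character missing] | 42 | 10.1%
[character missing] | 40 | 9.6%
[character missing] | 31 | 7.5%
[character missing] | 30 | 7.2%
[character missing] | 30 | 7.2%
[character missing] | 30 | 7.2%
[character missing] | 30 | 7.2%
[character missing] | 23 | 5.5%
[character missing] | 20 | 4.8%
[character missing] | 10 | 2.4%
Other values (58) | 130 | 31.2%

Decimal Number
Value | Count | Frequency (%)
1 | 15 | 15.6%
2 | 14 | 14.6%
4 | 12 | 12.5%
5 | 12 | 12.5%
9 | 9 | 9.4%
6 | 9 | 9.4%
3 | 7 | 7.3%
0 | 7 | 7.3%
7 | 6 | 6.2%
8 | 5 | 5.2%

Space Separator
Value | Count | Frequency (%)
(space) | 120 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 10 | 100.0%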

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 416 | 64.8%
Common | 226 | 35.2%

Most frequent character per script

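Hangul
Value | Count | Frequency (%)
[character missing] | 42 | 10.1%
[character missing] | 40 | 9.6%
[character missing] | 31 | 7.5%
[character missing] | 30 | 7.2%
[character missing] | 30 | 7.2%
[character missing] | 30 | 7.2%
[character missing] | 30 | 7.2%
[character missing] | 23 | 5.5%
[character missing] | 20 | 4.8%
[character missing] | 10 | 2.4%
Other values (58) | 130 | 31.2%

Common
Value | Count | Frequency (%)
(space) | 120 | 53.1%
1 | 15 | 6.6%
2 | 14 | 6.2%
4 | 12 | 5.3%
5 | 12 | 5.3%
- | 10 | 4.4%
9 | 9 | 4.0%
6 | 9 | 4.0%
3 | 7 | 3.1%
0 | 7 | 3.1%
Other values (2) | 11 | 4.9%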

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 416 | 64.8%
ASCII | 226 | 35.2%

Most frequent character per block

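ASCII
Value | Count | Frequency (%)
(space) | 120 | 53.1%
1 | 15 | 6.6%
2 | 14 | 6.2%
4 | 12 | 5.3%
5 | 12 | 5.3%
- | 10 | 4.4%
9 | 9 | 4.0%
6 | 9 | 4.0%
3 | 7 | 3.1%
0 | 7 | 3.1%
Other values (2) | 11 | 4.9%

Hangul
Value | Count | Frequency (%)
[character missing] | 42 | 10.1%
[character missing] | 40 | 9.6%
[character missing] | 31 | 7.5%
[character missing] | 30 | 7.2%
[character missing] | 30 | 7.2%
[character missing] | 30 | 7.2%
[character missing] | 30 | 7.2%
[character missing] | 23 | 5.5%
[character missing] | 20 | 4.8%
[character missing] | 10 | 2.4%
Other values (58) | 130 | 31.2%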

연면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 29
Distinct (%): 96.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1962.2467
Minimum: 507.8
Maximum: 7660.34
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 507.8
5th percentile: 610.5
Q1: 856.0075
Median: 1363.27
Q3: 2257.75
95th percentile: 6512.34
Maximum: 7660.34
Range: 7152.54
Interquartile range (IQR): 1401.7425

Descriptive statistics

Standard deviation: 1851.4785
Coefficient of variation (CV): 0.94355032
Kurtosis: 4.7385014
Mean: 1962.2467
Median Absolute Deviation (MAD): 598.075
Skewness: 2.2312057
Sum: 58867.4
Variance: 3427972.5
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=29)
Common values
Value | Count | Frequency (%)
660.0 | 2 | 6.7%
5122.71 | 1 | 3.3%
835.0 | 1 | 3.3%
7649.31 | 1 | 3.3%
776.48 | 1 | 3.3%
2348.22 | 1 | 3.3%
3552.0 | 1 | 3.3%
985.56 | 1 | 3.3%
1682.87 | 1 | 3.3%
1733.43 | 1 | 3.3%
Other values (19) | 19 | 63.3%

Minimum 10 values
Value | Count | Frequency (%)
507.8 | 1 | 3.3%
570.0 | 1 | 3.3%
660.0 | 2 | 6.7%
691.31 | 1 | 3.3%
753.91 | 1 | 3.3%
776.48 | 1 | 3.3%
835.0 | 1 | 3.3%
919.03 | 1 | 3.3%
924.65 | 1 | 3.3%
934.93 | 1 | 3.3%

Maximum 10 values
Value | Count | Frequency (%)
7660.34 | 1 | 3.3%
7649.31 | 1 | 3.3%
5122.71 | 1 | 3.3%
3552.0 | 1 | 3.3%
3213.99 | 1 | 3.3%
2348.22 | 1 | 3.3%
2344.94 | 1 | 3.3%
2292.4 | 1 | 3.3%
2153.8 | 1 | 3.3%
1753.2 | 1 | 3.3%

석면(자재)면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 672.45967
Minimum: 68.51
Maximum: 2074.96
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 68.51
5th percentile: 116.28
Q1: 306.275
Median: 574.44
Q3: 914.3625
95th percentile: 1313.124
Maximum: 2074.96
Range: 2006.45
Interquartile range (IQR): 608.0875

Descriptive statistics

Standard deviation: 460.23923
Coefficient of variation (CV): 0.68441165
Kurtosis: 1.3468728
Mean: 672.45967
Median Absolute Deviation (MAD): 300.54
Skewness: 1.0144766
Sum: 20173.79
Variance: 211820.15
Monotonicity: Not monotonic
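
The descriptive statistics for the two area variables can be cross-checked against each other without the raw data, because mean = sum / n, variance = std², and CV = std / mean. The sketch below copies the figures from this report and uses n = 30 from the dataset statistics. The helper `is_consistent` and the tolerance of 1e-6 are illustrative choices that allow for the rounding in the printed values.

```python
import math

# Figures copied from the descriptive statistics in this report.
reported = {
    "연면적(제곱미터)": {
        "n": 30, "sum": 58867.4, "mean": 1962.2467,
        "std": 1851.4785, "variance": 3427972.5, "cv": 0.94355032,
    },
    "석면(자재)면적(제곱미터)": {
        "n": 30, "sum": 20173.79, "mean": 672.45967,
        "std": 460.23923, "variance": 211820.15, "cv": 0.68441165,
    },
}


def is_consistent(stats, rel_tol=1e-6):
    """Check mean, variance, and CV against the identities that link them."""
    return (
        math.isclose(stats["sum"] / stats["n"], stats["mean"], rel_tol=rel_tol)
        and math.isclose(stats["std"] ** 2, stats["variance"], rel_tol=rel_tol)
        and math.isclose(stats["std"] / stats["mean"], stats["cv"], rel_tol=rel_tol)
    )


results = {name: is_consistent(stats) for name, stats in reported.items()}
print(results)
```

A check like this catches transcription errors in a copied report. It says nothing about whether the underlying measurements are correct.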
Histogram with fixed size bins (bins=30)
Common values
Value | Count | Frequency (%)
1094.19 | 1 | 3.3%
422.24 | 1 | 3.3%
1110.19 | 1 | 3.3%
707.21 | 1 | 3.3%
2074.96 | 1 | 3.3%
295.93 | 1 | 3.3%
824.94 | 1 | 3.3%
1277.33 | 1 | 3.3%
881.58 | 1 | 3.3%
454.95 | 1 | 3.3%
Other values (20) | 20 | 66.7%

Minimum 10 values
Value | Count | Frequency (%)
68.51 | 1 | 3.3%
81.0 | 1 | 3.3%
159.4 | 1 | 3.3%
208.04 | 1 | 3.3%
210.01 | 1 | 3.3%
248.6 | 1 | 3.3%
280.5 | 1 | 3.3%
295.93 | 1 | 3.3%
337.31 | 1 | 3.3%
372.27 | 1 | 3.3%

Maximum 10 values
Value | Count | Frequency (%)
2074.96 | 1 | 3.3%
1342.41 | 1 | 3.3%
1277.33 | 1 | 3.3%
1276.06 | 1 | 3.3%
1110.19 | 1 | 3.3%
1094.19 | 1 | 3.3%
1033.06 | 1 | 3.3%
925.29 | 1 | 3.3%
881.58 | 1 | 3.3%
829.5 | 1 | 3.3%

조사일시
DateTime

Distinct: 20
Distinct (%): 66.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2013-04-05 00:00:00
Maximum: 2019-08-05 00:00:00
Histogram with fixed size bins (bins=20)

Interactions

[Figure: interaction plots, 9 panels]

Correlations

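Variable | 연번 | 건물명 | 소재지 | 연면적(제곱미터) | 석면(자재)면적(제곱미터) | 조사일시
연번 | 1.000 | 1.000 | 0.962 | 0.000 | 0.299 | 0.827
건물명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소재지 | 0.962 | 1.000 | 1.000 | 0.982 | 0.957 | 1.000
연면적(제곱미터) | 0.000 | 1.000 | 0.982 | 1.000 | 0.547 | 0.963
석면(자재)면적(제곱미터) | 0.299 | 1.000 | 0.957 | 0.547 | 1.000 | 0.870
조사일시 | 0.827 | 1.000 | 1.000 | 0.963 | 0.870 | 1.000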

Variable | 연번 | 연면적(제곱미터) | 석면(자재)면적(제곱미터)
연번 | 1.000 | 0.102 | 0.153
연면적(제곱미터) | 0.102 | 1.000 | 0.553
석면(자재)면적(제곱미터) | 0.153 | 0.553 | 1.000
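
The High correlation alert in the Alerts section can be related to the numeric-only matrix above with a simple threshold test. The sketch below is illustrative only: the threshold of 0.5 is an assumption (the report does not state the value it used), and `highly_correlated` is a hypothetical helper, not part of ydata-profiling.

```python
from itertools import combinations

# Numeric-only correlation matrix, copied from this report.
columns = ["연번", "연면적(제곱미터)", "석면(자재)면적(제곱미터)"]
matrix = [
    [1.000, 0.102, 0.153],
    [0.102, 1.000, 0.553],
    [0.153, 0.553, 1.000],
]

THRESHOLD = 0.5  # assumption: the report does not state its threshold


def highly_correlated(names, values, threshold):
    """Return (name_a, name_b, coefficient) for off-diagonal pairs at or above threshold."""
    return [
        (names[i], names[j], values[i][j])
        for i, j in combinations(range(len(names)), 2)
        if abs(values[i][j]) >= threshold
    ]


flagged = highly_correlated(columns, matrix, THRESHOLD)
print(flagged)
```

With this assumed threshold only the 연면적(제곱미터) and 석면(자재)면적(제곱미터) pair is flagged, which agrees with the two High correlation alerts listed earlier.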

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 연번 | 건물명 | 소재지 | 연면적(제곱미터) | 석면(자재)면적(제곱미터) | 조사일시
0 | 1 | 강호 우성쇼핑 | 전라북도 고창군 고창읍 중앙로 232 | 5122.71 | 1094.19 | 2015-06-30
1 | 2 | 고창부안축협 축사(주7동) | 전라북도 고창군 흥덕면 부안로 423 | 1197.0 | 1342.41 | 2014-03-25
2 | 3 | 고창부안축협 축사(주6동) | 전라북도 고창군 흥덕면 부안로 423 | 1753.2 | 661.97 | 2014-03-25
3 | 4 | 농어촌폐기물종합처리장 관리동 | 전라북도 고창군 아산면 인천강변로 201-95 | 1671.72 | 210.01 | 2014-04-02
4 | 5 | 문수산 터널관리동 | 전라북도 고창군 고수면 가협길 26-66 | 919.03 | 925.29 | 2013-09-11
5 | 6 | 한국전력공사 고창지사 | 전라북도 고창군 고창읍 중거리당산로 46 | 2153.8 | 1033.06 | 2013-07-02
6 | 7 | 메디케어요양병원 | 전라북도 고창군 성내면 이재로 81-9 | 2344.94 | 829.5 | 2014-01-23
7 | 8 | 고창행복원 전관 | 전라북도 고창군 고창읍 모양성로 116-13 | 924.65 | 372.27 | 2014-07-22
8 | 9 | 흥덕농협 본점 | 전라북도 고창군 흥덕면 흥덕초등길 3 | 753.91 | 337.31 | 2014-04-26
9 | 10 | 흥덕농협 성내지점 | 전라북도 고창군 성내면 시기1길 57 | 934.93 | 248.6 | 2014-04-24

Last rows
Index | 연번 | 건물명 | 소재지 | 연면적(제곱미터) | 석면(자재)면적(제곱미터) | 조사일시
20 | 21 | 농업기술센터 | 전라북도 고창군 고창읍 중거리당산로 94 | 3213.99 | 1276.06 | 2014-03-27
21 | 22 | 고창군청 | 전라북도 고창군 고창읍 중앙로 245 | 7660.34 | 800.58 | 2014-04-08
22 | 23 | 동학홍보관 | 전라북도 고창군 공음면 왕제산로 502 | 1047.51 | 454.95 | 2014-04-02
23 | 24 | 체류형농업창업지원센터(구 복분자연구소) | 전라북도 고창군 부안면 복분자로 568 | 1733.43 | 881.58 | 2014-03-14
24 | 25 | 고창군청 제2청사 | 전라북도 고창군 고창읍 중거리당산로 74-12 | 1682.87 | 1277.33 | 2014-03-24
25 | 26 | 추모의집 | 전라북도 고창군 부안면 복분자로 559 | 985.56 | 824.94 | 2014-03-14
26 | 27 | 고인돌휴게소 하행 | 전라북도 고창군 신림면 서해안고속도로 81 | 3552.0 | 295.93 | 2014-03-28
27 | 28 | 고창군산림조합 연수원 | 전라북도 고창군 흥덕면 부안로 446-21 | 2348.22 | 2074.96 | 2014-04-11
28 | 29 | 고창효자노인병원 | 전라북도 고창군 무장면 감덕길 19 | 776.48 | 707.21 | 2014-03-31
29 | 30 | 수산기술연구소 | 전라북도 고창군 해리면 명사십리로 817 | 7649.31 | 1110.19 | 2013-04-05