Overview

Dataset statistics

Number of variables6
Number of observations37
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory54.6 B

Variable types

Numeric3
Text3

Dataset

Description충청남도 서산시 기계설비서능 점검 대상 건축물 현황 데이터입니다. 항목명은 연번, 단지명, 도로명주소, 대표번호, 세대수, 연면적 등으로 이루어져 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=37&beforeMenuCd=DOM_000000201001001000&publicdatapk=15117224

Alerts

세대수 is highly overall correlated with 연면적(제곱미터)High correlation
연면적(제곱미터) is highly overall correlated with 세대수High correlation
연번 has unique valuesUnique
단지명 has unique valuesUnique
대표번호 has unique valuesUnique
연면적(제곱미터) has unique valuesUnique

Reproduction

Analysis started2024-01-09 22:51:53.761629
Analysis finished2024-01-09 22:51:54.856937
Duration1.1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19
Minimum1
Maximum37
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size465.0 B
2024-01-10T07:51:54.910871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.8
Q110
median19
Q328
95-th percentile35.2
Maximum37
Range36
Interquartile range (IQR)18

Descriptive statistics

Standard deviation10.824355
Coefficient of variation (CV)0.56970291
Kurtosis-1.2
Mean19
Median Absolute Deviation (MAD)9
Skewness0
Sum703
Variance117.16667
MonotonicityStrictly increasing
2024-01-10T07:51:55.019234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
1 1
 
2.7%
29 1
 
2.7%
22 1
 
2.7%
23 1
 
2.7%
24 1
 
2.7%
25 1
 
2.7%
26 1
 
2.7%
27 1
 
2.7%
28 1
 
2.7%
30 1
 
2.7%
Other values (27) 27
73.0%
ValueCountFrequency (%)
1 1
2.7%
2 1
2.7%
3 1
2.7%
4 1
2.7%
5 1
2.7%
6 1
2.7%
7 1
2.7%
8 1
2.7%
9 1
2.7%
10 1
2.7%
ValueCountFrequency (%)
37 1
2.7%
36 1
2.7%
35 1
2.7%
34 1
2.7%
33 1
2.7%
32 1
2.7%
31 1
2.7%
30 1
2.7%
29 1
2.7%
28 1
2.7%

단지명
Text

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size428.0 B
2024-01-10T07:51:55.213162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length8.1621622
Min length5

Characters and Unicode

Total characters302
Distinct characters110
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)100.0%

Sample

1st row동문삼성아파트
2nd row풍림아파트
3rd row석림주공3단지아파트
4th row부영아파트
5th row석림주공2단지아파트
ValueCountFrequency (%)
동문삼성아파트 1
 
2.4%
이안아파트 1
 
2.4%
대산한성필하우스 1
 
2.4%
힐스테이트아파트 1
 
2.4%
e편한세상서산예천 1
 
2.4%
e편한세상서산테크노밸리 1
 
2.4%
고운라피네 1
 
2.4%
동문동한성필하우스 1
 
2.4%
양우내안애퍼스트힐 1
 
2.4%
라온프라이빗 1
 
2.4%
Other values (31) 31
75.6%
2024-01-10T07:51:55.534503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23
 
7.6%
18
 
6.0%
18
 
6.0%
14
 
4.6%
9
 
3.0%
9
 
3.0%
8
 
2.6%
7
 
2.3%
6
 
2.0%
6
 
2.0%
Other values (100) 184
60.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 290
96.0%
Space Separator 4
 
1.3%
Decimal Number 4
 
1.3%
Lowercase Letter 2
 
0.7%
Uppercase Letter 1
 
0.3%
Dash Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
7.9%
18
 
6.2%
18
 
6.2%
14
 
4.8%
9
 
3.1%
9
 
3.1%
8
 
2.8%
7
 
2.4%
6
 
2.1%
6
 
2.1%
Other values (93) 172
59.3%
Decimal Number
ValueCountFrequency (%)
2 2
50.0%
3 1
25.0%
1 1
25.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
S 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 290
96.0%
Common 9
 
3.0%
Latin 3
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
7.9%
18
 
6.2%
18
 
6.2%
14
 
4.8%
9
 
3.1%
9
 
3.1%
8
 
2.8%
7
 
2.4%
6
 
2.1%
6
 
2.1%
Other values (93) 172
59.3%
Common
ValueCountFrequency (%)
4
44.4%
2 2
22.2%
- 1
 
11.1%
3 1
 
11.1%
1 1
 
11.1%
Latin
ValueCountFrequency (%)
e 2
66.7%
S 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 290
96.0%
ASCII 12
 
4.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
23
 
7.9%
18
 
6.2%
18
 
6.2%
14
 
4.8%
9
 
3.1%
9
 
3.1%
8
 
2.8%
7
 
2.4%
6
 
2.1%
6
 
2.1%
Other values (93) 172
59.3%
ASCII
ValueCountFrequency (%)
4
33.3%
e 2
16.7%
2 2
16.7%
S 1
 
8.3%
- 1
 
8.3%
3 1
 
8.3%
1 1
 
8.3%
Distinct36
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Memory size428.0 B
2024-01-10T07:51:55.717700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length16.918919
Min length11

Characters and Unicode

Total characters626
Distinct characters56
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)94.6%

Sample

1st row서산시 서령로 137 (동문동)
2nd row서산시 대산읍 충의로 1839-7
3rd row서산시 석림4로 83 (석림동)
4th row서산시 부춘1로 19 (읍내동)
5th row서산시 석림4로 83 (석림동)
ValueCountFrequency (%)
서산시 37
26.4%
성연면 7
 
5.0%
동문동 4
 
2.9%
성연3로 4
 
2.9%
예천동 4
 
2.9%
동서1로 3
 
2.1%
예천3로 3
 
2.1%
석림동 3
 
2.1%
서령로 3
 
2.1%
고운로 3
 
2.1%
Other values (57) 69
49.3%
2024-01-10T07:51:56.013813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
104
16.6%
44
 
7.0%
40
 
6.4%
37
 
5.9%
37
 
5.9%
33
 
5.3%
1 25
 
4.0%
( 23
 
3.7%
) 23
 
3.7%
3 23
 
3.7%
Other values (46) 237
37.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 337
53.8%
Decimal Number 130
 
20.8%
Space Separator 104
 
16.6%
Open Punctuation 23
 
3.7%
Close Punctuation 23
 
3.7%
Dash Punctuation 9
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
13.1%
40
11.9%
37
11.0%
37
11.0%
33
9.8%
15
 
4.5%
14
 
4.2%
13
 
3.9%
13
 
3.9%
10
 
3.0%
Other values (32) 81
24.0%
Decimal Number
ValueCountFrequency (%)
1 25
19.2%
3 23
17.7%
7 18
13.8%
2 14
10.8%
4 11
8.5%
6 11
8.5%
5 10
 
7.7%
9 7
 
5.4%
8 6
 
4.6%
0 5
 
3.8%
Space Separator
ValueCountFrequency (%)
104
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 337
53.8%
Common 289
46.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
13.1%
40
11.9%
37
11.0%
37
11.0%
33
9.8%
15
 
4.5%
14
 
4.2%
13
 
3.9%
13
 
3.9%
10
 
3.0%
Other values (32) 81
24.0%
Common
ValueCountFrequency (%)
104
36.0%
1 25
 
8.7%
( 23
 
8.0%
) 23
 
8.0%
3 23
 
8.0%
7 18
 
6.2%
2 14
 
4.8%
4 11
 
3.8%
6 11
 
3.8%
5 10
 
3.5%
Other values (4) 27
 
9.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 337
53.8%
ASCII 289
46.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
104
36.0%
1 25
 
8.7%
( 23
 
8.0%
) 23
 
8.0%
3 23
 
8.0%
7 18
 
6.2%
2 14
 
4.8%
4 11
 
3.8%
6 11
 
3.8%
5 10
 
3.5%
Other values (4) 27
 
9.3%
Hangul
ValueCountFrequency (%)
44
13.1%
40
11.9%
37
11.0%
37
11.0%
33
9.8%
15
 
4.5%
14
 
4.2%
13
 
3.9%
13
 
3.9%
10
 
3.0%
Other values (32) 81
24.0%

대표번호
Text

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size428.0 B
2024-01-10T07:51:56.195481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters444
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)100.0%

Sample

1st row041-669-0199
2nd row041-681-3577
3rd row041-665-4702
4th row041-665-2997
5th row041-665-5708
ValueCountFrequency (%)
041-669-0199 1
 
2.7%
041-664-6500 1
 
2.7%
041-665-1915 1
 
2.7%
041-681-9004 1
 
2.7%
041-662-4442 1
 
2.7%
041-666-2296 1
 
2.7%
041-681-7720 1
 
2.7%
041-666-1421 1
 
2.7%
041-662-1482 1
 
2.7%
041-666-0235 1
 
2.7%
Other values (27) 27
73.0%
2024-01-10T07:51:56.477413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 91
20.5%
- 74
16.7%
1 63
14.2%
0 56
12.6%
4 50
11.3%
5 28
 
6.3%
9 27
 
6.1%
2 19
 
4.3%
8 13
 
2.9%
3 12
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 370
83.3%
Dash Punctuation 74
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 91
24.6%
1 63
17.0%
0 56
15.1%
4 50
13.5%
5 28
 
7.6%
9 27
 
7.3%
2 19
 
5.1%
8 13
 
3.5%
3 12
 
3.2%
7 11
 
3.0%
Dash Punctuation
ValueCountFrequency (%)
- 74
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 444
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
6 91
20.5%
- 74
16.7%
1 63
14.2%
0 56
12.6%
4 50
11.3%
5 28
 
6.3%
9 27
 
6.1%
2 19
 
4.3%
8 13
 
2.9%
3 12
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 444
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 91
20.5%
- 74
16.7%
1 63
14.2%
0 56
12.6%
4 50
11.3%
5 28
 
6.3%
9 27
 
6.1%
2 19
 
4.3%
8 13
 
2.9%
3 12
 
2.7%

세대수
Real number (ℝ)

HIGH CORRELATION 

Distinct36
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean848.56757
Minimum536
Maximum1980
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size465.0 B
2024-01-10T07:51:56.586999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum536
5-th percentile557.4
Q1702
median832
Q3926
95-th percentile1177
Maximum1980
Range1444
Interquartile range (IQR)224

Descriptive statistics

Standard deviation263.18905
Coefficient of variation (CV)0.31015685
Kurtosis8.6418681
Mean848.56757
Median Absolute Deviation (MAD)111
Skewness2.2947905
Sum31397
Variance69268.474
MonotonicityNot monotonic
2024-01-10T07:51:56.688346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
1093 2
 
5.4%
620 1
 
2.7%
943 1
 
2.7%
808 1
 
2.7%
780 1
 
2.7%
892 1
 
2.7%
936 1
 
2.7%
568 1
 
2.7%
758 1
 
2.7%
926 1
 
2.7%
Other values (26) 26
70.3%
ValueCountFrequency (%)
536 1
2.7%
551 1
2.7%
559 1
2.7%
568 1
2.7%
569 1
2.7%
570 1
2.7%
620 1
2.7%
653 1
2.7%
696 1
2.7%
702 1
2.7%
ValueCountFrequency (%)
1980 1
2.7%
1273 1
2.7%
1153 1
2.7%
1115 1
2.7%
1093 2
5.4%
948 1
2.7%
943 1
2.7%
936 1
2.7%
926 1
2.7%
915 1
2.7%

연면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean105080.17
Minimum38822.06
Maximum234962.38
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size465.0 B
2024-01-10T07:51:56.795616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum38822.06
5-th percentile55763.804
Q182307.75
median106106.34
Q3129291.18
95-th percentile152570.63
Maximum234962.38
Range196140.32
Interquartile range (IQR)46983.43

Descriptive statistics

Standard deviation40378.921
Coefficient of variation (CV)0.38426776
Kurtosis2.2582589
Mean105080.17
Median Absolute Deviation (MAD)23798.59
Skewness1.0257756
Sum3887966.2
Variance1.6304573 × 109
MonotonicityNot monotonic
2024-01-10T07:51:56.899084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
56185.45 1
 
2.7%
138103.17 1
 
2.7%
124595.06 1
 
2.7%
115396.41 1
 
2.7%
105607.1 1
 
2.7%
132145.45 1
 
2.7%
128220.88 1
 
2.7%
70999.17 1
 
2.7%
113197.06 1
 
2.7%
129291.18 1
 
2.7%
Other values (27) 27
73.0%
ValueCountFrequency (%)
38822.06 1
2.7%
54077.22 1
2.7%
56185.45 1
2.7%
56683.88 1
2.7%
56932.21 1
2.7%
57628.51 1
2.7%
60014.72 1
2.7%
64375.66 1
2.7%
70999.17 1
2.7%
82307.75 1
2.7%
ValueCountFrequency (%)
234962.38 1
2.7%
204878.68 1
2.7%
139493.62 1
2.7%
138103.17 1
2.7%
136170.66 1
2.7%
134885.34 1
2.7%
132997.04 1
2.7%
132955.66 1
2.7%
132145.45 1
2.7%
129291.18 1
2.7%

Interactions

2024-01-10T07:51:54.470499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:51:53.996208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:51:54.227137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:51:54.550187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:51:54.067937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:51:54.314889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:51:54.633044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:51:54.149785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:51:54.389173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:51:56.980823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번단지명도로명주소대표번호세대수연면적(제곱미터)
연번1.0001.0000.9391.0000.3370.352
단지명1.0001.0001.0001.0001.0001.000
도로명주소0.9391.0001.0001.0000.6631.000
대표번호1.0001.0001.0001.0001.0001.000
세대수0.3371.0000.6631.0001.0000.864
연면적(제곱미터)0.3521.0001.0001.0000.8641.000
2024-01-10T07:51:57.066658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번세대수연면적(제곱미터)
연번1.000-0.1500.402
세대수-0.1501.0000.606
연면적(제곱미터)0.4020.6061.000

Missing values

2024-01-10T07:51:54.740138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:51:54.823226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번단지명도로명주소대표번호세대수연면적(제곱미터)
01동문삼성아파트서산시 서령로 137 (동문동)041-669-019962056185.45
12풍림아파트서산시 대산읍 충의로 1839-7041-681-357757056683.88
23석림주공3단지아파트서산시 석림4로 83 (석림동)041-665-470277538822.06
34부영아파트서산시 부춘1로 19 (읍내동)041-665-2997109383913.43
45석림주공2단지아파트서산시 석림4로 83 (석림동)041-665-5708115385902.45
56엘지사원아파트서산시 대산읍 명지1로 259-14041-661-2121111586831.06
67죽성동삼성아파트서산시 남부순환로 767 (죽성동)041-665-0992948112020.58
78읍내현대아파트서산시 호수공원6로 19 (읍내동)041-681-4969915107755.01
89한성스위트빌서산시 음암면 서령로 325-23041-669-916669657628.51
910센스빌아파트서산시 동서1로 130 (석남동)041-663-2662109398349.87
연번단지명도로명주소대표번호세대수연면적(제곱미터)
2728고운라피네서산시 성연면 성연3로 57-30041-662-1482758113197.06
2829동문동한성필하우스서산시 서령로 36(동문동)041-666-0235926138103.17
2930양우내안애퍼스트힐서산시 안견로 457(읍내동)041-666-2914943129291.18
3031라온프라이빗서산시 석림1로 77(석림동)041-669-451556982572.42
3132골드클래스서산시 성연면 성연5로 46041-681-3341880132997.04
3233금호어울림에듀퍼스트서산시 성연면 성연5로 92-7041-666-0818725104717.7
3334중흥S-클래스더퍼스트서산시 예천5로 32(예천동)041-669-11561273204878.68
3435센텀파크뷰서희서산시 고운로 17(예천동)041-665-955765391394.0
3536서산테크노밸리 우미린서산시 성연면 성연3로 77041-666-116355164375.66
3637서산 푸르지오 더 센트럴서산시 예천3로 72041-666-0526861132955.66