Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows5
Duplicate rows (%)0.1%
Total size in memory390.6 KiB
Average record size in memory40.0 B

Variable types

Categorical3
Text1

Dataset

Description한국지역난방공사에서 공급하는 지역난방 사용자에 관한 정보로 전국의 상세한 건물명, 건물용도별 사용자 정보를 제공합니다.
Author한국지역난방공사
URLhttps://www.data.go.kr/data/15090340/fileData.do

Alerts

Dataset has 5 (0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 08:38:38.186197
Analysis finished2023-12-12 08:38:38.914586
Duration0.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준일
Categorical

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022-12-31
1739 
2022-06-30
1724 
2021-12-31
1674 
2020-06-30
1649 
2020-12-31
1615 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-06-30
2nd row2020-06-30
3rd row2021-06-30
4th row2021-12-31
5th row2022-06-30

Common Values

ValueCountFrequency (%)
2022-12-31 1739
17.4%
2022-06-30 1724
17.2%
2021-12-31 1674
16.7%
2020-06-30 1649
16.5%
2020-12-31 1615
16.2%
2021-06-30 1599
16.0%

Length

2023-12-12T17:38:38.978687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:38:39.103750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-12-31 1739
17.4%
2022-06-30 1724
17.2%
2021-12-31 1674
16.7%
2020-06-30 1649
16.5%
2020-12-31 1615
16.2%
2021-06-30 1599
16.0%

용도
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
아파트
5124 
업무용
3603 
공공용
1273 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row업무용
2nd row아파트
3rd row아파트
4th row업무용
5th row공공용

Common Values

ValueCountFrequency (%)
아파트 5124
51.2%
업무용 3603
36.0%
공공용 1273
 
12.7%

Length

2023-12-12T17:38:39.279623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:38:39.385937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
아파트 5124
51.2%
업무용 3603
36.0%
공공용 1273
 
12.7%
Distinct4674
Distinct (%)46.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:38:39.690518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length25
Mean length11.1471
Min length2

Characters and Unicode

Total characters111471
Distinct characters678
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1464 ?
Unique (%)14.6%

Sample

1st row에스티보보오피스텔
2nd row서초현대아파트
3rd row옥빛13단지(일신건영)
4th row샘물교회 분당타운
5th row분당노인종합복지관
ValueCountFrequency (%)
오피스텔 179
 
1.2%
아파트 104
 
0.7%
어진동 83
 
0.6%
나성동 61
 
0.4%
은평뉴타운 58
 
0.4%
광교 54
 
0.4%
아름동 53
 
0.4%
도담동 51
 
0.3%
소담동 50
 
0.3%
평택고덕 46
 
0.3%
Other values (5508) 14342
95.1%
2023-12-12T17:38:40.196356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5086
 
4.6%
( 3326
 
3.0%
) 3308
 
3.0%
3092
 
2.8%
2721
 
2.4%
2629
 
2.4%
2545
 
2.3%
1 2312
 
2.1%
2097
 
1.9%
2071
 
1.9%
Other values (668) 82284
73.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 85557
76.8%
Decimal Number 8357
 
7.5%
Space Separator 5086
 
4.6%
Open Punctuation 3380
 
3.0%
Close Punctuation 3362
 
3.0%
Uppercase Letter 2809
 
2.5%
Dash Punctuation 1619
 
1.5%
Other Punctuation 1026
 
0.9%
Lowercase Letter 211
 
0.2%
Letter Number 35
 
< 0.1%
Other values (3) 29
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3092
 
3.6%
2721
 
3.2%
2629
 
3.1%
2545
 
3.0%
2097
 
2.5%
2071
 
2.4%
1932
 
2.3%
1592
 
1.9%
1360
 
1.6%
1325
 
1.5%
Other values (582) 64193
75.0%
Uppercase Letter
ValueCountFrequency (%)
A 437
15.6%
B 387
13.8%
L 374
13.3%
C 253
9.0%
S 177
 
6.3%
K 164
 
5.8%
H 137
 
4.9%
D 104
 
3.7%
M 98
 
3.5%
T 96
 
3.4%
Other values (16) 582
20.7%
Lowercase Letter
ValueCountFrequency (%)
e 103
48.8%
t 19
 
9.0%
l 19
 
9.0%
h 13
 
6.2%
r 11
 
5.2%
o 8
 
3.8%
c 7
 
3.3%
w 6
 
2.8%
i 5
 
2.4%
y 4
 
1.9%
Other values (10) 16
 
7.6%
Decimal Number
ValueCountFrequency (%)
1 2312
27.7%
2 1657
19.8%
3 1066
12.8%
4 747
 
8.9%
0 669
 
8.0%
5 539
 
6.4%
6 407
 
4.9%
7 324
 
3.9%
8 303
 
3.6%
9 270
 
3.2%
Other values (9) 63
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 760
74.1%
/ 163
 
15.9%
& 51
 
5.0%
. 43
 
4.2%
# 6
 
0.6%
· 3
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 3326
98.4%
[ 37
 
1.1%
17
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 3308
98.4%
] 37
 
1.1%
17
 
0.5%
Letter Number
ValueCountFrequency (%)
15
42.9%
14
40.0%
6
 
17.1%
Math Symbol
ValueCountFrequency (%)
~ 6
60.0%
+ 4
40.0%
Space Separator
ValueCountFrequency (%)
5086
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1619
100.0%
Other Symbol
ValueCountFrequency (%)
13
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 85563
76.8%
Common 22846
 
20.5%
Latin 3055
 
2.7%
Han 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3092
 
3.6%
2721
 
3.2%
2629
 
3.1%
2545
 
3.0%
2097
 
2.5%
2071
 
2.4%
1932
 
2.3%
1592
 
1.9%
1360
 
1.6%
1325
 
1.5%
Other values (580) 64199
75.0%
Latin
ValueCountFrequency (%)
A 437
14.3%
B 387
12.7%
L 374
12.2%
C 253
 
8.3%
S 177
 
5.8%
K 164
 
5.4%
H 137
 
4.5%
D 104
 
3.4%
e 103
 
3.4%
M 98
 
3.2%
Other values (39) 821
26.9%
Common
ValueCountFrequency (%)
5086
22.3%
( 3326
14.6%
) 3308
14.5%
1 2312
10.1%
2 1657
 
7.3%
- 1619
 
7.1%
3 1066
 
4.7%
, 760
 
3.3%
4 747
 
3.3%
0 669
 
2.9%
Other values (26) 2296
10.0%
Han
ValueCountFrequency (%)
4
57.1%
2
28.6%
1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 85550
76.7%
ASCII 25766
 
23.1%
None 113
 
0.1%
Number Forms 35
 
< 0.1%
CJK 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5086
19.7%
( 3326
12.9%
) 3308
12.8%
1 2312
9.0%
2 1657
 
6.4%
- 1619
 
6.3%
3 1066
 
4.1%
, 760
 
2.9%
4 747
 
2.9%
0 669
 
2.6%
Other values (60) 5216
20.2%
Hangul
ValueCountFrequency (%)
3092
 
3.6%
2721
 
3.2%
2629
 
3.1%
2545
 
3.0%
2097
 
2.5%
2071
 
2.4%
1932
 
2.3%
1592
 
1.9%
1360
 
1.6%
1325
 
1.5%
Other values (579) 64186
75.0%
None
ValueCountFrequency (%)
21
18.6%
19
16.8%
17
15.0%
17
15.0%
13
11.5%
8
 
7.1%
4
 
3.5%
4
 
3.5%
3
 
2.7%
· 3
 
2.7%
Other values (3) 4
 
3.5%
Number Forms
ValueCountFrequency (%)
15
42.9%
14
40.0%
6
 
17.1%
CJK
ValueCountFrequency (%)
4
57.1%
2
28.6%
1
 
14.3%

주소
Categorical

Distinct33
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도 성남시 분당구
1703 
경기도 화성시
911 
경기도 고양시 일산동구
695 
세종특별자치시
667 
경기도 고양시 덕양구
612 
Other values (28)
5412 

Length

Max length12
Median length11
Mean length9.9254
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도 수원시 권선구
2nd row서울특별시 서초구
3rd row경기도 고양시 덕양구
4th row경기도 성남시 분당구
5th row경기도 성남시 분당구

Common Values

ValueCountFrequency (%)
경기도 성남시 분당구 1703
17.0%
경기도 화성시 911
 
9.1%
경기도 고양시 일산동구 695
 
7.0%
세종특별자치시 667
 
6.7%
경기도 고양시 덕양구 612
 
6.1%
경기도 용인시 수지구 556
 
5.6%
경기도 수원시 영통구 546
 
5.5%
경기도 고양시 일산서구 500
 
5.0%
경기도 용인시 기흥구 487
 
4.9%
서울특별시 강남구 412
 
4.1%
Other values (23) 2911
29.1%

Length

2023-12-12T17:38:40.369629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 6686
26.6%
고양시 1807
 
7.2%
성남시 1803
 
7.2%
분당구 1703
 
6.8%
서울특별시 1445
 
5.7%
용인시 1043
 
4.1%
화성시 911
 
3.6%
수원시 843
 
3.3%
일산동구 695
 
2.8%
세종특별자치시 667
 
2.7%
Other values (34) 7566
30.1%

Correlations

2023-12-12T17:38:40.451982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준일용도주소
기준일1.0000.0250.000
용도0.0251.0000.528
주소0.0000.5281.000
2023-12-12T17:38:40.547647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용도주소기준일
용도1.0000.2890.010
주소0.2891.0000.000
기준일0.0100.0001.000
2023-12-12T17:38:40.654867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준일용도주소
기준일1.0000.0100.000
용도0.0101.0000.289
주소0.0000.2891.000

Missing values

2023-12-12T17:38:38.757172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:38:38.867104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준일용도건물명주소
117442021-06-30업무용에스티보보오피스텔경기도 수원시 권선구
39622020-06-30아파트서초현대아파트서울특별시 서초구
99142021-06-30아파트옥빛13단지(일신건영)경기도 고양시 덕양구
159312021-12-31업무용샘물교회 분당타운경기도 성남시 분당구
217712022-06-30공공용분당노인종합복지관경기도 성남시 분당구
119502021-06-30아파트벽적골 삼성태영아파트경기도 수원시 영통구
222922022-06-30공공용노인장애인복지시설경기도 수원시 영통구
9182020-06-30공공용하얀마을복지회관경기도 성남시 분당구
171972021-12-31아파트화서한신휴플러스경기도 수원시 팔달구
226392022-06-30업무용소호스타트업 인큐베이팅센터경기도 용인시 기흥구
기준일용도건물명주소
222642022-06-30아파트청명마을 주공아파트경기도 수원시 영통구
170142021-12-31공공용신영초등학교경기도 수원시 영통구
150802021-12-31아파트무원08단지(신안)경기도 고양시 덕양구
99922021-06-30업무용동원빌딩경기도 고양시 덕양구
253052022-06-30공공용충청북도선거관리위원회청사충청북도 청주시 흥덕구
129152021-06-30업무용롬엔드하스전자재료코리아(유)경기도 화성시
195682021-12-31업무용세종비즈니스센터(1-5생활권, 어진동)세종특별자치시
249672022-06-30아파트나릿재마을 2단지 아파트세종특별자치시
65452020-12-31공공용불곡고등학교경기도 성남시 분당구
248862022-06-30공공용3-3 복합커뮤니티센터(3-3생활권, 소담동)세종특별자치시

Duplicate rows

Most frequently occurring

기준일용도건물명주소# duplicates
02020-06-30아파트화이트빌경기도 성남시 분당구2
12021-06-30아파트은평뉴타운 우물골 5단지서울특별시 은평구2
22021-12-31아파트동신건영2차경기도 성남시 분당구2
32021-12-31아파트화이트빌경기도 성남시 분당구2
42022-12-31아파트망포마을 현대1차 아이파크경기도 수원시 영통구2