Overview

Dataset statistics

Number of variables5
Number of observations59
Missing cells17
Missing cells (%)5.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 KiB
Average record size in memory43.2 B

Variable types

Numeric1
Text4

Dataset

Description세종특별자치시에 있는 실내건축 공사업체 정보를 제공합니다.데이터는 업체명, 도로명주소, 지번주소, 전화번호로 구성되어 있습니다.
Author세종특별자치시
URLhttps://www.data.go.kr/data/15038216/fileData.do

Alerts

도로명주소 has 1 (1.7%) missing valuesMissing
지번주소 has 1 (1.7%) missing valuesMissing
전화번호 has 15 (25.4%) missing valuesMissing
번호 has unique valuesUnique
업체명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:02:05.862167
Analysis finished2023-12-12 10:02:07.063076
Duration1.2 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct59
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean30
Minimum1
Maximum59
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size663.0 B
2023-12-12T19:02:07.187020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.9
Q115.5
median30
Q344.5
95-th percentile56.1
Maximum59
Range58
Interquartile range (IQR)29

Descriptive statistics

Standard deviation17.175564
Coefficient of variation (CV)0.5725188
Kurtosis-1.2
Mean30
Median Absolute Deviation (MAD)15
Skewness0
Sum1770
Variance295
MonotonicityStrictly increasing
2023-12-12T19:02:07.380411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.7%
2 1
 
1.7%
33 1
 
1.7%
34 1
 
1.7%
35 1
 
1.7%
36 1
 
1.7%
37 1
 
1.7%
38 1
 
1.7%
39 1
 
1.7%
40 1
 
1.7%
Other values (49) 49
83.1%
ValueCountFrequency (%)
1 1
1.7%
2 1
1.7%
3 1
1.7%
4 1
1.7%
5 1
1.7%
6 1
1.7%
7 1
1.7%
8 1
1.7%
9 1
1.7%
10 1
1.7%
ValueCountFrequency (%)
59 1
1.7%
58 1
1.7%
57 1
1.7%
56 1
1.7%
55 1
1.7%
54 1
1.7%
53 1
1.7%
52 1
1.7%
51 1
1.7%
50 1
1.7%

업체명
Text

UNIQUE 

Distinct59
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size604.0 B
2023-12-12T19:02:07.675919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length7.7118644
Min length2

Characters and Unicode

Total characters455
Distinct characters107
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)100.0%

Sample

1st row(주)건솔
2nd row(주)다임하우징
3rd row(주)담다
4th row(주)대광이엔지
5th row(주)라온디자인
ValueCountFrequency (%)
주)건솔 1
 
1.7%
금륭투수개발(주 1
 
1.7%
대교산업개발(주 1
 
1.7%
디자인다름주식회사 1
 
1.7%
만경엔지니어링주식회사 1
 
1.7%
세운종합건설주식회사 1
 
1.7%
세윤디자인주식회사 1
 
1.7%
시소우건축디자인연구소 1
 
1.7%
아뜰리에지음주식회사 1
 
1.7%
에스씨건설(주 1
 
1.7%
Other values (49) 49
83.1%
2023-12-12T19:02:08.166377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
55
 
12.1%
( 37
 
8.1%
) 37
 
8.1%
21
 
4.6%
19
 
4.2%
19
 
4.2%
15
 
3.3%
14
 
3.1%
13
 
2.9%
11
 
2.4%
Other values (97) 214
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 381
83.7%
Open Punctuation 37
 
8.1%
Close Punctuation 37
 
8.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
55
 
14.4%
21
 
5.5%
19
 
5.0%
19
 
5.0%
15
 
3.9%
14
 
3.7%
13
 
3.4%
11
 
2.9%
11
 
2.9%
10
 
2.6%
Other values (95) 193
50.7%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 381
83.7%
Common 74
 
16.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
55
 
14.4%
21
 
5.5%
19
 
5.0%
19
 
5.0%
15
 
3.9%
14
 
3.7%
13
 
3.4%
11
 
2.9%
11
 
2.9%
10
 
2.6%
Other values (95) 193
50.7%
Common
ValueCountFrequency (%)
( 37
50.0%
) 37
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 381
83.7%
ASCII 74
 
16.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
55
 
14.4%
21
 
5.5%
19
 
5.0%
19
 
5.0%
15
 
3.9%
14
 
3.7%
13
 
3.4%
11
 
2.9%
11
 
2.9%
10
 
2.6%
Other values (95) 193
50.7%
ASCII
ValueCountFrequency (%)
( 37
50.0%
) 37
50.0%

도로명주소
Text

MISSING 

Distinct52
Distinct (%)89.7%
Missing1
Missing (%)1.7%
Memory size604.0 B
2023-12-12T19:02:08.486275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length22
Mean length18.413793
Min length13

Characters and Unicode

Total characters1068
Distinct characters103
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)81.0%

Sample

1st row세종특별자치시 부강면 시장길 38
2nd row세종특별자치시 대평로75
3rd row세종특별자치시 한누리대로 2009
4th row세종특별자치시 연서면 가마골길 140-8
5th row세종특별자치시 갈매로 351
ValueCountFrequency (%)
세종특별자치시 58
29.0%
조치원읍 10
 
5.0%
한누리대로 8
 
4.0%
연서면 8
 
4.0%
갈매로 6
 
3.0%
금남면 4
 
2.0%
월성로 3
 
1.5%
351 3
 
1.5%
353 3
 
1.5%
2009 3
 
1.5%
Other values (81) 94
47.0%
2023-12-12T19:02:08.977913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
174
16.3%
69
 
6.5%
62
 
5.8%
61
 
5.7%
60
 
5.6%
58
 
5.4%
58
 
5.4%
58
 
5.4%
40
 
3.7%
1 33
 
3.1%
Other values (93) 395
37.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 715
66.9%
Space Separator 174
 
16.3%
Decimal Number 169
 
15.8%
Dash Punctuation 10
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
69
 
9.7%
62
 
8.7%
61
 
8.5%
60
 
8.4%
58
 
8.1%
58
 
8.1%
58
 
8.1%
40
 
5.6%
18
 
2.5%
15
 
2.1%
Other values (81) 216
30.2%
Decimal Number
ValueCountFrequency (%)
1 33
19.5%
3 32
18.9%
0 21
12.4%
2 20
11.8%
4 15
8.9%
5 13
 
7.7%
7 11
 
6.5%
9 10
 
5.9%
8 9
 
5.3%
6 5
 
3.0%
Space Separator
ValueCountFrequency (%)
174
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 715
66.9%
Common 353
33.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
69
 
9.7%
62
 
8.7%
61
 
8.5%
60
 
8.4%
58
 
8.1%
58
 
8.1%
58
 
8.1%
40
 
5.6%
18
 
2.5%
15
 
2.1%
Other values (81) 216
30.2%
Common
ValueCountFrequency (%)
174
49.3%
1 33
 
9.3%
3 32
 
9.1%
0 21
 
5.9%
2 20
 
5.7%
4 15
 
4.2%
5 13
 
3.7%
7 11
 
3.1%
9 10
 
2.8%
- 10
 
2.8%
Other values (2) 14
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 715
66.9%
ASCII 353
33.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
174
49.3%
1 33
 
9.3%
3 32
 
9.1%
0 21
 
5.9%
2 20
 
5.7%
4 15
 
4.2%
5 13
 
3.7%
7 11
 
3.1%
9 10
 
2.8%
- 10
 
2.8%
Other values (2) 14
 
4.0%
Hangul
ValueCountFrequency (%)
69
 
9.7%
62
 
8.7%
61
 
8.5%
60
 
8.4%
58
 
8.1%
58
 
8.1%
58
 
8.1%
40
 
5.6%
18
 
2.5%
15
 
2.1%
Other values (81) 216
30.2%

지번주소
Text

MISSING 

Distinct54
Distinct (%)93.1%
Missing1
Missing (%)1.7%
Memory size604.0 B
2023-12-12T19:02:09.615818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length21
Mean length18.103448
Min length14

Characters and Unicode

Total characters1050
Distinct characters66
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)87.9%

Sample

1st row세종특별자치시 부강면 부강리 458-1
2nd row세종특별자치시 대평동677
3rd row세종특별자치시 소담동 576
4th row세종특별자치시 연서면 봉암리 678-1
5th row세종특별자치시 어진동 671
ValueCountFrequency (%)
세종특별자치시 58
29.3%
조치원읍 10
 
5.1%
연서면 8
 
4.0%
나성동 8
 
4.0%
어진동 7
 
3.5%
대평동 5
 
2.5%
월하리 5
 
2.5%
소담동 4
 
2.0%
금남면 4
 
2.0%
침산리 3
 
1.5%
Other values (68) 86
43.4%
2023-12-12T19:02:10.007920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
164
15.6%
68
 
6.5%
58
 
5.5%
58
 
5.5%
58
 
5.5%
58
 
5.5%
58
 
5.5%
58
 
5.5%
7 36
 
3.4%
34
 
3.2%
Other values (56) 400
38.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 663
63.1%
Decimal Number 198
 
18.9%
Space Separator 164
 
15.6%
Dash Punctuation 25
 
2.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
68
 
10.3%
58
 
8.7%
58
 
8.7%
58
 
8.7%
58
 
8.7%
58
 
8.7%
58
 
8.7%
34
 
5.1%
25
 
3.8%
15
 
2.3%
Other values (44) 173
26.1%
Decimal Number
ValueCountFrequency (%)
7 36
18.2%
6 31
15.7%
1 28
14.1%
2 24
12.1%
8 19
9.6%
5 18
9.1%
0 15
7.6%
4 10
 
5.1%
3 10
 
5.1%
9 7
 
3.5%
Space Separator
ValueCountFrequency (%)
164
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 663
63.1%
Common 387
36.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
68
 
10.3%
58
 
8.7%
58
 
8.7%
58
 
8.7%
58
 
8.7%
58
 
8.7%
58
 
8.7%
34
 
5.1%
25
 
3.8%
15
 
2.3%
Other values (44) 173
26.1%
Common
ValueCountFrequency (%)
164
42.4%
7 36
 
9.3%
6 31
 
8.0%
1 28
 
7.2%
- 25
 
6.5%
2 24
 
6.2%
8 19
 
4.9%
5 18
 
4.7%
0 15
 
3.9%
4 10
 
2.6%
Other values (2) 17
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 663
63.1%
ASCII 387
36.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
164
42.4%
7 36
 
9.3%
6 31
 
8.0%
1 28
 
7.2%
- 25
 
6.5%
2 24
 
6.2%
8 19
 
4.9%
5 18
 
4.7%
0 15
 
3.9%
4 10
 
2.6%
Other values (2) 17
 
4.4%
Hangul
ValueCountFrequency (%)
68
 
10.3%
58
 
8.7%
58
 
8.7%
58
 
8.7%
58
 
8.7%
58
 
8.7%
58
 
8.7%
34
 
5.1%
25
 
3.8%
15
 
2.3%
Other values (44) 173
26.1%

전화번호
Text

MISSING 

Distinct44
Distinct (%)100.0%
Missing15
Missing (%)25.4%
Memory size604.0 B
2023-12-12T19:02:10.291672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.136364
Min length12

Characters and Unicode

Total characters534
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)100.0%

Sample

1st row070-4383-1614
2nd row044-865-6388
3rd row044-866-2341
4th row044-862-6871
5th row044-862-2845
ValueCountFrequency (%)
044-863-7072 1
 
2.3%
044-864-0733 1
 
2.3%
044-865-0480 1
 
2.3%
044-555-1103 1
 
2.3%
044-863-7735 1
 
2.3%
044-866-0705 1
 
2.3%
070-7500-5616 1
 
2.3%
044-866-1617 1
 
2.3%
044-863-6037 1
 
2.3%
044-863-5941 1
 
2.3%
Other values (34) 34
77.3%
2023-12-12T19:02:10.721118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 100
18.7%
- 88
16.5%
0 75
14.0%
8 60
11.2%
6 60
11.2%
7 33
 
6.2%
3 28
 
5.2%
5 28
 
5.2%
1 27
 
5.1%
2 23
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 446
83.5%
Dash Punctuation 88
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 100
22.4%
0 75
16.8%
8 60
13.5%
6 60
13.5%
7 33
 
7.4%
3 28
 
6.3%
5 28
 
6.3%
1 27
 
6.1%
2 23
 
5.2%
9 12
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 88
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 534
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 100
18.7%
- 88
16.5%
0 75
14.0%
8 60
11.2%
6 60
11.2%
7 33
 
6.2%
3 28
 
5.2%
5 28
 
5.2%
1 27
 
5.1%
2 23
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 534
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 100
18.7%
- 88
16.5%
0 75
14.0%
8 60
11.2%
6 60
11.2%
7 33
 
6.2%
3 28
 
5.2%
5 28
 
5.2%
1 27
 
5.1%
2 23
 
4.3%

Interactions

2023-12-12T19:02:06.541515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:02:10.827147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업체명도로명주소지번주소전화번호
번호1.0001.0000.8860.7181.000
업체명1.0001.0001.0001.0001.000
도로명주소0.8861.0001.0000.9981.000
지번주소0.7181.0000.9981.0001.000
전화번호1.0001.0001.0001.0001.000

Missing values

2023-12-12T19:02:06.739556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:02:06.853786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T19:02:06.987027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호업체명도로명주소지번주소전화번호
01(주)건솔세종특별자치시 부강면 시장길 38세종특별자치시 부강면 부강리 458-1070-4383-1614
12(주)다임하우징세종특별자치시 대평로75세종특별자치시 대평동677044-865-6388
23(주)담다세종특별자치시 한누리대로 2009세종특별자치시 소담동 576<NA>
34(주)대광이엔지세종특별자치시 연서면 가마골길 140-8세종특별자치시 연서면 봉암리 678-1044-866-2341
45(주)라온디자인세종특별자치시 갈매로 351세종특별자치시 어진동 671044-862-6871
56(주)미성아이디세종특별자치시 보듬3로 8-20세종특별자치시 도담동 660<NA>
67(주)사우세종특별자치시 가름로 232세종특별자치시 어진동 657044-862-2845
78(주)산마들세종특별자치시 연서면 월성로 14세종특별자치시 연서면 월하리 515-2044-862-1855
89(주)새인세종특별자치시 조치원읍 건강길 6세종특별자치시 조치원읍 교리 12-2044-864-0629
910(주)서현세종특별자치시 금남면 범허리길 105세종특별자치시 금남면 성강리 22-2044-864-7288
번호업체명도로명주소지번주소전화번호
4950주식회사성아디에스세종특별자치시 보듬3로 8-20세종특별자치시 도담동 660<NA>
5051주식회사순리세종특별자치시 시청대로 20세종특별자치시 대평동 672<NA>
5152주식회사씨에스엠세종특별자치시 국세청로 4세종특별자치시 나성동 767044-865-4319
5253주식회사아림<NA><NA>044-864-4664
5354주식회사에이스건축세종특별자치시 대평3길 17세종특별자치시 대평동 680<NA>
5455주식회사청원씨엔텍세종특별자치시 연서면 세종로 2139-10세종특별자치시 연서면 월하리 650-4070-7500-4370
5556주식회사토브밸리원세종특별자치시 호려울로 9세종특별자치시 보람동 755<NA>
5657지한건설(주)세종특별자치시 집현중앙7로 6세종특별자치시 집현동 1163044-715-7395
5758케이에스씨건설(주)세종특별자치시 대평3길 17세종특별자치시 대평동 680070-4167-8264
5859큰빛건설(주)세종특별자치시 연서면 당산로 318-1세종특별자치시 연서면 봉암리 624-11044-863-1198