Overview

Dataset statistics

Number of variables5
Number of observations28
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory45.7 B

Variable types

Numeric1
Text2
Categorical2

Dataset

Description대전광역시 서구 관내에서 운영중인 고압가스업 현황 정보(업소명, 주소, 사업 종류(제조/판매/저장소))를 제공합니다
URLhttps://www.data.go.kr/data/15061959/fileData.do

Alerts

기준일자 has constant value ""Constant
순번 has unique valuesUnique
사업소소재지 has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:31:58.805839
Analysis finished2023-12-11 23:31:59.302489
Duration0.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.5
Minimum1
Maximum28
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size384.0 B
2023-12-12T08:31:59.363614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.35
Q17.75
median14.5
Q321.25
95-th percentile26.65
Maximum28
Range27
Interquartile range (IQR)13.5

Descriptive statistics

Standard deviation8.2259751
Coefficient of variation (CV)0.56730863
Kurtosis-1.2
Mean14.5
Median Absolute Deviation (MAD)7
Skewness0
Sum406
Variance67.666667
MonotonicityStrictly increasing
2023-12-12T08:31:59.492390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
1 1
 
3.6%
16 1
 
3.6%
28 1
 
3.6%
27 1
 
3.6%
26 1
 
3.6%
25 1
 
3.6%
24 1
 
3.6%
23 1
 
3.6%
22 1
 
3.6%
21 1
 
3.6%
Other values (18) 18
64.3%
ValueCountFrequency (%)
1 1
3.6%
2 1
3.6%
3 1
3.6%
4 1
3.6%
5 1
3.6%
6 1
3.6%
7 1
3.6%
8 1
3.6%
9 1
3.6%
10 1
3.6%
ValueCountFrequency (%)
28 1
3.6%
27 1
3.6%
26 1
3.6%
25 1
3.6%
24 1
3.6%
23 1
3.6%
22 1
3.6%
21 1
3.6%
20 1
3.6%
19 1
3.6%
Distinct26
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Memory size356.0 B
2023-12-12T08:31:59.737795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16.5
Mean length9.3571429
Min length4

Characters and Unicode

Total characters262
Distinct characters104
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)85.7%

Sample

1st row대전광역시 상수도사업본부 월평정수사업소
2nd row대전문화예술의전당
3rd row한국방송공사
4th row대전둔산소방서(샘머리119안전센터)
5th row(주)세창
ValueCountFrequency (%)
대전청사관리소 2
 
5.7%
에스케이텔레콤(주 2
 
5.7%
주)국민은행 1
 
2.9%
월평정수사업소 1
 
2.9%
상수도사업본부 1
 
2.9%
대전광역시 1
 
2.9%
용문역지점 1
 
2.9%
주)우리은행 1
 
2.9%
을지학원 1
 
2.9%
남선공원종합체육관(빙상장 1
 
2.9%
Other values (23) 23
65.7%
2023-12-12T08:32:00.416728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
6.1%
16
 
6.1%
) 9
 
3.4%
( 9
 
3.4%
9
 
3.4%
8
 
3.1%
7
 
2.7%
7
 
2.7%
6
 
2.3%
6
 
2.3%
Other values (94) 169
64.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 234
89.3%
Close Punctuation 9
 
3.4%
Open Punctuation 9
 
3.4%
Space Separator 7
 
2.7%
Decimal Number 3
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
6.8%
16
 
6.8%
9
 
3.8%
8
 
3.4%
7
 
3.0%
6
 
2.6%
6
 
2.6%
5
 
2.1%
5
 
2.1%
5
 
2.1%
Other values (89) 151
64.5%
Decimal Number
ValueCountFrequency (%)
1 2
66.7%
9 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 234
89.3%
Common 28
 
10.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
6.8%
16
 
6.8%
9
 
3.8%
8
 
3.4%
7
 
3.0%
6
 
2.6%
6
 
2.6%
5
 
2.1%
5
 
2.1%
5
 
2.1%
Other values (89) 151
64.5%
Common
ValueCountFrequency (%)
) 9
32.1%
( 9
32.1%
7
25.0%
1 2
 
7.1%
9 1
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 234
89.3%
ASCII 28
 
10.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
16
 
6.8%
16
 
6.8%
9
 
3.8%
8
 
3.4%
7
 
3.0%
6
 
2.6%
6
 
2.6%
5
 
2.1%
5
 
2.1%
5
 
2.1%
Other values (89) 151
64.5%
ASCII
ValueCountFrequency (%)
) 9
32.1%
( 9
32.1%
7
25.0%
1 2
 
7.1%
9 1
 
3.6%

사업의종류
Categorical

Distinct3
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Memory size356.0 B
제조
23 
저장소
판매
 
1

Length

Max length3
Median length2
Mean length2.1428571
Min length2

Unique

Unique1 ?
Unique (%)3.6%

Sample

1st row저장소
2nd row제조
3rd row제조
4th row제조
5th row제조

Common Values

ValueCountFrequency (%)
제조 23
82.1%
저장소 4
 
14.3%
판매 1
 
3.6%

Length

2023-12-12T08:32:00.564658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:32:00.667474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제조 23
82.1%
저장소 4
 
14.3%
판매 1
 
3.6%

사업소소재지
Text

UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size356.0 B
2023-12-12T08:32:00.882273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length34
Mean length24.464286
Min length21

Characters and Unicode

Total characters685
Distinct characters75
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)100.0%

Sample

1st row대전광역시 서구 신갈마로141번길 82 (월평동)
2nd row대전광역시 서구 둔산대로 155 (만년동)
3rd row대전광역시 서구 둔산대로 117번길 128 (만년동)
4th row대전광역시 서구 둔산로101번길 26 (둔산동)
5th row대전광역시 서구 계룡로 598 (괴정동)
ValueCountFrequency (%)
대전광역시 28
19.2%
서구 28
19.2%
둔산동 9
 
6.2%
탄방동 4
 
2.7%
월평동 3
 
2.1%
청사로 3
 
2.1%
갈마동 2
 
1.4%
문정로 2
 
1.4%
189 2
 
1.4%
둔산로 2
 
1.4%
Other values (60) 63
43.2%
2023-12-12T08:32:01.274799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
122
17.8%
35
 
5.1%
31
 
4.5%
31
 
4.5%
31
 
4.5%
30
 
4.4%
30
 
4.4%
30
 
4.4%
28
 
4.1%
27
 
3.9%
Other values (65) 290
42.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 413
60.3%
Space Separator 122
 
17.8%
Decimal Number 91
 
13.3%
Close Punctuation 27
 
3.9%
Open Punctuation 27
 
3.9%
Other Punctuation 4
 
0.6%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
8.5%
31
 
7.5%
31
 
7.5%
31
 
7.5%
30
 
7.3%
30
 
7.3%
30
 
7.3%
28
 
6.8%
27
 
6.5%
19
 
4.6%
Other values (50) 121
29.3%
Decimal Number
ValueCountFrequency (%)
1 23
25.3%
5 11
12.1%
2 8
 
8.8%
8 8
 
8.8%
9 8
 
8.8%
4 7
 
7.7%
3 7
 
7.7%
7 7
 
7.7%
0 7
 
7.7%
6 5
 
5.5%
Space Separator
ValueCountFrequency (%)
122
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 413
60.3%
Common 272
39.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
8.5%
31
 
7.5%
31
 
7.5%
31
 
7.5%
30
 
7.3%
30
 
7.3%
30
 
7.3%
28
 
6.8%
27
 
6.5%
19
 
4.6%
Other values (50) 121
29.3%
Common
ValueCountFrequency (%)
122
44.9%
) 27
 
9.9%
( 27
 
9.9%
1 23
 
8.5%
5 11
 
4.0%
2 8
 
2.9%
8 8
 
2.9%
9 8
 
2.9%
4 7
 
2.6%
3 7
 
2.6%
Other values (5) 24
 
8.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 413
60.3%
ASCII 272
39.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
122
44.9%
) 27
 
9.9%
( 27
 
9.9%
1 23
 
8.5%
5 11
 
4.0%
2 8
 
2.9%
8 8
 
2.9%
9 8
 
2.9%
4 7
 
2.6%
3 7
 
2.6%
Other values (5) 24
 
8.8%
Hangul
ValueCountFrequency (%)
35
 
8.5%
31
 
7.5%
31
 
7.5%
31
 
7.5%
30
 
7.3%
30
 
7.3%
30
 
7.3%
28
 
6.8%
27
 
6.5%
19
 
4.6%
Other values (50) 121
29.3%

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size356.0 B
2023-07-08
28 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-08
2nd row2023-07-08
3rd row2023-07-08
4th row2023-07-08
5th row2023-07-08

Common Values

ValueCountFrequency (%)
2023-07-08 28
100.0%

Length

2023-12-12T08:32:01.438220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:32:01.536119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-08 28
100.0%

Interactions

2023-12-12T08:31:59.023345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:32:01.592730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번법인명(상호)사업의종류사업소소재지
순번1.0000.7320.0001.000
법인명(상호)0.7321.0001.0001.000
사업의종류0.0001.0001.0001.000
사업소소재지1.0001.0001.0001.000
2023-12-12T08:32:01.681701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업의종류
순번1.0000.000
사업의종류0.0001.000

Missing values

2023-12-12T08:31:59.155126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:31:59.257132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번법인명(상호)사업의종류사업소소재지기준일자
01대전광역시 상수도사업본부 월평정수사업소저장소대전광역시 서구 신갈마로141번길 82 (월평동)2023-07-08
12대전문화예술의전당제조대전광역시 서구 둔산대로 155 (만년동)2023-07-08
23한국방송공사제조대전광역시 서구 둔산대로 117번길 128 (만년동)2023-07-08
34대전둔산소방서(샘머리119안전센터)제조대전광역시 서구 둔산로101번길 26 (둔산동)2023-07-08
45(주)세창제조대전광역시 서구 계룡로 598 (괴정동)2023-07-08
56대전청사관리소제조대전광역시 서구 청사로 189 (둔산동)2023-07-08
67한진산소판매대전광역시 서구 장안로 76 (매노동)2023-07-08
78대전서부소방서제조대전광역시 서구 용소로 46 (가수원동)2023-07-08
89대전둔산소방서제조대전광역시 서구 갈마중로 15, 대전광역시둔산소방서 (갈마동)2023-07-08
910에스케이텔레콤(주)제조대전광역시 서구 문정로 41 (탄방동)2023-07-08
순번법인명(상호)사업의종류사업소소재지기준일자
1819계룡병원제조대전광역시 서구 갈마로 45 (갈마동)2023-07-08
1920학교법인 을지학원 대전을지대학교병원저장소대전광역시 서구 둔산서로 95 (둔산동)2023-07-08
2021한국자산관리공사제조대전광역시 서구 한밭대로 713 (월평동)2023-07-08
2122대전지방경찰청제조대전광역시 서구 둔산중로 77 (둔산동)2023-07-08
2223한국전력공사제조대전광역시 서구 변동중로 39 (변동)2023-07-08
2324오페라웨딩컨벤션제조대전광역시 서구 둔산남로 50 (탄방동)2023-07-08
2425건양대학교병원저장소대전광역시 서구 관저동로 158 (관저동)2023-07-08
2526남선공원종합체육관(빙상장)제조대전광역시 서구 탄방동 1084 남선공원2023-07-08
2627대전고등법원제조대전광역시 서구 둔산중로78번길 15 (둔산동)2023-07-08
2728(주)국민은행 둔산선사종합금융센터제조대전광역시 서구 대덕대로 294 (둔산동)2023-07-08