Overview

Dataset statistics

Number of variables7
Number of observations218
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.9 KiB
Average record size in memory60.6 B

Variable types

Numeric3
Text1
Categorical2
Boolean1

Dataset

Description대전광역시 시설관리공단에서 운영중인 대전역 앞 지하도 상가(동구 중앙로 지하 200)의 점포현황에 대한 상세정보(일렬번호, 점포이름, 점포면적, 점포주소, 건물총면적, 점포사용면적, 사용여부) 제공
Author대전광역시시설관리공단
URLhttps://www.data.go.kr/data/15123938/fileData.do

Alerts

점포주소 has constant value ""Constant
건물총면적 has constant value ""Constant
점포면적 is highly overall correlated with 점포사용면적High correlation
점포사용면적 is highly overall correlated with 점포면적High correlation
사용여부 is highly imbalanced (92.5%)Imbalance
일렬번호 has unique valuesUnique
점포이름 has unique valuesUnique
점포면적 has 3 (1.4%) zerosZeros
점포사용면적 has 3 (1.4%) zerosZeros

Reproduction

Analysis started2023-12-12 15:49:15.069548
Analysis finished2023-12-12 15:49:16.572395
Duration1.5 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일렬번호
Real number (ℝ)

UNIQUE 

Distinct218
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2814.5
Minimum2706
Maximum2923
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-13T00:49:16.672484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2706
5-th percentile2716.85
Q12760.25
median2814.5
Q32868.75
95-th percentile2912.15
Maximum2923
Range217
Interquartile range (IQR)108.5

Descriptive statistics

Standard deviation63.075352
Coefficient of variation (CV)0.022410855
Kurtosis-1.2
Mean2814.5
Median Absolute Deviation (MAD)54.5
Skewness0
Sum613561
Variance3978.5
MonotonicityStrictly increasing
2023-12-13T00:49:16.852581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2706 1
 
0.5%
2856 1
 
0.5%
2845 1
 
0.5%
2846 1
 
0.5%
2847 1
 
0.5%
2848 1
 
0.5%
2849 1
 
0.5%
2850 1
 
0.5%
2851 1
 
0.5%
2852 1
 
0.5%
Other values (208) 208
95.4%
ValueCountFrequency (%)
2706 1
0.5%
2707 1
0.5%
2708 1
0.5%
2709 1
0.5%
2710 1
0.5%
2711 1
0.5%
2712 1
0.5%
2713 1
0.5%
2714 1
0.5%
2715 1
0.5%
ValueCountFrequency (%)
2923 1
0.5%
2922 1
0.5%
2921 1
0.5%
2920 1
0.5%
2919 1
0.5%
2918 1
0.5%
2917 1
0.5%
2916 1
0.5%
2915 1
0.5%
2914 1
0.5%

점포이름
Text

UNIQUE 

Distinct218
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-13T00:49:17.453779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.0183486
Min length2

Characters and Unicode

Total characters658
Distinct characters24
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique218 ?
Unique (%)100.0%

Sample

1st row가1
2nd row가2
3rd row가3
4th row가4
5th row가5
ValueCountFrequency (%)
가1 1
 
0.5%
나40 1
 
0.5%
나30 1
 
0.5%
나53 1
 
0.5%
나31 1
 
0.5%
나32 1
 
0.5%
나33 1
 
0.5%
나34 1
 
0.5%
나35 1
 
0.5%
나36 1
 
0.5%
Other values (208) 208
95.4%
2023-12-13T00:49:18.188475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
108
16.4%
106
16.1%
1 55
8.4%
6 43
 
6.5%
4 43
 
6.5%
2 42
 
6.4%
3 42
 
6.4%
5 41
 
6.2%
9 41
 
6.2%
7 41
 
6.2%
Other values (14) 96
14.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 419
63.7%
Other Letter 227
34.5%
Dash Punctuation 6
 
0.9%
Uppercase Letter 6
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
108
47.6%
106
46.7%
3
 
1.3%
2
 
0.9%
2
 
0.9%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 55
13.1%
6 43
10.3%
4 43
10.3%
2 42
10.0%
3 42
10.0%
5 41
9.8%
9 41
9.8%
7 41
9.8%
8 40
9.5%
0 31
7.4%
Uppercase Letter
ValueCountFrequency (%)
B 3
50.0%
A 3
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 425
64.6%
Hangul 227
34.5%
Latin 6
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
108
47.6%
106
46.7%
3
 
1.3%
2
 
0.9%
2
 
0.9%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
Common
ValueCountFrequency (%)
1 55
12.9%
6 43
10.1%
4 43
10.1%
2 42
9.9%
3 42
9.9%
5 41
9.6%
9 41
9.6%
7 41
9.6%
8 40
9.4%
0 31
7.3%
Latin
ValueCountFrequency (%)
B 3
50.0%
A 3
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 431
65.5%
Hangul 227
34.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
108
47.6%
106
46.7%
3
 
1.3%
2
 
0.9%
2
 
0.9%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
ASCII
ValueCountFrequency (%)
1 55
12.8%
6 43
10.0%
4 43
10.0%
2 42
9.7%
3 42
9.7%
5 41
9.5%
9 41
9.5%
7 41
9.5%
8 40
9.3%
0 31
7.2%
Other values (3) 12
 
2.8%

점포면적
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct12
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.192661
Minimum0
Maximum159
Zeros3
Zeros (%)1.4%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-13T00:49:18.380903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile11
Q111
median11
Q311
95-th percentile22
Maximum159
Range159
Interquartile range (IQR)0

Descriptive statistics

Standard deviation10.87548
Coefficient of variation (CV)0.82435839
Kurtosis150.16713
Mean13.192661
Median Absolute Deviation (MAD)0
Skewness11.277266
Sum2876
Variance118.27607
MonotonicityNot monotonic
2023-12-13T00:49:18.546279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
11 176
80.7%
22 27
 
12.4%
23 4
 
1.8%
0 3
 
1.4%
18 1
 
0.5%
6 1
 
0.5%
7 1
 
0.5%
17 1
 
0.5%
5 1
 
0.5%
32 1
 
0.5%
Other values (2) 2
 
0.9%
ValueCountFrequency (%)
0 3
 
1.4%
5 1
 
0.5%
6 1
 
0.5%
7 1
 
0.5%
10 1
 
0.5%
11 176
80.7%
17 1
 
0.5%
18 1
 
0.5%
22 27
 
12.4%
23 4
 
1.8%
ValueCountFrequency (%)
159 1
 
0.5%
32 1
 
0.5%
23 4
 
1.8%
22 27
 
12.4%
18 1
 
0.5%
17 1
 
0.5%
11 176
80.7%
10 1
 
0.5%
7 1
 
0.5%
6 1
 
0.5%

점포주소
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
대전시 동구 중앙로 지하 200(중동)
218 

Length

Max length21
Median length21
Mean length21
Min length21

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대전시 동구 중앙로 지하 200(중동)
2nd row대전시 동구 중앙로 지하 200(중동)
3rd row대전시 동구 중앙로 지하 200(중동)
4th row대전시 동구 중앙로 지하 200(중동)
5th row대전시 동구 중앙로 지하 200(중동)

Common Values

ValueCountFrequency (%)
대전시 동구 중앙로 지하 200(중동) 218
100.0%

Length

2023-12-13T00:49:18.724699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:49:19.217127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대전시 218
20.0%
동구 218
20.0%
중앙로 218
20.0%
지하 218
20.0%
200(중동 218
20.0%

건물총면적
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
6563
218 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row6563
2nd row6563
3rd row6563
4th row6563
5th row6563

Common Values

ValueCountFrequency (%)
6563 218
100.0%

Length

2023-12-13T00:49:19.343644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:49:19.446426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6563 218
100.0%

점포사용면적
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct12
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.192661
Minimum0
Maximum159
Zeros3
Zeros (%)1.4%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-13T00:49:19.559031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile11
Q111
median11
Q311
95-th percentile22
Maximum159
Range159
Interquartile range (IQR)0

Descriptive statistics

Standard deviation10.87548
Coefficient of variation (CV)0.82435839
Kurtosis150.16713
Mean13.192661
Median Absolute Deviation (MAD)0
Skewness11.277266
Sum2876
Variance118.27607
MonotonicityNot monotonic
2023-12-13T00:49:19.713849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
11 176
80.7%
22 27
 
12.4%
23 4
 
1.8%
0 3
 
1.4%
18 1
 
0.5%
6 1
 
0.5%
7 1
 
0.5%
17 1
 
0.5%
5 1
 
0.5%
32 1
 
0.5%
Other values (2) 2
 
0.9%
ValueCountFrequency (%)
0 3
 
1.4%
5 1
 
0.5%
6 1
 
0.5%
7 1
 
0.5%
10 1
 
0.5%
11 176
80.7%
17 1
 
0.5%
18 1
 
0.5%
22 27
 
12.4%
23 4
 
1.8%
ValueCountFrequency (%)
159 1
 
0.5%
32 1
 
0.5%
23 4
 
1.8%
22 27
 
12.4%
18 1
 
0.5%
17 1
 
0.5%
11 176
80.7%
10 1
 
0.5%
7 1
 
0.5%
6 1
 
0.5%

사용여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size350.0 B
True
216 
False
 
2
ValueCountFrequency (%)
True 216
99.1%
False 2
 
0.9%
2023-12-13T00:49:19.894388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-13T00:49:16.012254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:49:15.292765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:49:15.638886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:49:16.108094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:49:15.393879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:49:15.754115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:49:16.239911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:49:15.513009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:49:15.887925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:49:19.992094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일렬번호점포면적점포사용면적사용여부
일렬번호1.0000.0000.0000.270
점포면적0.0001.0001.0000.000
점포사용면적0.0001.0001.0000.000
사용여부0.2700.0000.0001.000
2023-12-13T00:49:20.112737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일렬번호점포면적점포사용면적사용여부
일렬번호1.0000.0010.0010.203
점포면적0.0011.0001.0000.000
점포사용면적0.0011.0001.0000.000
사용여부0.2030.0000.0001.000

Missing values

2023-12-13T00:49:16.393329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:49:16.525298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일렬번호점포이름점포면적점포주소건물총면적점포사용면적사용여부
02706가122대전시 동구 중앙로 지하 200(중동)656322Y
12707가222대전시 동구 중앙로 지하 200(중동)656322Y
22708가322대전시 동구 중앙로 지하 200(중동)656322Y
32709가411대전시 동구 중앙로 지하 200(중동)656311Y
42710가511대전시 동구 중앙로 지하 200(중동)656311Y
52711가611대전시 동구 중앙로 지하 200(중동)656311Y
62712가711대전시 동구 중앙로 지하 200(중동)656311Y
72713가811대전시 동구 중앙로 지하 200(중동)656311Y
82714가911대전시 동구 중앙로 지하 200(중동)656311Y
92715가1011대전시 동구 중앙로 지하 200(중동)656311Y
일렬번호점포이름점포면적점포주소건물총면적점포사용면적사용여부
2082914나10011대전시 동구 중앙로 지하 200(중동)656311Y
2092915나10111대전시 동구 중앙로 지하 200(중동)656311Y
2102916나10222대전시 동구 중앙로 지하 200(중동)656322Y
2112917나10322대전시 동구 중앙로 지하 200(중동)656322Y
2122918나10417대전시 동구 중앙로 지하 200(중동)656317Y
2132919나특5대전시 동구 중앙로 지하 200(중동)65635Y
2142920다특32대전시 동구 중앙로 지하 200(중동)656332Y
2152921현금출금기0대전시 동구 중앙로 지하 200(중동)65630Y
2162922마1호10대전시 동구 중앙로 지하 200(중동)656310Y
2172923바1호159대전시 동구 중앙로 지하 200(중동)6563159Y