Overview

Dataset statistics

Number of variables: 6
Number of observations: 35
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.8 KiB
Average record size in memory: 53.8 B

Variable types

Text: 3
Categorical: 1
Numeric: 2

Dataset

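Description: Data on large-scale retail stores in Gyeongsangbuk-do (경상북도). It provides each store's corporate name (법인명), trade name (상호), business type (업태), location (소재지), gross building floor area (건물 연면적), and sales floor area (영업장 면적).
Author: Gyeongsangbuk-do (경상북도)
URL: https://www.data.go.kr/data/15044788/fileData.do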

Alerts

건물 연면적(제곱미터) is highly overall correlated with 영업장 면적(제곱미터) [High correlation]
영업장 면적(제곱미터) is highly overall correlated with 건물 연면적(제곱미터) [High correlation]
상호 has unique values [Unique]
영업장 면적(제곱미터) has unique values [Unique]

Reproduction

Analysis started: 2023-12-13 00:07:17.527343
Analysis finished: 2023-12-13 00:07:18.228040
Duration: 0.7 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
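
The reported duration can be checked against the two timestamps. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and are not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction panel.
started = datetime.fromisoformat("2023-12-13 00:07:17.527343")
finished = datetime.fromisoformat("2023-12-13 00:07:18.228040")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.1f} seconds")
```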

Variables

법인명
Text

Distinct: 19
Distinct (%): 54.3%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 9
Median length: 8
Mean length: 5.4857143
Min length: 3

Characters and Unicode

Total characters: 192
Distinct characters: 67
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
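
As an illustration of the character properties mentioned above, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The three sample strings are taken from the 법인명 column; the helper name `category_counts` is hypothetical, and this is not necessarily how the report itself computes its tables.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over all characters in the values."""
    counts = Counter()
    for value in values:
        for ch in value:
            # e.g. "Lo" = Other Letter, "So" = Other Symbol,
            # "Ps" = Open Punctuation, "Pe" = Close Punctuation
            counts[unicodedata.category(ch)] += 1
    return counts


sample = ["㈜이마트", "(주)이마트", "롯데쇼핑㈜"]
counts = category_counts(sample)
for category, count in counts.most_common():
    print(category, count)
```

Hangul syllables fall under "Other Letter" and the parenthesized symbol ㈜ under "Other Symbol", which matches the category names that appear in the tables of this report.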

Unique

Unique: 13
Unique (%): 37.1%

Sample

1st row: ㈜이마트
2nd row: ㈜이마트
3rd row: 롯데쇼핑㈜
4th row: 홈플러스㈜
5th row: 홈플러스스토어즈㈜
Value | Count | Frequency (%)
㈜이마트 | 7 | 20.0%
홈플러스㈜ | 5 | 14.3%
롯데쇼핑㈜ | 4 | 11.4%
주)이마트 | 2 | 5.7%
㈜이랜드리테일 | 2 | 5.7%
홈플러스주식회사 | 2 | 5.7%
㈜농협하나로유통 | 1 | 2.9%
농협중앙회 | 1 | 2.9%
안동봉화축협 | 1 | 2.9%
㈜금오유통 | 1 | 2.9%
Other values (9) | 9 | 25.7%
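
The Distinct and Unique figures for this variable can be recovered from the frequency table above. The sketch below assumes each table row corresponds to one distinct value of the column. The nine "Other values" total nine occurrences, so each must occur exactly once. Variable names are illustrative.

```python
# Counts copied from the frequency table; the nine "other" values occur once each.
value_counts = [7, 5, 4, 2, 2, 2, 1, 1, 1, 1] + [1] * 9

n_observations = sum(value_counts)               # total rows
distinct = len(value_counts)                     # number of different values
unique = sum(1 for c in value_counts if c == 1)  # values that occur exactly once

distinct_pct = round(100 * distinct / n_observations, 1)
unique_pct = round(100 * unique / n_observations, 1)
print(n_observations, distinct, distinct_pct, unique, unique_pct)
```

"Distinct" counts every different value, while "Unique" counts only the values that appear a single time, which is why the two figures differ here.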

Most occurring characters

ValueCountFrequency (%)
27
 
14.1%
13
 
6.8%
9
 
4.7%
9
 
4.7%
9
 
4.7%
8
 
4.2%
8
 
4.2%
8
 
4.2%
5
 
2.6%
4
 
2.1%
Other values (57) 92
47.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 159
82.8%
Other Symbol 27
 
14.1%
Open Punctuation 3
 
1.6%
Close Punctuation 3
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
8.2%
9
 
5.7%
9
 
5.7%
9
 
5.7%
8
 
5.0%
8
 
5.0%
8
 
5.0%
5
 
3.1%
4
 
2.5%
4
 
2.5%
Other values (54) 82
51.6%
Other Symbol
ValueCountFrequency (%)
27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 186
96.9%
Common 6
 
3.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
 
14.5%
13
 
7.0%
9
 
4.8%
9
 
4.8%
9
 
4.8%
8
 
4.3%
8
 
4.3%
8
 
4.3%
5
 
2.7%
4
 
2.2%
Other values (55) 86
46.2%
Common
ValueCountFrequency (%)
( 3
50.0%
) 3
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 159
82.8%
None 27
 
14.1%
ASCII 6
 
3.1%

Most frequent character per block

None
ValueCountFrequency (%)
27
100.0%
Hangul
ValueCountFrequency (%)
13
 
8.2%
9
 
5.7%
9
 
5.7%
9
 
5.7%
8
 
5.0%
8
 
5.0%
8
 
5.0%
5
 
3.1%
4
 
2.5%
4
 
2.5%
Other values (54) 82
51.6%
ASCII
ValueCountFrequency (%)
( 3
50.0%
) 3
50.0%

상호
Text

UNIQUE 

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 12
Median length: 10
Mean length: 7.9714286
Min length: 4

Characters and Unicode

Total characters: 279
Distinct characters: 79
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 35
Unique (%): 100.0%

Sample

1st row: ㈜이마트 포항점
2nd row: ㈜이마트 포항이동점
3rd row: 롯데마트 포항점
4th row: 홈플러스㈜ 죽도점
5th row: 홈플러스스토어즈㈜포항점
Value | Count | Frequency (%)
㈜이마트 | 4 | 6.9%
이마트 | 4 | 6.9%
포항점 | 4 | 6.9%
구미점 | 4 | 6.9%
홈플러스 | 3 | 5.2%
경산점 | 3 | 5.2%
롯데마트 | 3 | 5.2%
홈플러스㈜ | 2 | 3.4%
김천점 | 2 | 3.4%
안동점 | 2 | 3.4%
Other values (26) | 27 | 46.6%

Most occurring characters

ValueCountFrequency (%)
28
 
10.0%
23
 
8.2%
17
 
6.1%
15
 
5.4%
11
 
3.9%
10
 
3.6%
9
 
3.2%
8
 
2.9%
8
 
2.9%
8
 
2.9%
Other values (69) 142
50.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 243
87.1%
Space Separator 23
 
8.2%
Other Symbol 9
 
3.2%
Uppercase Letter 3
 
1.1%
Decimal Number 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
28
 
11.5%
17
 
7.0%
15
 
6.2%
11
 
4.5%
10
 
4.1%
8
 
3.3%
8
 
3.3%
8
 
3.3%
7
 
2.9%
7
 
2.9%
Other values (63) 124
51.0%
Uppercase Letter
ValueCountFrequency (%)
N 1
33.3%
G 1
33.3%
C 1
33.3%
Space Separator
ValueCountFrequency (%)
23
100.0%
Other Symbol
ValueCountFrequency (%)
9
100.0%
Decimal Number
ValueCountFrequency (%)
7 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 252
90.3%
Common 24
 
8.6%
Latin 3
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
28
 
11.1%
17
 
6.7%
15
 
6.0%
11
 
4.4%
10
 
4.0%
9
 
3.6%
8
 
3.2%
8
 
3.2%
8
 
3.2%
7
 
2.8%
Other values (64) 131
52.0%
Latin
ValueCountFrequency (%)
N 1
33.3%
G 1
33.3%
C 1
33.3%
Common
ValueCountFrequency (%)
23
95.8%
7 1
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 243
87.1%
ASCII 27
 
9.7%
None 9
 
3.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
28
 
11.5%
17
 
7.0%
15
 
6.2%
11
 
4.5%
10
 
4.1%
8
 
3.3%
8
 
3.3%
8
 
3.3%
7
 
2.9%
7
 
2.9%
Other values (63) 124
51.0%
ASCII
ValueCountFrequency (%)
23
85.2%
N 1
 
3.7%
G 1
 
3.7%
7 1
 
3.7%
C 1
 
3.7%
None
ValueCountFrequency (%)
9
100.0%

업태
Categorical

Distinct: 4
Distinct (%): 11.4%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 4
Median length: 4
Mean length: 3.8571429
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대형마트
2nd row: 대형마트
3rd row: 대형마트
4th row: 대형마트
5th row: 대형마트

Common Values

Value | Count | Frequency (%)
대형마트 | 25 | 71.4%
쇼핑센터 | 5 | 14.3%
백화점 | 3 | 8.6%
전문점 | 2 | 5.7%

Length

Histogram of lengths of the category

Common Values (Plot)

[Plot of the 업태 value frequencies: 대형마트 25 (71.4%), 쇼핑센터 5 (14.3%), 백화점 3 (8.6%), 전문점 2 (5.7%)]

소재지
Text

Distinct: 34
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 24
Median length: 21
Mean length: 20.457143
Min length: 15

Characters and Unicode

Total characters: 716
Distinct characters: 90
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 33
Unique (%): 94.3%

Sample

1st row: 경상북도 포항시 남구 냉천로 10(인덕동)
2nd row: 경상북도 포항시 북구 대이로 188(득량동)
3rd row: 경상북도 포항시 남구 지곡로 237(지곡동)
4th row: 경상북도 포항시 북구 중흥로 328(죽도동)
5th row: 경상북도 포항시 남구 중흥로 77(상도동)
Value | Count | Frequency (%)
경상북도 | 35 | 23.8%
포항시 | 9 | 6.1%
구미시 | 8 | 5.4%
남구 | 5 | 3.4%
경산시 | 4 | 2.7%
북구 | 4 | 2.7%
김천시 | 4 | 2.7%
중흥로 | 3 | 2.0%
안동시 | 3 | 2.0%
옥산로 | 2 | 1.4%
Other values (64) | 70 | 47.6%

Most occurring characters

ValueCountFrequency (%)
112
 
15.6%
46
 
6.4%
40
 
5.6%
39
 
5.4%
38
 
5.3%
37
 
5.2%
34
 
4.7%
34
 
4.7%
) 26
 
3.6%
( 26
 
3.6%
Other values (80) 284
39.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 461
64.4%
Space Separator 112
 
15.6%
Decimal Number 91
 
12.7%
Close Punctuation 26
 
3.6%
Open Punctuation 26
 
3.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
 
10.0%
40
 
8.7%
39
 
8.5%
38
 
8.2%
37
 
8.0%
34
 
7.4%
34
 
7.4%
21
 
4.6%
11
 
2.4%
11
 
2.4%
Other values (67) 150
32.5%
Decimal Number
ValueCountFrequency (%)
2 20
22.0%
1 16
17.6%
7 15
16.5%
8 10
11.0%
6 7
 
7.7%
0 6
 
6.6%
5 5
 
5.5%
3 5
 
5.5%
4 4
 
4.4%
9 3
 
3.3%
Space Separator
ValueCountFrequency (%)
112
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 461
64.4%
Common 255
35.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
 
10.0%
40
 
8.7%
39
 
8.5%
38
 
8.2%
37
 
8.0%
34
 
7.4%
34
 
7.4%
21
 
4.6%
11
 
2.4%
11
 
2.4%
Other values (67) 150
32.5%
Common
ValueCountFrequency (%)
112
43.9%
) 26
 
10.2%
( 26
 
10.2%
2 20
 
7.8%
1 16
 
6.3%
7 15
 
5.9%
8 10
 
3.9%
6 7
 
2.7%
0 6
 
2.4%
5 5
 
2.0%
Other values (3) 12
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 461
64.4%
ASCII 255
35.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
112
43.9%
) 26
 
10.2%
( 26
 
10.2%
2 20
 
7.8%
1 16
 
6.3%
7 15
 
5.9%
8 10
 
3.9%
6 7
 
2.7%
0 6
 
2.4%
5 5
 
2.0%
Other values (3) 12
 
4.7%
Hangul
ValueCountFrequency (%)
46
 
10.0%
40
 
8.7%
39
 
8.5%
38
 
8.2%
37
 
8.0%
34
 
7.4%
34
 
7.4%
21
 
4.6%
11
 
2.4%
11
 
2.4%
Other values (67) 150
32.5%

건물 연면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 34
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 28621.029
Minimum: 5497
Maximum: 72068
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 5497
5th percentile: 6676.7
Q1: 10538.5
Median: 22349
Q3: 43437
95th percentile: 69642.5
Maximum: 72068
Range: 66571
Interquartile range (IQR): 32898.5

Descriptive statistics

Standard deviation: 20303.368
Coefficient of variation (CV): 0.70938637
Kurtosis: -0.52268702
Mean: 28621.029
Median Absolute Deviation (MAD): 13529
Skewness: 0.7704542
Sum: 1001736
Variance: 4.1222673 × 10^8
Monotonicity: Not monotonic
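
Several of the statistics above are tied together by simple identities. The sketch below recomputes them from other figures in the report for 건물 연면적(제곱미터). The variable names are illustrative, and small differences from the printed values come from rounding in the report.

```python
# Figures copied from the report for 건물 연면적(제곱미터).
n = 35
total = 1001736
minimum, maximum = 5497, 72068
q1, q3 = 10538.5, 43437
std = 20303.368

mean = total / n                 # arithmetic mean = sum / number of observations
value_range = maximum - minimum  # range = maximum - minimum
iqr = q3 - q1                    # interquartile range = Q3 - Q1
cv = std / mean                  # coefficient of variation = std / mean
variance = std ** 2              # variance = squared standard deviation

print(f"mean={mean:.3f} range={value_range} iqr={iqr} cv={cv:.8f} variance={variance:.7e}")
```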
Histogram with fixed size bins (bins=34)
Common values

Value | Count | Frequency (%)
72068 | 2 | 5.7%
26476 | 1 | 2.9%
16562 | 1 | 2.9%
9912 | 1 | 2.9%
15819 | 1 | 2.9%
28490 | 1 | 2.9%
51303 | 1 | 2.9%
49474 | 1 | 2.9%
21552 | 1 | 2.9%
68603 | 1 | 2.9%
Other values (24) | 24 | 68.6%

Minimum 10 values

Value | Count | Frequency (%)
5497 | 1 | 2.9%
5941 | 1 | 2.9%
6992 | 1 | 2.9%
7499 | 1 | 2.9%
8358 | 1 | 2.9%
8820 | 1 | 2.9%
8834 | 1 | 2.9%
9912 | 1 | 2.9%
9949 | 1 | 2.9%
11128 | 1 | 2.9%

Maximum 10 values

Value | Count | Frequency (%)
72068 | 2 | 5.7%
68603 | 1 | 2.9%
58064 | 1 | 2.9%
51306 | 1 | 2.9%
51303 | 1 | 2.9%
49474 | 1 | 2.9%
48767 | 1 | 2.9%
44937 | 1 | 2.9%
41937 | 1 | 2.9%
41057 | 1 | 2.9%

영업장 면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14395.971
Minimum: 2923
Maximum: 39404
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 2923
5th percentile: 3770.3
Q1: 8315.5
Median: 12739
Q3: 20802.5
95th percentile: 26771.5
Maximum: 39404
Range: 36481
Interquartile range (IQR): 12487

Descriptive statistics

Standard deviation: 8125.7556
Coefficient of variation (CV): 0.56444649
Kurtosis: 1.0830139
Mean: 14395.971
Median Absolute Deviation (MAD): 6572
Skewness: 0.8458154
Sum: 503859
Variance: 66027904
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=35)
Common values

Value | Count | Frequency (%)
21542 | 1 | 2.9%
16914 | 1 | 2.9%
8273 | 1 | 2.9%
13057 | 1 | 2.9%
21823 | 1 | 2.9%
11792 | 1 | 2.9%
15549 | 1 | 2.9%
21354 | 1 | 2.9%
2923 | 1 | 2.9%
39404 | 1 | 2.9%
Other values (25) | 25 | 71.4%

Minimum 10 values

Value | Count | Frequency (%)
2923 | 1 | 2.9%
3043 | 1 | 2.9%
4082 | 1 | 2.9%
4774 | 1 | 2.9%
5311 | 1 | 2.9%
5507 | 1 | 2.9%
6167 | 1 | 2.9%
8080 | 1 | 2.9%
8273 | 1 | 2.9%
8358 | 1 | 2.9%

Maximum 10 values

Value | Count | Frequency (%)
39404 | 1 | 2.9%
28259 | 1 | 2.9%
26134 | 1 | 2.9%
22050 | 1 | 2.9%
21823 | 1 | 2.9%
21542 | 1 | 2.9%
21354 | 1 | 2.9%
21096 | 1 | 2.9%
20873 | 1 | 2.9%
20732 | 1 | 2.9%
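
The 5th percentile, Q1 and 95th percentile reported for 영업장 면적(제곱미터) appear consistent with linear interpolation between neighbouring order statistics, although the report does not state which method it uses. The sketch below rebuilds those three quantiles from the ten smallest and ten largest values listed above. The helper `percentile_from_order_stats` is a hypothetical name introduced for illustration.

```python
import math


def percentile_from_order_stats(order_stats, n, q):
    """Linearly interpolate the q-quantile from known 0-based order statistics."""
    pos = q * (n - 1)   # fractional position in the sorted data
    lo = math.floor(pos)
    hi = math.ceil(pos)
    frac = pos - lo
    return order_stats[lo] + frac * (order_stats[hi] - order_stats[lo])


n = 35
smallest = [2923, 3043, 4082, 4774, 5311, 5507, 6167, 8080, 8273, 8358]
largest = [39404, 28259, 26134, 22050, 21823, 21542, 21354, 21096, 20873, 20732]

# Map sorted positions to values: 0..9 from the bottom, 34..25 from the top.
order_stats = {i: v for i, v in enumerate(smallest)}
order_stats.update({n - 1 - i: v for i, v in enumerate(largest)})

p05 = percentile_from_order_stats(order_stats, n, 0.05)
q1 = percentile_from_order_stats(order_stats, n, 0.25)
p95 = percentile_from_order_stats(order_stats, n, 0.95)
print(round(p05, 1), round(q1, 1), round(p95, 1))
```

Only positions covered by the two extreme-value lists can be evaluated this way; the median and Q3 fall among values the report does not list in sorted order.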

Interactions

[Four interaction plots for the pairings of 건물 연면적(제곱미터) and 영업장 면적(제곱미터)]

Correlations

Variable | 법인명 | 상호 | 업태 | 소재지 | 건물 연면적(제곱미터) | 영업장 면적(제곱미터)
법인명 | 1.000 | 1.000 | 0.924 | 0.534 | 0.205 | 0.000
상호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
업태 | 0.924 | 1.000 | 1.000 | 0.798 | 0.000 | 0.442
소재지 | 0.534 | 1.000 | 0.798 | 1.000 | 1.000 | 0.943
건물 연면적(제곱미터) | 0.205 | 1.000 | 0.000 | 1.000 | 1.000 | 0.698
영업장 면적(제곱미터) | 0.000 | 1.000 | 0.442 | 0.943 | 0.698 | 1.000

Variable | 건물 연면적(제곱미터) | 영업장 면적(제곱미터) | 업태
건물 연면적(제곱미터) | 1.000 | 0.739 | 0.000
영업장 면적(제곱미터) | 0.739 | 1.000 | 0.084
업태 | 0.000 | 0.084 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 법인명 | 상호 | 업태 | 소재지 | 건물 연면적(제곱미터) | 영업장 면적(제곱미터)
0 | ㈜이마트 | ㈜이마트 포항점 | 대형마트 | 경상북도 포항시 남구 냉천로 10(인덕동) | 26476 | 21542
1 | ㈜이마트 | ㈜이마트 포항이동점 | 대형마트 | 경상북도 포항시 북구 대이로 188(득량동) | 41937 | 16914
2 | 롯데쇼핑㈜ | 롯데마트 포항점 | 대형마트 | 경상북도 포항시 남구 지곡로 237(지곡동) | 5497 | 4082
3 | 홈플러스㈜ | 홈플러스㈜ 죽도점 | 대형마트 | 경상북도 포항시 북구 중흥로 328(죽도동) | 20254 | 6167
4 | 홈플러스스토어즈㈜ | 홈플러스스토어즈㈜포항점 | 대형마트 | 경상북도 포항시 남구 중흥로 77(상도동) | 72068 | 12739
5 | ㈜농협하나로유통 | 하나로마트 포항점 | 대형마트 | 경상북도 포항시 북구 장량로 171(양덕동) | 34373 | 13256
6 | 홈플러스㈜ | 홈플러스㈜경주점 | 대형마트 | 경상북도 경주시 공단로97(용강동) | 6992 | 4774
7 | (주)이마트 | 이마트 김천점 | 대형마트 | 경상북도 김천시 시청로 75(신음동) | 16091 | 15428
8 | 농협중앙회 | 하나로마트 | 대형마트 | 경상북도 김천시 자산로 147(평화동) | 8820 | 8080
9 | 롯데쇼핑㈜ | 롯데마트 김천점 | 대형마트 | 경상북도 김천시 시청로 80(신음동) | 23253 | 12397

Last rows

Index | 법인명 | 상호 | 업태 | 소재지 | 건물 연면적(제곱미터) | 영업장 면적(제곱미터)
25 | ㈜모다 | 모다아울렛 김천구미점 | 전문점 | 경상북도 김천시 아포읍 아포대로 1417 | 21552 | 21354
26 | ㈜조아프라자 | 조아아울렛 | 전문점 | 경상북도 경산시 강변동로 278 | 16562 | 2923
27 | 롯데쇼핑㈜ | 롯데백화점 포항점 | 백화점 | 경상북도 포항시 북구 학산로 62(학산동) | 68603 | 39404
28 | ㈜준성공 | ㈜준성공 | 백화점 | 경상북도 경주시 계림로101 | 7499 | 5311
29 | ㈜이랜드리테일 | 동아백화점 구미점 | 백화점 | 경상북도 구미시 송원동로 28(송정동) | 8358 | 8358
30 | (사)제철복지회 | 효자프라자 웰빙아울렛 | 쇼핑센터 | 경상북도 포항시 남구 효자로 70(대잠동) | 8834 | 8834
31 | 본아이온㈜ | 그랜드애비뉴 | 쇼핑센터 | 경상북도 포항시 남구 중흥로 77(상도동) | 72068 | 22050
32 | 보성산업㈜ | 보성산업 구미역사점 | 쇼핑센터 | 경상북도 구미시 중앙로 76(원평동) | 41057 | 19248
33 | 에이시디㈜ | 해마루밸리 | 쇼핑센터 | 경상북도 구미시 해마루공원로 21 | 11128 | 11128
34 | ㈜이랜드리테일 | NC 경산점 | 쇼핑센터 | 경상북도 경산시 옥산로 280 | 51306 | 11795