Overview

Dataset statistics

Number of variables5
Number of observations8454
Missing cells60
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory338.6 KiB
Average record size in memory41.0 B

Variable types

Numeric1
Text2
Categorical2

Dataset

Description전북특별자치도 전주시 관내 통신판매업 현황(업소명, 소재지, 취급품목, 데이터기준일) 등의 정보를 제공합니다.
Author전북특별자치도 전주시
URLhttps://www.data.go.kr/data/15126587/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 소재지High correlation
소재지 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 11:22:38.568840
Analysis finished2024-03-14 11:22:40.230184
Duration1.66 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct8454
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4227.5
Minimum1
Maximum8454
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size74.4 KiB
2024-03-14T20:22:40.444572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile423.65
Q12114.25
median4227.5
Q36340.75
95-th percentile8031.35
Maximum8454
Range8453
Interquartile range (IQR)4226.5

Descriptive statistics

Standard deviation2440.6039
Coefficient of variation (CV)0.57731613
Kurtosis-1.2
Mean4227.5
Median Absolute Deviation (MAD)2113.5
Skewness0
Sum35739285
Variance5956547.5
MonotonicityStrictly increasing
2024-03-14T20:22:40.772818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
5633 1
 
< 0.1%
5647 1
 
< 0.1%
5646 1
 
< 0.1%
5645 1
 
< 0.1%
5644 1
 
< 0.1%
5643 1
 
< 0.1%
5642 1
 
< 0.1%
5641 1
 
< 0.1%
5640 1
 
< 0.1%
Other values (8444) 8444
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
8454 1
< 0.1%
8453 1
< 0.1%
8452 1
< 0.1%
8451 1
< 0.1%
8450 1
< 0.1%
8449 1
< 0.1%
8448 1
< 0.1%
8447 1
< 0.1%
8446 1
< 0.1%
8445 1
< 0.1%
Distinct8326
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size66.2 KiB
2024-03-14T20:22:41.864122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length35
Mean length6.7840076
Min length1

Characters and Unicode

Total characters57352
Distinct characters1085
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8213 ?
Unique (%)97.1%

Sample

1st row월드와이드트레이드
2nd row피부의시간
3rd row활짝
4th row북숭앗빛
5th row이사야몰
ValueCountFrequency (%)
주식회사 591
 
5.4%
유한회사 205
 
1.9%
농업회사법인 57
 
0.5%
35
 
0.3%
인셀덤 34
 
0.3%
전주점 26
 
0.2%
전주 26
 
0.2%
20
 
0.2%
ltd 20
 
0.2%
co 19
 
0.2%
Other values (9210) 9974
90.6%
2024-03-14T20:22:43.517258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2589
 
4.5%
1747
 
3.0%
1533
 
2.7%
1304
 
2.3%
1279
 
2.2%
) 1202
 
2.1%
( 1195
 
2.1%
972
 
1.7%
939
 
1.6%
712
 
1.2%
Other values (1075) 43880
76.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45028
78.5%
Lowercase Letter 3552
 
6.2%
Uppercase Letter 3080
 
5.4%
Space Separator 2589
 
4.5%
Close Punctuation 1204
 
2.1%
Open Punctuation 1197
 
2.1%
Decimal Number 473
 
0.8%
Other Punctuation 194
 
0.3%
Other Symbol 19
 
< 0.1%
Connector Punctuation 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1747
 
3.9%
1533
 
3.4%
1304
 
2.9%
1279
 
2.8%
972
 
2.2%
939
 
2.1%
712
 
1.6%
649
 
1.4%
587
 
1.3%
562
 
1.2%
Other values (997) 34744
77.2%
Lowercase Letter
ValueCountFrequency (%)
e 427
12.0%
o 382
 
10.8%
a 289
 
8.1%
n 263
 
7.4%
i 262
 
7.4%
r 221
 
6.2%
t 217
 
6.1%
l 199
 
5.6%
s 168
 
4.7%
m 145
 
4.1%
Other values (16) 979
27.6%
Uppercase Letter
ValueCountFrequency (%)
O 245
 
8.0%
A 232
 
7.5%
E 216
 
7.0%
S 207
 
6.7%
L 187
 
6.1%
N 168
 
5.5%
I 167
 
5.4%
T 165
 
5.4%
C 163
 
5.3%
M 151
 
4.9%
Other values (16) 1179
38.3%
Decimal Number
ValueCountFrequency (%)
1 102
21.6%
2 76
16.1%
3 59
12.5%
0 52
11.0%
5 47
9.9%
6 30
 
6.3%
4 29
 
6.1%
9 28
 
5.9%
8 26
 
5.5%
7 24
 
5.1%
Other Punctuation
ValueCountFrequency (%)
. 114
58.8%
& 51
26.3%
' 11
 
5.7%
: 9
 
4.6%
! 4
 
2.1%
" 2
 
1.0%
# 2
 
1.0%
1
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 1202
99.8%
] 2
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 1195
99.8%
[ 2
 
0.2%
Space Separator
ValueCountFrequency (%)
2589
100.0%
Other Symbol
ValueCountFrequency (%)
19
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 15
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45029
78.5%
Latin 6632
 
11.6%
Common 5673
 
9.9%
Han 18
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1747
 
3.9%
1533
 
3.4%
1304
 
2.9%
1279
 
2.8%
972
 
2.2%
939
 
2.1%
712
 
1.6%
649
 
1.4%
587
 
1.3%
562
 
1.2%
Other values (982) 34745
77.2%
Latin
ValueCountFrequency (%)
e 427
 
6.4%
o 382
 
5.8%
a 289
 
4.4%
n 263
 
4.0%
i 262
 
4.0%
O 245
 
3.7%
A 232
 
3.5%
r 221
 
3.3%
t 217
 
3.3%
E 216
 
3.3%
Other values (42) 3878
58.5%
Common
ValueCountFrequency (%)
2589
45.6%
) 1202
21.2%
( 1195
21.1%
. 114
 
2.0%
1 102
 
1.8%
2 76
 
1.3%
3 59
 
1.0%
0 52
 
0.9%
& 51
 
0.9%
5 47
 
0.8%
Other values (15) 186
 
3.3%
Han
ValueCountFrequency (%)
2
 
11.1%
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (6) 6
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45010
78.5%
ASCII 12304
 
21.5%
None 20
 
< 0.1%
CJK 18
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2589
21.0%
) 1202
 
9.8%
( 1195
 
9.7%
e 427
 
3.5%
o 382
 
3.1%
a 289
 
2.3%
n 263
 
2.1%
i 262
 
2.1%
O 245
 
2.0%
A 232
 
1.9%
Other values (66) 5218
42.4%
Hangul
ValueCountFrequency (%)
1747
 
3.9%
1533
 
3.4%
1304
 
2.9%
1279
 
2.8%
972
 
2.2%
939
 
2.1%
712
 
1.6%
649
 
1.4%
587
 
1.3%
562
 
1.2%
Other values (981) 34726
77.2%
None
ValueCountFrequency (%)
19
95.0%
1
 
5.0%
CJK
ValueCountFrequency (%)
2
 
11.1%
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (6) 6
33.3%

소재지
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size66.2 KiB
전북특별자치도 전주시 완산구
4734 
전북특별자치도 전주시 덕진구
3720 

Length

Max length15
Median length15
Mean length15
Min length15

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전북특별자치도 전주시 덕진구
2nd row전북특별자치도 전주시 덕진구
3rd row전북특별자치도 전주시 덕진구
4th row전북특별자치도 전주시 덕진구
5th row전북특별자치도 전주시 덕진구

Common Values

ValueCountFrequency (%)
전북특별자치도 전주시 완산구 4734
56.0%
전북특별자치도 전주시 덕진구 3720
44.0%

Length

2024-03-14T20:22:43.914348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:22:44.227158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전북특별자치도 8454
33.3%
전주시 8454
33.3%
완산구 4734
18.7%
덕진구 3720
14.7%
Distinct444
Distinct (%)5.3%
Missing60
Missing (%)0.7%
Memory size66.2 KiB
2024-03-14T20:22:44.814382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length83
Mean length8.4061234
Min length2

Characters and Unicode

Total characters70561
Distinct characters56
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique292 ?
Unique (%)3.5%

Sample

1st row종합몰
2nd row종합몰
3rd row종합몰
4th row의류/패션/잡화/뷰티
5th row가구/수납용품
ValueCountFrequency (%)
종합몰 3147
26.7%
의류/패션/잡화/뷰티 2396
20.3%
기타 2203
18.7%
건강/식품 1314
11.1%
교육/도서/완구/오락 589
 
5.0%
가구/수납용품 456
 
3.9%
컴퓨터/사무용품 427
 
3.6%
레져/여행/공연 397
 
3.4%
가전 395
 
3.4%
자동차/자동차용품 287
 
2.4%
Other values (5) 177
 
1.5%
2024-03-14T20:22:45.771803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 12286
 
17.4%
3394
 
4.8%
3147
 
4.5%
3147
 
4.5%
3147
 
4.5%
2658
 
3.8%
2396
 
3.4%
2396
 
3.4%
2396
 
3.4%
2396
 
3.4%
Other values (46) 33198
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 54857
77.7%
Other Punctuation 12286
 
17.4%
Space Separator 3394
 
4.8%
Decimal Number 24
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3147
 
5.7%
3147
 
5.7%
3147
 
5.7%
2658
 
4.8%
2396
 
4.4%
2396
 
4.4%
2396
 
4.4%
2396
 
4.4%
2396
 
4.4%
2396
 
4.4%
Other values (38) 28382
51.7%
Decimal Number
ValueCountFrequency (%)
2 10
41.7%
0 8
33.3%
1 2
 
8.3%
8 2
 
8.3%
7 1
 
4.2%
5 1
 
4.2%
Other Punctuation
ValueCountFrequency (%)
/ 12286
100.0%
Space Separator
ValueCountFrequency (%)
3394
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 54857
77.7%
Common 15704
 
22.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3147
 
5.7%
3147
 
5.7%
3147
 
5.7%
2658
 
4.8%
2396
 
4.4%
2396
 
4.4%
2396
 
4.4%
2396
 
4.4%
2396
 
4.4%
2396
 
4.4%
Other values (38) 28382
51.7%
Common
ValueCountFrequency (%)
/ 12286
78.2%
3394
 
21.6%
2 10
 
0.1%
0 8
 
0.1%
1 2
 
< 0.1%
8 2
 
< 0.1%
7 1
 
< 0.1%
5 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 54857
77.7%
ASCII 15704
 
22.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 12286
78.2%
3394
 
21.6%
2 10
 
0.1%
0 8
 
0.1%
1 2
 
< 0.1%
8 2
 
< 0.1%
7 1
 
< 0.1%
5 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
3147
 
5.7%
3147
 
5.7%
3147
 
5.7%
2658
 
4.8%
2396
 
4.4%
2396
 
4.4%
2396
 
4.4%
2396
 
4.4%
2396
 
4.4%
2396
 
4.4%
Other values (38) 28382
51.7%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size66.2 KiB
2024-02-06
8454 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-02-06
2nd row2024-02-06
3rd row2024-02-06
4th row2024-02-06
5th row2024-02-06

Common Values

ValueCountFrequency (%)
2024-02-06 8454
100.0%

Length

2024-03-14T20:22:46.156494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:22:46.465845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-02-06 8454
100.0%

Interactions

2024-03-14T20:22:39.443605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T20:22:46.647973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소재지
연번1.0000.997
소재지0.9971.000
2024-03-14T20:22:46.792196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소재지
연번1.0000.950
소재지0.9501.000

Missing values

2024-03-14T20:22:39.773113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T20:22:40.091406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업소명소재지취급품목데이터기준일자
01월드와이드트레이드전북특별자치도 전주시 덕진구종합몰2024-02-06
12피부의시간전북특별자치도 전주시 덕진구종합몰2024-02-06
23활짝전북특별자치도 전주시 덕진구종합몰2024-02-06
34북숭앗빛전북특별자치도 전주시 덕진구의류/패션/잡화/뷰티2024-02-06
45이사야몰전북특별자치도 전주시 덕진구가구/수납용품2024-02-06
56투투코퍼레이션전북특별자치도 전주시 덕진구종합몰 교육/도서/완구/오락2024-02-06
67윈도우마켓 전주전북특별자치도 전주시 덕진구종합몰2024-02-06
78주니또전북특별자치도 전주시 덕진구종합몰2024-02-06
89전주덕진두처리두치전북특별자치도 전주시 덕진구종합몰2024-02-06
910은솔컴퍼니전북특별자치도 전주시 덕진구종합몰2024-02-06
연번업소명소재지취급품목데이터기준일자
84448445INDIZONE전북특별자치도 전주시 완산구종합몰2024-02-06
84458446(유)다솜전북특별자치도 전주시 완산구교육/도서/완구/오락 건강/식품2024-02-06
84468447진솔텔레콤전북특별자치도 전주시 완산구<NA>2024-02-06
84478448뽀삐세상전북특별자치도 전주시 완산구건강/식품2024-02-06
84488449헌책바다전북특별자치도 전주시 완산구<NA>2024-02-06
84498450(주)샘 코스메틱전북특별자치도 전주시 완산구기타2024-02-06
84508451프로텔전북특별자치도 전주시 완산구<NA>2024-02-06
84518452안나갤러리전북특별자치도 전주시 완산구<NA>2024-02-06
84528453㈜인포피아전북특별자치도 전주시 완산구<NA>2024-02-06
84538454ICON52전북특별자치도 전주시 완산구<NA>2024-02-06