Overview

Dataset statistics

Number of variables: 6
Number of observations: 952
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 45.7 KiB
Average record size in memory: 49.1 B

Variable types

Numeric: 1
Categorical: 2
Text: 3

Dataset

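Description: Status of mail-order (online) sales businesses in Gimje-si, Jeonbuk State (전북특별자치도 김제시). The dataset includes each business's name, domain, handled product categories, and related fields.
Author: Gimje-si, Jeonbuk State (전북특별자치도 김제시)
URL: https://www.data.go.kr/data/15006756/fileData.do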

Alerts

데이터기준일자 has constant value "2024-04-12" (Constant)
업체명 is highly imbalanced (87.0%) (Imbalance)
연번 has unique values (Unique)

Reproduction

Analysis started: 2024-04-21 00:59:56.653204
Analysis finished: 2024-04-21 00:59:58.378613
Duration: 1.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
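
The reported duration can be checked from the two timestamps above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction block of this report.
started = datetime.fromisoformat("2024-04-21 00:59:56.653204")
finished = datetime.fromisoformat("2024-04-21 00:59:58.378613")

# Elapsed wall-clock time in seconds, rounded the way the report shows it.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```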

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 952
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 476.5
Minimum: 1
Maximum: 952
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 48.55
Q1: 238.75
Median: 476.5
Q3: 714.25
95-th percentile: 904.45
Maximum: 952
Range: 951
Interquartile range (IQR): 475.5

Descriptive statistics

Standard deviation: 274.96303
Coefficient of variation (CV): 0.57704728
Kurtosis: -1.2
Mean: 476.5
Median Absolute Deviation (MAD): 238
Skewness: 0
Sum: 453628
Variance: 75604.667
Monotonicity: Strictly increasing

Histogram with fixed size bins (bins=50)
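
The statistics reported for 연번 are consistent with a plain running index. The sketch below assumes the column is exactly the sequence 1..952 (suggested by 952 distinct values, minimum 1, maximum 952, strictly increasing) and recomputes several figures with the standard library. The `quantile` helper is a hypothetical illustration using linear interpolation; the report does not state which quantile method the profiler uses.

```python
import math
import statistics

# Assumption: 연번 is exactly the sequence 1..952.
values = list(range(1, 953))


def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_values) - 1)
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])


total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = math.sqrt(variance)
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)
p5 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
q3 = quantile(values, 0.75)
iqr = q3 - q1

print(total, mean, round(variance, 3), round(std_dev, 5), round(cv, 8))
print(round(p5, 2), q1, median, q3, iqr, mad)
```

Under that assumption the recomputed sum, mean, variance, quartiles, IQR, and MAD agree with the values listed above.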

Common values

Value | Count | Frequency (%)
1 | 1 | 0.1%
641 | 1 | 0.1%
629 | 1 | 0.1%
630 | 1 | 0.1%
631 | 1 | 0.1%
632 | 1 | 0.1%
633 | 1 | 0.1%
634 | 1 | 0.1%
635 | 1 | 0.1%
636 | 1 | 0.1%
Other values (942) | 942 | 98.9%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
952 | 1 | 0.1%
951 | 1 | 0.1%
950 | 1 | 0.1%
949 | 1 | 0.1%
948 | 1 | 0.1%
947 | 1 | 0.1%
946 | 1 | 0.1%
945 | 1 | 0.1%
944 | 1 | 0.1%
943 | 1 | 0.1%

업체명
Categorical

IMBALANCE 

Distinct: 44
Distinct (%): 4.6%
Missing: 0
Missing (%): 0.0%
Memory size: 7.6 KiB

Value | Count
데이터미집계 | 885
네이버스마트스토어 | 15
네이버 | 4
카페24시 | 3
옥션 | 3
Other values (39) | 42

Length

Max length: 32
Median length: 6
Mean length: 6.0997899
Min length: 2

Unique

Unique: 36
Unique (%): 3.8%

Sample

1st row: 데이터미집계
2nd row: 데이터미집계
3rd row: 데이터미집계
4th row: 데이터미집계
5th row: 데이터미집계

Common Values

Value | Count | Frequency (%)
데이터미집계 | 885 | 93.0%
네이버스마트스토어 | 15 | 1.6%
네이버 | 4 | 0.4%
카페24시 | 3 | 0.3%
옥션 | 3 | 0.3%
KTIDC | 2 | 0.2%
카페24 | 2 | 0.2%
후이즈 | 2 | 0.2%
11번가 | 1 | 0.1%
네이버 파이낸셜 | 1 | 0.1%
Other values (34) | 34 | 3.6%
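
The 87.0% imbalance figure in the Alerts section can be reproduced from these counts. The sketch below assumes the score is one minus the Shannon entropy of the class frequencies divided by its maximum, log2(k). That definition is an assumption and is not stated in this report. The count list combines the eight most frequent values above with the 36 values that occur exactly once, and `imbalance_score` is a hypothetical helper name.

```python
import math

# Counts for 업체명 read off this report: the eight most frequent values
# plus 36 values that each occur once (44 distinct values in total).
counts = [885, 15, 4, 3, 3, 2, 2, 2] + [1] * 36


def imbalance_score(class_counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits.

    Assumed definition: 0 means perfectly balanced classes,
    values near 1 mean one class dominates.
    """
    total = sum(class_counts)
    k = len(class_counts)
    if k <= 1:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in class_counts if c > 0)
    return 1.0 - entropy / math.log2(k)


score = imbalance_score(counts)
print(f"{score:.1%}")
```

The counts sum to the 952 observations, and under the assumed definition the score rounds to the percentage shown in the alert.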

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
데이터미집계 | 885 | 91.2%
네이버스마트스토어 | 15 | 1.5%
네이버 | 6 | 0.6%
옥션 | 6 | 0.6%
11번가 | 4 | 0.4%
카페24시 | 3 | 0.3%
g마켓 | 3 | 0.3%
주식회사 | 3 | 0.3%
ktidc | 2 | 0.2%
카페24 | 2 | 0.2%
Other values (39) | 41 | 4.2%

도메인명
Text

Distinct: 437
Distinct (%): 45.9%
Missing: 0
Missing (%): 0.0%
Memory size: 7.6 KiB

Length

Max length: 60
Median length: 56
Mean length: 13.266807
Min length: 2

Characters and Unicode

Total characters: 12630
Distinct characters: 239
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
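
As an illustration of those character properties, the sketch below counts characters by Unicode general category using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and the example strings are taken from the sample rows of this report. The two-letter codes correspond to the category names used in the tables (`Ll` Lowercase Letter, `Lo` Other Letter, `Po` Other Punctuation, `Ps` Open Punctuation, `Pe` Close Punctuation, `Zs` Space Separator).

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category (e.g. 'Ll', 'Lo', 'Po')."""
    return Counter(unicodedata.category(ch) for ch in text)


# Example values taken from the sample rows of this report.
for value in ["www.lowbuy.co.kr", "네이버스토어팜(the sumgim)"]:
    print(value, dict(category_counts(value)))
```

Summing such counters over every value in a column gives a per-category breakdown like the ones in the character tables of this section.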

Unique

Unique: 408
Unique (%): 42.9%

Sample

1st row: 네이버스마트스토어
2nd row: 데이터미집계
3rd row: 네이버스마트스토어
4th row: 네이버스마트스토어
5th row: 네이버스마트스토어

Value | Count | Frequency (%)
데이터미집계 | 351 | 32.7%
네이버스마트스토어 | 71 | 6.6%
쿠팡 | 38 | 3.5%
네이버 | 28 | 2.6%
스마트스토어 | 27 | 2.5%
옥션 | 23 | 2.1%
11번가 | 22 | 2.1%
지마켓 | 21 | 2.0%
네이버스토어팜 | 9 | 0.8%
(value not rendered) | 7 | 0.7%
Other values (425) | 475 | 44.3%

Most occurring characters

ValueCountFrequency (%)
. 791
 
6.3%
o 728
 
5.8%
r 596
 
4.7%
t 566
 
4.5%
w 516
 
4.1%
a 508
 
4.0%
e 491
 
3.9%
m 480
 
3.8%
478
 
3.8%
s 423
 
3.3%
Other values (229) 7053
55.8%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 6865 | 54.4%
Other Letter | 3802 | 30.1%
Other Punctuation | 1331 | 10.5%
Decimal Number | 372 | 2.9%
Space Separator | 178 | 1.4%
Uppercase Letter | 44 | 0.3%
Connector Punctuation | 21 | 0.2%
Dash Punctuation | 7 | 0.1%
Close Punctuation | 4 | < 0.1%
Open Punctuation | 4 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
478
12.6%
358
 
9.4%
353
 
9.3%
353
 
9.3%
352
 
9.3%
351
 
9.2%
228
 
6.0%
141
 
3.7%
120
 
3.2%
118
 
3.1%
Other values (159) 950
25.0%
Lowercase Letter
ValueCountFrequency (%)
o 728
 
10.6%
r 596
 
8.7%
t 566
 
8.2%
w 516
 
7.5%
a 508
 
7.4%
e 491
 
7.2%
m 480
 
7.0%
s 423
 
6.2%
c 402
 
5.9%
n 334
 
4.9%
Other values (16) 1821
26.5%
Uppercase Letter
ValueCountFrequency (%)
G 6
13.6%
U 5
11.4%
H 4
 
9.1%
R 3
 
6.8%
T 3
 
6.8%
D 3
 
6.8%
Q 2
 
4.5%
W 2
 
4.5%
P 2
 
4.5%
L 2
 
4.5%
Other values (10) 12
27.3%
Decimal Number
ValueCountFrequency (%)
1 98
26.3%
2 48
12.9%
0 46
12.4%
9 30
 
8.1%
7 29
 
7.8%
4 28
 
7.5%
3 27
 
7.3%
8 24
 
6.5%
5 24
 
6.5%
6 18
 
4.8%
Other Punctuation
ValueCountFrequency (%)
. 791
59.4%
/ 412
31.0%
: 122
 
9.2%
? 2
 
0.2%
@ 1
 
0.1%
; 1
 
0.1%
& 1
 
0.1%
# 1
 
0.1%
Space Separator
ValueCountFrequency (%)
178
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 21
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Math Symbol
ValueCountFrequency (%)
= 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 6909 | 54.7%
Hangul | 3802 | 30.1%
Common | 1919 | 15.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
478
12.6%
358
 
9.4%
353
 
9.3%
353
 
9.3%
352
 
9.3%
351
 
9.2%
228
 
6.0%
141
 
3.7%
120
 
3.2%
118
 
3.1%
Other values (159) 950
25.0%
Latin
ValueCountFrequency (%)
o 728
 
10.5%
r 596
 
8.6%
t 566
 
8.2%
w 516
 
7.5%
a 508
 
7.4%
e 491
 
7.1%
m 480
 
6.9%
s 423
 
6.1%
c 402
 
5.8%
n 334
 
4.8%
Other values (36) 1865
27.0%
Common
ValueCountFrequency (%)
. 791
41.2%
/ 412
21.5%
178
 
9.3%
: 122
 
6.4%
1 98
 
5.1%
2 48
 
2.5%
0 46
 
2.4%
9 30
 
1.6%
7 29
 
1.5%
4 28
 
1.5%
Other values (14) 137
 
7.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 8828 | 69.9%
Hangul | 3802 | 30.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 791
 
9.0%
o 728
 
8.2%
r 596
 
6.8%
t 566
 
6.4%
w 516
 
5.8%
a 508
 
5.8%
e 491
 
5.6%
m 480
 
5.4%
s 423
 
4.8%
/ 412
 
4.7%
Other values (60) 3317
37.6%
Hangul
ValueCountFrequency (%)
478
12.6%
358
 
9.4%
353
 
9.3%
353
 
9.3%
352
 
9.3%
351
 
9.2%
228
 
6.0%
141
 
3.7%
120
 
3.2%
118
 
3.1%
Other values (159) 950
25.0%

취급품목
Text

Distinct: 63
Distinct (%): 6.6%
Missing: 0
Missing (%): 0.0%
Memory size: 7.6 KiB

Length

Max length: 87
Median length: 72
Mean length: 5.9957983
Min length: 2

Characters and Unicode

Total characters: 5708
Distinct characters: 55
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 35
Unique (%): 3.7%

Sample

1st row: 의류/패션/잡화/뷰티
2nd row: 건강/식품
3rd row: 기타
4th row: 종합몰
5th row: 종합몰

Value | Count | Frequency (%)
기타 | 361 | 31.5%
건강/식품 | 267 | 23.3%
종합몰 | 214 | 18.7%
의류/패션/잡화/뷰티 | 147 | 12.8%
가구/수납용품 | 33 | 2.9%
교육/도서/완구/오락 | 29 | 2.5%
레져/여행/공연 | 23 | 2.0%
컴퓨터/사무용품 | 23 | 2.0%
자동차/자동차용품 | 18 | 1.6%
가전 | 18 | 1.6%
Other values (3) | 14 | 1.2%

Most occurring characters

ValueCountFrequency (%)
/ 918
16.1%
361
 
6.3%
361
 
6.3%
346
 
6.1%
267
 
4.7%
267
 
4.7%
267
 
4.7%
214
 
3.7%
214
 
3.7%
214
 
3.7%
Other values (45) 2279
39.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4595 | 80.5%
Other Punctuation | 918 | 16.1%
Space Separator | 195 | 3.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
361
 
7.9%
361
 
7.9%
346
 
7.5%
267
 
5.8%
267
 
5.8%
267
 
5.8%
214
 
4.7%
214
 
4.7%
214
 
4.7%
147
 
3.2%
Other values (43) 1937
42.2%
Other Punctuation
ValueCountFrequency (%)
/ 918
100.0%
Space Separator
ValueCountFrequency (%)
195
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4595 | 80.5%
Common | 1113 | 19.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
361
 
7.9%
361
 
7.9%
346
 
7.5%
267
 
5.8%
267
 
5.8%
267
 
5.8%
214
 
4.7%
214
 
4.7%
214
 
4.7%
147
 
3.2%
Other values (43) 1937
42.2%
Common
ValueCountFrequency (%)
/ 918
82.5%
195
 
17.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4595 | 80.5%
ASCII | 1113 | 19.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 918
82.5%
195
 
17.5%
Hangul
ValueCountFrequency (%)
361
 
7.9%
361
 
7.9%
346
 
7.5%
267
 
5.8%
267
 
5.8%
267
 
5.8%
214
 
4.7%
214
 
4.7%
214
 
4.7%
147
 
3.2%
Other values (43) 1937
42.2%

취급품목세부사항
Text

Distinct: 337
Distinct (%): 35.4%
Missing: 0
Missing (%): 0.0%
Memory size: 7.6 KiB

Length

Max length: 47
Median length: 6
Mean length: 5.9233193
Min length: 1

Characters and Unicode

Total characters: 5639
Distinct characters: 368
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 291
Unique (%): 30.6%

Sample

1st row: 데이터미집계
2nd row: 데이터미집계
3rd row: 인쇄물 종이 디자인 등
4th row: 데이터미집계
5th row: 데이터미집계

Value | Count | Frequency (%)
데이터미집계 | 480 | 40.2%
농산물 | 72 | 6.0%
(value not rendered) | 12 | 1.0%
(value not rendered) | 11 | 0.9%
(value not rendered) | 10 | 0.8%
다육식물 | 10 | 0.8%
식물 | 9 | 0.8%
잡곡 | 7 | 0.6%
종합몰 | 6 | 0.5%
생활용품 | 6 | 0.5%
Other values (438) | 570 | 47.8%

Most occurring characters

ValueCountFrequency (%)
502
 
8.9%
494
 
8.8%
491
 
8.7%
488
 
8.7%
482
 
8.5%
480
 
8.5%
367
 
6.5%
145
 
2.6%
108
 
1.9%
108
 
1.9%
Other values (358) 1974
35.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5166 | 91.6%
Space Separator | 367 | 6.5%
Open Punctuation | 31 | 0.5%
Close Punctuation | 31 | 0.5%
Decimal Number | 16 | 0.3%
Other Punctuation | 15 | 0.3%
Lowercase Letter | 13 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
502
 
9.7%
494
 
9.6%
491
 
9.5%
488
 
9.4%
482
 
9.3%
480
 
9.3%
145
 
2.8%
108
 
2.1%
108
 
2.1%
102
 
2.0%
Other values (336) 1766
34.2%
Lowercase Letter
ValueCountFrequency (%)
p 4
30.8%
a 2
15.4%
i 2
15.4%
n 1
 
7.7%
o 1
 
7.7%
t 1
 
7.7%
c 1
 
7.7%
l 1
 
7.7%
Decimal Number
ValueCountFrequency (%)
2 6
37.5%
0 4
25.0%
9 2
 
12.5%
5 1
 
6.2%
7 1
 
6.2%
1 1
 
6.2%
4 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
. 11
73.3%
/ 2
 
13.3%
& 1
 
6.7%
: 1
 
6.7%
Space Separator
ValueCountFrequency (%)
367
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5166 | 91.6%
Common | 460 | 8.2%
Latin | 13 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
502
 
9.7%
494
 
9.6%
491
 
9.5%
488
 
9.4%
482
 
9.3%
480
 
9.3%
145
 
2.8%
108
 
2.1%
108
 
2.1%
102
 
2.0%
Other values (336) 1766
34.2%
Common
ValueCountFrequency (%)
367
79.8%
( 31
 
6.7%
) 31
 
6.7%
. 11
 
2.4%
2 6
 
1.3%
0 4
 
0.9%
/ 2
 
0.4%
9 2
 
0.4%
& 1
 
0.2%
5 1
 
0.2%
Other values (4) 4
 
0.9%
Latin
ValueCountFrequency (%)
p 4
30.8%
a 2
15.4%
i 2
15.4%
n 1
 
7.7%
o 1
 
7.7%
t 1
 
7.7%
c 1
 
7.7%
l 1
 
7.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5166 | 91.6%
ASCII | 473 | 8.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
502
 
9.7%
494
 
9.6%
491
 
9.5%
488
 
9.4%
482
 
9.3%
480
 
9.3%
145
 
2.8%
108
 
2.1%
108
 
2.1%
102
 
2.0%
Other values (336) 1766
34.2%
ASCII
ValueCountFrequency (%)
367
77.6%
( 31
 
6.6%
) 31
 
6.6%
. 11
 
2.3%
2 6
 
1.3%
0 4
 
0.8%
p 4
 
0.8%
a 2
 
0.4%
i 2
 
0.4%
/ 2
 
0.4%
Other values (12) 13
 
2.7%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 7.6 KiB

Value | Count
2024-04-12 | 952

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024-04-12
2nd row: 2024-04-12
3rd row: 2024-04-12
4th row: 2024-04-12
5th row: 2024-04-12

Common Values

Value | Count | Frequency (%)
2024-04-12 | 952 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2024-04-12 | 952 | 100.0%

Interactions


Correlations

Variable | 연번 | 업체명 | 취급품목
연번 | 1.000 | 0.361 | 0.449
업체명 | 0.361 | 1.000 | 0.000
취급품목 | 0.449 | 0.000 | 1.000

Variable | 연번 | 업체명
연번 | 1.000 | 0.130
업체명 | 0.130 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Row | 연번 | 업체명 | 도메인명 | 취급품목 | 취급품목세부사항 | 데이터기준일자
0 | 1 | 데이터미집계 | 네이버스마트스토어 | 의류/패션/잡화/뷰티 | 데이터미집계 | 2024-04-12
1 | 2 | 데이터미집계 | 데이터미집계 | 건강/식품 | 데이터미집계 | 2024-04-12
2 | 3 | 데이터미집계 | 네이버스마트스토어 | 기타 | 인쇄물 종이 디자인 등 | 2024-04-12
3 | 4 | 데이터미집계 | 네이버스마트스토어 | 종합몰 | 데이터미집계 | 2024-04-12
4 | 5 | 데이터미집계 | 네이버스마트스토어 | 종합몰 | 데이터미집계 | 2024-04-12
5 | 6 | 데이터미집계 | 스마트스토어 | 기타 | 데이터미집계 | 2024-04-12
6 | 7 | 데이터미집계 | 네이버스토어팜(the sumgim) | 건강/식품 | 데이터미집계 | 2024-04-12
7 | 8 | 나이스 | idreamone.com | 건강/식품 | 데이터미집계 | 2024-04-12
8 | 9 | 데이터미집계 | 네이버스마트스토어 | 건강/식품 | 데이터미집계 | 2024-04-12
9 | 10 | 데이터미집계 | 스마트스토어 | 종합몰 | 데이터미집계 | 2024-04-12

Last rows

Row | 연번 | 업체명 | 도메인명 | 취급품목 | 취급품목세부사항 | 데이터기준일자
942 | 943 | 데이터미집계 | 데이터미집계 | 데이터미집계 | 데이터미집계 | 2024-04-12
943 | 944 | 데이터미집계 | 데이터미집계 | 데이터미집계 | 데이터미집계 | 2024-04-12
944 | 945 | 데이터미집계 | www.lowbuy.co.kr | 가전 | 가전제품 | 2024-04-12
945 | 946 | 데이터미집계 | 데이터미집계 | 데이터미집계 | 데이터미집계 | 2024-04-12
946 | 947 | 데이터미집계 | 데이터미집계 | 데이터미집계 | 데이터미집계 | 2024-04-12
947 | 948 | 데이터미집계 | www.sanjifarm.com | 기타 | 농산물 | 2024-04-12
948 | 949 | 데이터미집계 | foodware.co.kr | 건강/식품 | 만두류 | 2024-04-12
949 | 950 | 고도몰 | www.soojihealing.com | 건강/식품 기타 | 화장품 건강식품 의료용품 정수기 및 생활용품 | 2024-04-12
950 | 951 | 데이터미집계 | www.oaaro.kr | 데이터미집계 | 파프리카 | 2024-04-12
951 | 952 | 데이터미집계 | 데이터미집계 | 데이터미집계 | 데이터미집계 | 2024-04-12