Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory400.4 KiB
Average record size in memory41.0 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description경상북도 11,472개의 민간배달앱을 사용하는 사업체 정보(순번, 상호, 시군명, 주소) 데이터 셋 (CSV 파일)
Author경상북도
URLhttps://www.data.go.kr/data/15096094/fileData.do

Alerts

순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 00:23:54.966884
Analysis finished2023-12-12 00:23:56.579885
Duration1.61 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5735.2676
Minimum1
Maximum11472
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T09:23:56.649508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile560.95
Q12872.75
median5746.5
Q38601.25
95-th percentile10892.05
Maximum11472
Range11471
Interquartile range (IQR)5728.5

Descriptive statistics

Standard deviation3309.3705
Coefficient of variation (CV)0.57702112
Kurtosis-1.1959499
Mean5735.2676
Median Absolute Deviation (MAD)2863.5
Skewness-0.004570544
Sum57352676
Variance10951933
MonotonicityNot monotonic
2023-12-12T09:23:56.808720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2376 1
 
< 0.1%
8418 1
 
< 0.1%
1179 1
 
< 0.1%
73 1
 
< 0.1%
225 1
 
< 0.1%
10869 1
 
< 0.1%
3363 1
 
< 0.1%
3504 1
 
< 0.1%
2955 1
 
< 0.1%
10738 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
11472 1
< 0.1%
11471 1
< 0.1%
11470 1
< 0.1%
11469 1
< 0.1%
11468 1
< 0.1%
11467 1
< 0.1%
11465 1
< 0.1%
11464 1
< 0.1%
11462 1
< 0.1%
11461 1
< 0.1%
Distinct9607
Distinct (%)96.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T09:23:57.094599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length23
Mean length8.8486
Min length1

Characters and Unicode

Total characters88486
Distinct characters990
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9273 ?
Unique (%)92.7%

Sample

1st row비스트로피자 경주1호점
2nd row임꺽정 숯불촌
3rd row장강반점
4th rowBOX OF DARK
5th row호식이두마리치킨중리점
ValueCountFrequency (%)
경산점 181
 
1.0%
본점 145
 
0.8%
옥계점 127
 
0.7%
인동점 106
 
0.6%
경주점 93
 
0.5%
하양점 93
 
0.5%
파리바게뜨 80
 
0.4%
영천점 76
 
0.4%
포항점 76
 
0.4%
안동점 75
 
0.4%
Other values (8112) 16902
94.1%
2023-12-12T09:23:57.508251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7954
 
9.0%
5853
 
6.6%
1491
 
1.7%
1473
 
1.7%
1132
 
1.3%
1074
 
1.2%
1070
 
1.2%
1061
 
1.2%
1041
 
1.2%
940
 
1.1%
Other values (980) 65397
73.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 77099
87.1%
Space Separator 7954
 
9.0%
Decimal Number 990
 
1.1%
Uppercase Letter 974
 
1.1%
Other Punctuation 835
 
0.9%
Lowercase Letter 503
 
0.6%
Close Punctuation 60
 
0.1%
Open Punctuation 60
 
0.1%
Dash Punctuation 10
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5853
 
7.6%
1491
 
1.9%
1473
 
1.9%
1132
 
1.5%
1074
 
1.4%
1070
 
1.4%
1061
 
1.4%
1041
 
1.4%
940
 
1.2%
918
 
1.2%
Other values (906) 61046
79.2%
Uppercase Letter
ValueCountFrequency (%)
B 217
22.3%
C 105
10.8%
H 95
9.8%
Q 55
 
5.6%
T 54
 
5.5%
O 53
 
5.4%
E 52
 
5.3%
N 38
 
3.9%
A 35
 
3.6%
D 33
 
3.4%
Other values (16) 237
24.3%
Lowercase Letter
ValueCountFrequency (%)
e 75
14.9%
o 49
 
9.7%
n 38
 
7.6%
s 37
 
7.4%
i 35
 
7.0%
a 32
 
6.4%
h 32
 
6.4%
f 29
 
5.8%
t 23
 
4.6%
r 21
 
4.2%
Other values (16) 132
26.2%
Decimal Number
ValueCountFrequency (%)
1 252
25.5%
0 151
15.3%
9 120
12.1%
2 88
 
8.9%
3 86
 
8.7%
6 68
 
6.9%
5 63
 
6.4%
8 60
 
6.1%
4 53
 
5.4%
7 49
 
4.9%
Other Punctuation
ValueCountFrequency (%)
& 737
88.3%
. 46
 
5.5%
, 27
 
3.2%
' 18
 
2.2%
4
 
0.5%
· 2
 
0.2%
% 1
 
0.1%
Space Separator
ValueCountFrequency (%)
7954
100.0%
Close Punctuation
ValueCountFrequency (%)
) 60
100.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 77091
87.1%
Common 9910
 
11.2%
Latin 1477
 
1.7%
Han 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5853
 
7.6%
1491
 
1.9%
1473
 
1.9%
1132
 
1.5%
1074
 
1.4%
1070
 
1.4%
1061
 
1.4%
1041
 
1.4%
940
 
1.2%
918
 
1.2%
Other values (899) 61038
79.2%
Latin
ValueCountFrequency (%)
B 217
 
14.7%
C 105
 
7.1%
H 95
 
6.4%
e 75
 
5.1%
Q 55
 
3.7%
T 54
 
3.7%
O 53
 
3.6%
E 52
 
3.5%
o 49
 
3.3%
N 38
 
2.6%
Other values (42) 684
46.3%
Common
ValueCountFrequency (%)
7954
80.3%
& 737
 
7.4%
1 252
 
2.5%
0 151
 
1.5%
9 120
 
1.2%
2 88
 
0.9%
3 86
 
0.9%
6 68
 
0.7%
5 63
 
0.6%
8 60
 
0.6%
Other values (12) 331
 
3.3%
Han
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 77091
87.1%
ASCII 11381
 
12.9%
CJK 8
 
< 0.1%
None 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7954
69.9%
& 737
 
6.5%
1 252
 
2.2%
B 217
 
1.9%
0 151
 
1.3%
9 120
 
1.1%
C 105
 
0.9%
H 95
 
0.8%
2 88
 
0.8%
3 86
 
0.8%
Other values (62) 1576
 
13.8%
Hangul
ValueCountFrequency (%)
5853
 
7.6%
1491
 
1.9%
1473
 
1.9%
1132
 
1.5%
1074
 
1.4%
1070
 
1.4%
1061
 
1.4%
1041
 
1.4%
940
 
1.2%
918
 
1.2%
Other values (899) 61038
79.2%
None
ValueCountFrequency (%)
4
66.7%
· 2
33.3%
CJK
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

시군명
Categorical

Distinct27
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
구미시
2535 
경산시
1673 
포항시 북구
1172 
포항시 남구
1119 
경주시
843 
Other values (22)
2658 

Length

Max length6
Median length3
Mean length3.687
Min length2

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row경주시
2nd row김천시
3rd row예천군
4th row구미시
5th row칠곡군

Common Values

ValueCountFrequency (%)
구미시 2535
25.4%
경산시 1673
16.7%
포항시 북구 1172
11.7%
포항시 남구 1119
11.2%
경주시 843
 
8.4%
안동시 571
 
5.7%
칠곡군 518
 
5.2%
김천시 432
 
4.3%
영주시 314
 
3.1%
영천시 242
 
2.4%
Other values (17) 581
 
5.8%

Length

2023-12-12T09:23:57.644274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
구미시 2535
20.6%
포항시 2291
18.6%
경산시 1673
13.6%
북구 1172
9.5%
남구 1119
9.1%
경주시 843
 
6.9%
안동시 571
 
4.6%
칠곡군 518
 
4.2%
김천시 432
 
3.5%
영주시 314
 
2.6%
Other values (18) 823
 
6.7%
Distinct7603
Distinct (%)76.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T09:23:57.992876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length42
Mean length22.4081
Min length12

Characters and Unicode

Total characters224081
Distinct characters483
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6168 ?
Unique (%)61.7%

Sample

1st row경북 경주시 황성로64번길 30-28
2nd row경상북도 김천시 송설로 23(부곡동)
3rd row경상북도 예천군 호명면 양지4길 20-10
4th row경상북도 구미시 상사서로 55(상모동)
5th row경상북도 칠곡군 석적읍 서중리5길 50
ValueCountFrequency (%)
경상북도 7931
 
17.2%
구미시 2535
 
5.5%
포항시 2291
 
5.0%
경북 2013
 
4.4%
경산시 1673
 
3.6%
북구 1172
 
2.5%
남구 1119
 
2.4%
경주시 843
 
1.8%
안동시 571
 
1.2%
칠곡군 518
 
1.1%
Other values (7260) 25340
55.1%
2023-12-12T09:23:58.511767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
36006
 
16.1%
13033
 
5.8%
11600
 
5.2%
9576
 
4.3%
9045
 
4.0%
8566
 
3.8%
8358
 
3.7%
1 7995
 
3.6%
7880
 
3.5%
( 6104
 
2.7%
Other values (473) 105918
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 138916
62.0%
Space Separator 36006
 
16.1%
Decimal Number 34022
 
15.2%
Open Punctuation 6104
 
2.7%
Close Punctuation 6104
 
2.7%
Dash Punctuation 2694
 
1.2%
Other Punctuation 181
 
0.1%
Uppercase Letter 46
 
< 0.1%
Lowercase Letter 7
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13033
 
9.4%
11600
 
8.4%
9576
 
6.9%
9045
 
6.5%
8566
 
6.2%
8358
 
6.0%
7880
 
5.7%
5579
 
4.0%
5453
 
3.9%
2797
 
2.0%
Other values (435) 57029
41.1%
Uppercase Letter
ValueCountFrequency (%)
B 7
15.2%
C 6
13.0%
A 6
13.0%
I 6
13.0%
H 5
10.9%
J 4
8.7%
Y 3
6.5%
G 2
 
4.3%
S 2
 
4.3%
N 1
 
2.2%
Other values (4) 4
8.7%
Decimal Number
ValueCountFrequency (%)
1 7995
23.5%
2 5252
15.4%
3 3935
11.6%
4 3014
 
8.9%
5 2939
 
8.6%
6 2531
 
7.4%
0 2178
 
6.4%
7 2138
 
6.3%
8 2045
 
6.0%
9 1995
 
5.9%
Lowercase Letter
ValueCountFrequency (%)
e 2
28.6%
s 1
14.3%
k 1
14.3%
w 1
14.3%
a 1
14.3%
b 1
14.3%
Other Punctuation
ValueCountFrequency (%)
, 177
97.8%
& 3
 
1.7%
· 1
 
0.6%
Space Separator
ValueCountFrequency (%)
36006
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6104
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6104
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2694
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 138916
62.0%
Common 85112
38.0%
Latin 53
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13033
 
9.4%
11600
 
8.4%
9576
 
6.9%
9045
 
6.5%
8566
 
6.2%
8358
 
6.0%
7880
 
5.7%
5579
 
4.0%
5453
 
3.9%
2797
 
2.0%
Other values (435) 57029
41.1%
Latin
ValueCountFrequency (%)
B 7
13.2%
C 6
11.3%
A 6
11.3%
I 6
11.3%
H 5
9.4%
J 4
 
7.5%
Y 3
 
5.7%
G 2
 
3.8%
S 2
 
3.8%
e 2
 
3.8%
Other values (10) 10
18.9%
Common
ValueCountFrequency (%)
36006
42.3%
1 7995
 
9.4%
( 6104
 
7.2%
) 6104
 
7.2%
2 5252
 
6.2%
3 3935
 
4.6%
4 3014
 
3.5%
5 2939
 
3.5%
- 2694
 
3.2%
6 2531
 
3.0%
Other values (8) 8538
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 138916
62.0%
ASCII 85164
38.0%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
36006
42.3%
1 7995
 
9.4%
( 6104
 
7.2%
) 6104
 
7.2%
2 5252
 
6.2%
3 3935
 
4.6%
4 3014
 
3.5%
5 2939
 
3.5%
- 2694
 
3.2%
6 2531
 
3.0%
Other values (27) 8590
 
10.1%
Hangul
ValueCountFrequency (%)
13033
 
9.4%
11600
 
8.4%
9576
 
6.9%
9045
 
6.5%
8566
 
6.2%
8358
 
6.0%
7880
 
5.7%
5579
 
4.0%
5453
 
3.9%
2797
 
2.0%
Other values (435) 57029
41.1%
None
ValueCountFrequency (%)
· 1
100.0%

Interactions

2023-12-12T09:23:56.293198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:23:58.606431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번시군명
순번1.0000.141
시군명0.1411.000
2023-12-12T09:23:58.680270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번시군명
순번1.0000.051
시군명0.0511.000

Missing values

2023-12-12T09:23:56.442719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:23:56.541611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번상호명시군명도로명 주소
23752376비스트로피자 경주1호점경주시경북 경주시 황성로64번길 30-28
85058506임꺽정 숯불촌김천시경상북도 김천시 송설로 23(부곡동)
58645865장강반점예천군경상북도 예천군 호명면 양지4길 20-10
1051110512BOX OF DARK구미시경상북도 구미시 상사서로 55(상모동)
10841085호식이두마리치킨중리점칠곡군경상북도 칠곡군 석적읍 서중리5길 50
98819882옛날농주김천시경상북도 김천시 남면 석정길 222-15
64016402샌드리코 왜관점칠곡군경상북도 칠곡군 왜관읍 석전로7길 9
22762277범프리카인생치킨 오천점포항시 남구경북 포항시 남구 오천읍 원동로 73
18141815돈까스퐁당떡볶이공수간 구미상모점구미시경북 구미시 상사서로4길 23
72847285밥도둑 짜글이 왜관점칠곡군경상북도 칠곡군 왜관읍 회동1길 12
순번상호명시군명도로명 주소
85138514인생극장쪽갈비안동점안동시경상북도 안동시 복주6길 34(옥동)
56045605안동한우곱창도청점예천군경상북도 예천군 호명면 양지4길 6-4
37923793명랑부대찌개 구미인동점구미시경상북도 구미시 인동19길 26(인의동, 신화오페라하우스)
996997서울뚝배기구미시경상북도 구미시 산호대로27길 40(옥계동)
1014310144육회한 밤칠곡군경상북도 칠곡군 왜관읍 구상길 203
12441245하늘보리피자 하양진량점경산시경상북도 경산시 진량읍 봉황길 68
64836484삼첩분식 구미도량점구미시경상북도 구미시 문장로12길 5-4(도량동)
78207821역전할머니맥주 장량점포항시 북구경상북도 포항시 북구 장량중앙로 76(양덕동)
71187119피자스쿨 경북전문대점영주시경상북도 영주시 대학로 80-1(가흥동)
1055610557애플꼬마김밥 죽도점포항시 북구경상북도 포항시 북구 중흥로 319(죽도동)