Overview

Dataset statistics

Number of variables5
Number of observations2882
Missing cells342
Missing cells (%)2.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory118.3 KiB
Average record size in memory42.0 B

Variable types

Text3
Numeric2

Dataset

Description부산광역시 집단급식소 현황에 대한 데이터로 업종, 업소명, 업태, 업소주소 위도, 경도 항목에 대한 정보를 제공합니다
Author부산광역시
URLhttps://www.data.go.kr/data/3075873/fileData.do

Alerts

X좌표 is highly overall correlated with Y좌표High correlation
Y좌표 is highly overall correlated with X좌표High correlation
업소전화번호 has 342 (11.9%) missing valuesMissing

Reproduction

Analysis started2023-12-12 11:38:00.611320
Analysis finished2023-12-12 11:38:02.824045
Duration2.21 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct2826
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size22.6 KiB
2023-12-12T20:38:03.166724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length26
Mean length8.3501041
Min length2

Characters and Unicode

Total characters24065
Distinct characters577
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2778 ?
Unique (%)96.4%

Sample

1st row(사복)로사리오카리타스 중구노인복지관 분관
2nd row(의)송산의료재단해양요양병원
3rd row(재)천주교부산교구메리놀병원 집단급식소
4th row(주)HJ중공업
5th row(주)엘지유플러스
ValueCountFrequency (%)
어린이집 83
 
2.3%
의료법인 47
 
1.3%
주식회사 45
 
1.3%
구내식당 26
 
0.7%
사회복지법인 24
 
0.7%
유치원 14
 
0.4%
집단급식소 13
 
0.4%
부산광역시 13
 
0.4%
초등학교 10
 
0.3%
부산 9
 
0.3%
Other values (3067) 3252
92.0%
2023-12-12T20:38:03.825448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
879
 
3.7%
804
 
3.3%
788
 
3.3%
758
 
3.1%
654
 
2.7%
629
 
2.6%
601
 
2.5%
597
 
2.5%
505
 
2.1%
490
 
2.0%
Other values (567) 17360
72.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22253
92.5%
Space Separator 654
 
2.7%
Close Punctuation 452
 
1.9%
Open Punctuation 451
 
1.9%
Uppercase Letter 139
 
0.6%
Decimal Number 86
 
0.4%
Other Punctuation 24
 
0.1%
Lowercase Letter 5
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
879
 
4.0%
804
 
3.6%
788
 
3.5%
758
 
3.4%
629
 
2.8%
601
 
2.7%
597
 
2.7%
505
 
2.3%
490
 
2.2%
468
 
2.1%
Other values (524) 15734
70.7%
Uppercase Letter
ValueCountFrequency (%)
K 16
11.5%
B 15
10.8%
S 14
 
10.1%
C 12
 
8.6%
T 11
 
7.9%
N 10
 
7.2%
H 7
 
5.0%
G 7
 
5.0%
E 5
 
3.6%
M 5
 
3.6%
Other values (11) 37
26.6%
Decimal Number
ValueCountFrequency (%)
2 36
41.9%
1 21
24.4%
3 12
 
14.0%
7 5
 
5.8%
4 4
 
4.7%
6 3
 
3.5%
5 3
 
3.5%
9 2
 
2.3%
Other Punctuation
ValueCountFrequency (%)
. 12
50.0%
· 4
 
16.7%
, 3
 
12.5%
/ 3
 
12.5%
* 1
 
4.2%
& 1
 
4.2%
Lowercase Letter
ValueCountFrequency (%)
e 2
40.0%
i 1
20.0%
k 1
20.0%
s 1
20.0%
Space Separator
ValueCountFrequency (%)
654
100.0%
Close Punctuation
ValueCountFrequency (%)
) 452
100.0%
Open Punctuation
ValueCountFrequency (%)
( 451
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22253
92.5%
Common 1668
 
6.9%
Latin 144
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
879
 
4.0%
804
 
3.6%
788
 
3.5%
758
 
3.4%
629
 
2.8%
601
 
2.7%
597
 
2.7%
505
 
2.3%
490
 
2.2%
468
 
2.1%
Other values (524) 15734
70.7%
Latin
ValueCountFrequency (%)
K 16
 
11.1%
B 15
 
10.4%
S 14
 
9.7%
C 12
 
8.3%
T 11
 
7.6%
N 10
 
6.9%
H 7
 
4.9%
G 7
 
4.9%
E 5
 
3.5%
M 5
 
3.5%
Other values (15) 42
29.2%
Common
ValueCountFrequency (%)
654
39.2%
) 452
27.1%
( 451
27.0%
2 36
 
2.2%
1 21
 
1.3%
. 12
 
0.7%
3 12
 
0.7%
7 5
 
0.3%
· 4
 
0.2%
4 4
 
0.2%
Other values (8) 17
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22253
92.5%
ASCII 1808
 
7.5%
None 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
879
 
4.0%
804
 
3.6%
788
 
3.5%
758
 
3.4%
629
 
2.8%
601
 
2.7%
597
 
2.7%
505
 
2.3%
490
 
2.2%
468
 
2.1%
Other values (524) 15734
70.7%
ASCII
ValueCountFrequency (%)
654
36.2%
) 452
25.0%
( 451
24.9%
2 36
 
2.0%
1 21
 
1.2%
K 16
 
0.9%
B 15
 
0.8%
S 14
 
0.8%
. 12
 
0.7%
3 12
 
0.7%
Other values (32) 125
 
6.9%
None
ValueCountFrequency (%)
· 4
100.0%
Distinct2807
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size22.6 KiB
2023-12-12T20:38:04.379093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length49
Mean length28.014226
Min length18

Characters and Unicode

Total characters80737
Distinct characters481
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2747 ?
Unique (%)95.3%

Sample

1st row부산광역시 중구 영주로 8-1(중구노인복지관분관 4층 영주동)
2nd row부산광역시 중구 광복로97번길 26-2(14층 동광동2가)
3rd row부산광역시 중구 중구로 121(대청동4가)
4th row부산광역시 중구 충장대로 6(지1층 중앙동4가)
5th row부산광역시 중구 중앙대로 52(중앙동5가)
ValueCountFrequency (%)
부산광역시 2882
 
21.1%
강서구 410
 
3.0%
사하구 278
 
2.0%
부산진구 273
 
2.0%
해운대구 247
 
1.8%
동래구 211
 
1.5%
기장군 202
 
1.5%
북구 183
 
1.3%
금정구 178
 
1.3%
사상구 174
 
1.3%
Other values (4542) 8619
63.1%
2023-12-12T20:38:05.355225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10793
 
13.4%
3957
 
4.9%
3573
 
4.4%
3473
 
4.3%
3017
 
3.7%
3013
 
3.7%
2889
 
3.6%
2840
 
3.5%
2763
 
3.4%
( 2744
 
3.4%
Other values (471) 41675
51.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 51559
63.9%
Decimal Number 11918
 
14.8%
Space Separator 10793
 
13.4%
Open Punctuation 2744
 
3.4%
Close Punctuation 2744
 
3.4%
Other Punctuation 428
 
0.5%
Dash Punctuation 372
 
0.5%
Uppercase Letter 156
 
0.2%
Lowercase Letter 16
 
< 0.1%
Math Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3957
 
7.7%
3573
 
6.9%
3473
 
6.7%
3017
 
5.9%
3013
 
5.8%
2889
 
5.6%
2840
 
5.5%
2763
 
5.4%
1217
 
2.4%
1187
 
2.3%
Other values (417) 23630
45.8%
Uppercase Letter
ValueCountFrequency (%)
B 27
17.3%
A 19
12.2%
K 14
 
9.0%
S 12
 
7.7%
C 12
 
7.7%
L 9
 
5.8%
E 8
 
5.1%
H 8
 
5.1%
T 6
 
3.8%
G 5
 
3.2%
Other values (12) 36
23.1%
Decimal Number
ValueCountFrequency (%)
1 2655
22.3%
2 1730
14.5%
3 1352
11.3%
4 1068
9.0%
5 1028
 
8.6%
6 992
 
8.3%
7 881
 
7.4%
0 869
 
7.3%
9 706
 
5.9%
8 637
 
5.3%
Other Punctuation
ValueCountFrequency (%)
, 412
96.3%
. 5
 
1.2%
: 3
 
0.7%
& 2
 
0.5%
@ 2
 
0.5%
· 2
 
0.5%
1
 
0.2%
/ 1
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
e 6
37.5%
c 3
18.8%
l 2
 
12.5%
b 1
 
6.2%
o 1
 
6.2%
k 1
 
6.2%
i 1
 
6.2%
s 1
 
6.2%
Space Separator
ValueCountFrequency (%)
10793
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2744
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2744
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 372
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51559
63.9%
Common 29005
35.9%
Latin 173
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3957
 
7.7%
3573
 
6.9%
3473
 
6.7%
3017
 
5.9%
3013
 
5.8%
2889
 
5.6%
2840
 
5.5%
2763
 
5.4%
1217
 
2.4%
1187
 
2.3%
Other values (417) 23630
45.8%
Latin
ValueCountFrequency (%)
B 27
15.6%
A 19
 
11.0%
K 14
 
8.1%
S 12
 
6.9%
C 12
 
6.9%
L 9
 
5.2%
E 8
 
4.6%
H 8
 
4.6%
e 6
 
3.5%
T 6
 
3.5%
Other values (21) 52
30.1%
Common
ValueCountFrequency (%)
10793
37.2%
( 2744
 
9.5%
) 2744
 
9.5%
1 2655
 
9.2%
2 1730
 
6.0%
3 1352
 
4.7%
4 1068
 
3.7%
5 1028
 
3.5%
6 992
 
3.4%
7 881
 
3.0%
Other values (13) 3018
 
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51559
63.9%
ASCII 29174
36.1%
None 3
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10793
37.0%
( 2744
 
9.4%
) 2744
 
9.4%
1 2655
 
9.1%
2 1730
 
5.9%
3 1352
 
4.6%
4 1068
 
3.7%
5 1028
 
3.5%
6 992
 
3.4%
7 881
 
3.0%
Other values (41) 3187
 
10.9%
Hangul
ValueCountFrequency (%)
3957
 
7.7%
3573
 
6.9%
3473
 
6.7%
3017
 
5.9%
3013
 
5.8%
2889
 
5.6%
2840
 
5.5%
2763
 
5.4%
1217
 
2.4%
1187
 
2.3%
Other values (417) 23630
45.8%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
Number Forms
ValueCountFrequency (%)
1
100.0%

업소전화번호
Text

MISSING 

Distinct2499
Distinct (%)98.4%
Missing342
Missing (%)11.9%
Memory size22.6 KiB
2023-12-12T20:38:05.992574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.675984
Min length9

Characters and Unicode

Total characters29657
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2467 ?
Unique (%)97.1%

Sample

1st row051 462 0316
2nd row051 466 0008
3rd row051 4612398
4th row051 4647307
5th row051 7187229
ValueCountFrequency (%)
051 2417
35.8%
831 66
 
1.0%
070 43
 
0.6%
330 37
 
0.5%
727 25
 
0.4%
714 17
 
0.3%
302 16
 
0.2%
320 16
 
0.2%
580 16
 
0.2%
728 16
 
0.2%
Other values (2569) 4078
60.4%
2023-12-12T20:38:06.869641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 5375
18.1%
1 4360
14.7%
5 4301
14.5%
4230
14.3%
2 2041
 
6.9%
7 1923
 
6.5%
3 1804
 
6.1%
6 1539
 
5.2%
8 1479
 
5.0%
4 1323
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 25427
85.7%
Space Separator 4230
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 5375
21.1%
1 4360
17.1%
5 4301
16.9%
2 2041
 
8.0%
7 1923
 
7.6%
3 1804
 
7.1%
6 1539
 
6.1%
8 1479
 
5.8%
4 1323
 
5.2%
9 1282
 
5.0%
Space Separator
ValueCountFrequency (%)
4230
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 29657
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 5375
18.1%
1 4360
14.7%
5 4301
14.5%
4230
14.3%
2 2041
 
6.9%
7 1923
 
6.5%
3 1804
 
6.1%
6 1539
 
5.2%
8 1479
 
5.0%
4 1323
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29657
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 5375
18.1%
1 4360
14.7%
5 4301
14.5%
4230
14.3%
2 2041
 
6.9%
7 1923
 
6.5%
3 1804
 
6.1%
6 1539
 
5.2%
8 1479
 
5.0%
4 1323
 
4.5%

X좌표
Real number (ℝ)

HIGH CORRELATION 

Distinct2683
Distinct (%)93.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean129.04143
Minimum128.80416
Maximum129.29222
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.5 KiB
2023-12-12T20:38:07.159301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum128.80416
5-th percentile128.85228
Q1128.98965
median129.05342
Q3129.09892
95-th percentile129.18521
Maximum129.29222
Range0.4880612
Interquartile range (IQR)0.10927032

Descriptive statistics

Standard deviation0.094625214
Coefficient of variation (CV)0.00073329327
Kurtosis0.045702018
Mean129.04143
Median Absolute Deviation (MAD)0.05330685
Skewness-0.35746863
Sum371897.41
Variance0.0089539312
MonotonicityNot monotonic
2023-12-12T20:38:07.426076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
129.0838238 9
 
0.3%
129.1047328 7
 
0.2%
129.036312 6
 
0.2%
129.0658605 5
 
0.2%
129.0981953 5
 
0.2%
128.8832206 5
 
0.2%
129.1275334 4
 
0.1%
128.85775 4
 
0.1%
129.0697784 4
 
0.1%
129.0122107 4
 
0.1%
Other values (2673) 2829
98.2%
ValueCountFrequency (%)
128.8041603 1
< 0.1%
128.8116945 1
< 0.1%
128.8127389 1
< 0.1%
128.8133272 1
< 0.1%
128.8144829 1
< 0.1%
128.8150536 1
< 0.1%
128.8158336 1
< 0.1%
128.8166027 1
< 0.1%
128.8172751 1
< 0.1%
128.8182265 1
< 0.1%
ValueCountFrequency (%)
129.2922215 1
< 0.1%
129.2836311 1
< 0.1%
129.2826453 2
0.1%
129.2824849 1
< 0.1%
129.2733574 1
< 0.1%
129.2693967 1
< 0.1%
129.2693384 1
< 0.1%
129.2682785 1
< 0.1%
129.2673999 1
< 0.1%
129.267278 1
< 0.1%

Y좌표
Real number (ℝ)

HIGH CORRELATION 

Distinct2684
Distinct (%)93.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.163513
Minimum35.042548
Maximum35.379486
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.5 KiB
2023-12-12T20:38:07.685194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum35.042548
5-th percentile35.07933
Q135.110343
median35.161184
Q335.201603
95-th percentile35.269458
Maximum35.379486
Range0.3369386
Interquartile range (IQR)0.09126014

Descriptive statistics

Standard deviation0.062234383
Coefficient of variation (CV)0.0017698568
Kurtosis0.31308382
Mean35.163513
Median Absolute Deviation (MAD)0.04412413
Skewness0.63359583
Sum101341.24
Variance0.0038731184
MonotonicityNot monotonic
2023-12-12T20:38:07.975150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35.23223941 9
 
0.3%
35.13117614 7
 
0.2%
35.14431978 6
 
0.2%
35.14648787 5
 
0.2%
35.09540652 5
 
0.2%
35.24704552 5
 
0.2%
35.15151913 4
 
0.1%
35.07440875 4
 
0.1%
35.11880727 4
 
0.1%
35.25485329 4
 
0.1%
Other values (2674) 2829
98.2%
ValueCountFrequency (%)
35.0425478 1
< 0.1%
35.04996522 1
< 0.1%
35.05025013 1
< 0.1%
35.05061057 1
< 0.1%
35.05093083 1
< 0.1%
35.05112408 1
< 0.1%
35.05122843 1
< 0.1%
35.0513562 1
< 0.1%
35.0513807 1
< 0.1%
35.05163272 1
< 0.1%
ValueCountFrequency (%)
35.3794864 1
< 0.1%
35.37920445 1
< 0.1%
35.37483686 1
< 0.1%
35.37426367 1
< 0.1%
35.3710779 1
< 0.1%
35.37089548 1
< 0.1%
35.36915143 1
< 0.1%
35.36665765 1
< 0.1%
35.36529328 1
< 0.1%
35.36017428 1
< 0.1%

Interactions

2023-12-12T20:38:02.141626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:38:01.746169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:38:02.304705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:38:01.926906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:38:08.176117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
X좌표Y좌표
X좌표1.0000.808
Y좌표0.8081.000
2023-12-12T20:38:08.320271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
X좌표Y좌표
X좌표1.0000.603
Y좌표0.6031.000

Missing values

2023-12-12T20:38:02.538208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:38:02.742740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명업소주소업소전화번호X좌표Y좌표
0(사복)로사리오카리타스 중구노인복지관 분관부산광역시 중구 영주로 8-1(중구노인복지관분관 4층 영주동)051 462 0316129.032535.110162
1(의)송산의료재단해양요양병원부산광역시 중구 광복로97번길 26-2(14층 동광동2가)051 466 0008129.03489635.100966
2(재)천주교부산교구메리놀병원 집단급식소부산광역시 중구 중구로 121(대청동4가)051 4612398129.03298435.107652
3(주)HJ중공업부산광역시 중구 충장대로 6(지1층 중앙동4가)051 4647307129.03712635.104696
4(주)엘지유플러스부산광역시 중구 중앙대로 52(중앙동5가)051 7187229129.03685335.102193
5(주)한국스탠다드차타드은행CS센터부산광역시 중구 대청로 135(8층 중앙동3가, 제일은행)051 4415986129.03483535.103129
6관정빌딩 구내식당부산광역시 중구 충장대로9번길 46(관정빌딩 지하1층 중앙동4가)051 603 3997129.03874135.109262
7광일초등학교부산광역시 중구 중구로 74(대청동4가)051 603 5122129.0296835.104003
8국립수산물품질관리원 부산지원부산광역시 중구 중앙대로30번길 8(지하1층 중앙동6가)051 602 6020129.03739335.099781
9굿모닝요양원부산광역시 중구 대영로 235(제일빌딩 9층 영주동)<NA>129.03640135.112373
업소명업소주소업소전화번호X좌표Y좌표
2872해빛초등학교부산광역시 기장군 일광읍 해빛6로 9(2층)051 790 8300129.22149135.264773
2873해양수산인재개발원부산광역시 기장군 기장읍 기장해안로 216051 720 7736129.22228635.191096
2874행복엔젤유치원부산광역시 기장군 기장읍 차성서로 44(1층)051 502 4556129.21035335.237123
2875현대요양병원부산광역시 기장군 기장읍 반송로 1555051 721 7582129.20994435.248773
2876홈플러스(주)부산정관점부산광역시 기장군 정관읍 정관5로 50051 519 8224129.17638635.323052
2877효산요양원부산광역시 기장군 철마면 철마삼동로 103(1층)051 508 0675129.11450535.313823
2878효성노인건강센터부산광역시 기장군 장안읍 한골길 159-16051 727 5080129.26140935.358987
2879효성유치원부산광역시 기장군 기장읍 차성로344번길 13051 753 1013129.21528235.249359
2880효성전기(주)부산광역시 기장군 장안읍 장안산단9로 190051 720 6416129.25658735.32475
2881효성제일노인건강센터부산광역시 기장군 장안읍 오리길 109051 714 3747129.27335735.366658