Overview

Dataset statistics

Number of variables5
Number of observations28
Missing cells5
Missing cells (%)3.6%
Duplicate rows1
Duplicate rows (%)3.6%
Total size in memory1.2 KiB
Average record size in memory45.7 B

Variable types

Categorical1
Text3
Numeric1

Dataset

Description전북특별자치도 장수군의 단체급식 현황(업종명, 업소명, 소재지, 전화번호, 1일급식인원수)에 대한 데이터 항목을 제공하고자 합니다
Author전북특별자치도 장수군
URLhttps://www.data.go.kr/data/15116632/fileData.do

Alerts

Dataset has 1 (3.6%) duplicate rowsDuplicates
1일급식인원수 is highly overall correlated with 업종명High correlation
업종명 is highly overall correlated with 1일급식인원수High correlation
업종명 is highly imbalanced (62.9%)Imbalance
1일급식인원수 has 5 (17.9%) missing valuesMissing

Reproduction

Analysis started2024-03-30 03:11:03.736091
Analysis finished2024-03-30 03:11:05.104153
Duration1.37 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Memory size356.0 B
집단급식소
26 
위탁급식영업
 
2

Length

Max length6
Median length5
Mean length5.0714286
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row집단급식소
2nd row집단급식소
3rd row집단급식소
4th row집단급식소
5th row집단급식소

Common Values

ValueCountFrequency (%)
집단급식소 26
92.9%
위탁급식영업 2
 
7.1%

Length

2024-03-30T03:11:05.353443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-30T03:11:05.659124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
집단급식소 26
92.9%
위탁급식영업 2
 
7.1%
Distinct27
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-03-30T03:11:06.047396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length8.5357143
Min length5

Characters and Unicode

Total characters239
Distinct characters71
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)92.9%

Sample

1st row장수초등학교급식
2nd row백화고등학교급식
3rd row산서초등학교급식
4th row번암초등학교급식
5th row장계초등학교급식
ValueCountFrequency (%)
계북초등학교급식 2
 
5.9%
한국마사회 2
 
5.9%
산서초등학교급식 1
 
2.9%
번암초등학교급식 1
 
2.9%
오렌지 1
 
2.9%
장수군청집단급식소 1
 
2.9%
구내식당 1
 
2.9%
새마을금고 1
 
2.9%
분관 1
 
2.9%
노인복지관(장계 1
 
2.9%
Other values (22) 22
64.7%
2024-03-30T03:11:07.005540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15
 
6.3%
15
 
6.3%
14
 
5.9%
14
 
5.9%
13
 
5.4%
12
 
5.0%
11
 
4.6%
9
 
3.8%
7
 
2.9%
6
 
2.5%
Other values (61) 123
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 229
95.8%
Space Separator 6
 
2.5%
Open Punctuation 2
 
0.8%
Close Punctuation 2
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
6.6%
15
 
6.6%
14
 
6.1%
14
 
6.1%
13
 
5.7%
12
 
5.2%
11
 
4.8%
9
 
3.9%
7
 
3.1%
6
 
2.6%
Other values (58) 113
49.3%
Space Separator
ValueCountFrequency (%)
6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 229
95.8%
Common 10
 
4.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
 
6.6%
15
 
6.6%
14
 
6.1%
14
 
6.1%
13
 
5.7%
12
 
5.2%
11
 
4.8%
9
 
3.9%
7
 
3.1%
6
 
2.6%
Other values (58) 113
49.3%
Common
ValueCountFrequency (%)
6
60.0%
( 2
 
20.0%
) 2
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 229
95.8%
ASCII 10
 
4.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
15
 
6.6%
15
 
6.6%
14
 
6.1%
14
 
6.1%
13
 
5.7%
12
 
5.2%
11
 
4.8%
9
 
3.9%
7
 
3.1%
6
 
2.6%
Other values (58) 113
49.3%
ASCII
ValueCountFrequency (%)
6
60.0%
( 2
 
20.0%
) 2
 
20.0%
Distinct26
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-03-30T03:11:07.647972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length27
Mean length23.714286
Min length21

Characters and Unicode

Total characters664
Distinct characters74
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)85.7%

Sample

1st row전북특별자치도 장수군 장수읍 향교길 11-8
2nd row전북특별자치도 장수군 장계면 백화로 4-31
3rd row전북특별자치도 장수군 산서면 동백로 7
4th row전북특별자치도 장수군 번암면 동강길 21
5th row전북특별자치도 장수군 장계면 한들로 69
ValueCountFrequency (%)
전북특별자치도 28
19.6%
장수군 28
19.6%
장수읍 12
 
8.4%
장계면 9
 
6.3%
7 3
 
2.1%
천천면 2
 
1.4%
백화로 2
 
1.4%
장천로 2
 
1.4%
노하3길 2
 
1.4%
16 2
 
1.4%
Other values (44) 53
37.1%
2024-03-30T03:11:08.614499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
115
17.3%
53
 
8.0%
42
 
6.3%
30
 
4.5%
28
 
4.2%
28
 
4.2%
28
 
4.2%
28
 
4.2%
28
 
4.2%
28
 
4.2%
Other values (64) 256
38.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 468
70.5%
Space Separator 115
 
17.3%
Decimal Number 72
 
10.8%
Dash Punctuation 7
 
1.1%
Other Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
11.3%
42
 
9.0%
30
 
6.4%
28
 
6.0%
28
 
6.0%
28
 
6.0%
28
 
6.0%
28
 
6.0%
28
 
6.0%
28
 
6.0%
Other values (51) 147
31.4%
Decimal Number
ValueCountFrequency (%)
1 14
19.4%
7 9
12.5%
2 9
12.5%
6 8
11.1%
5 7
9.7%
3 7
9.7%
4 6
8.3%
0 5
 
6.9%
8 5
 
6.9%
9 2
 
2.8%
Space Separator
ValueCountFrequency (%)
115
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 468
70.5%
Common 196
29.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
11.3%
42
 
9.0%
30
 
6.4%
28
 
6.0%
28
 
6.0%
28
 
6.0%
28
 
6.0%
28
 
6.0%
28
 
6.0%
28
 
6.0%
Other values (51) 147
31.4%
Common
ValueCountFrequency (%)
115
58.7%
1 14
 
7.1%
7 9
 
4.6%
2 9
 
4.6%
6 8
 
4.1%
- 7
 
3.6%
5 7
 
3.6%
3 7
 
3.6%
4 6
 
3.1%
0 5
 
2.6%
Other values (3) 9
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 468
70.5%
ASCII 196
29.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
115
58.7%
1 14
 
7.1%
7 9
 
4.6%
2 9
 
4.6%
6 8
 
4.1%
- 7
 
3.6%
5 7
 
3.6%
3 7
 
3.6%
4 6
 
3.1%
0 5
 
2.6%
Other values (3) 9
 
4.6%
Hangul
ValueCountFrequency (%)
53
 
11.3%
42
 
9.0%
30
 
6.4%
28
 
6.0%
28
 
6.0%
28
 
6.0%
28
 
6.0%
28
 
6.0%
28
 
6.0%
28
 
6.0%
Other values (51) 147
31.4%
Distinct26
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-03-30T03:11:09.120430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.035714
Min length12

Characters and Unicode

Total characters337
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)85.7%

Sample

1st row063-351-5514
2nd row063-351-0106
3rd row063-351-4588
4th row063-352-3567
5th row063-351-1093
ValueCountFrequency (%)
063-351-2231 2
 
7.1%
063-352-3785 2
 
7.1%
063-351-5514 1
 
3.6%
063-350-3745 1
 
3.6%
063-353-8286 1
 
3.6%
063-353-8833 1
 
3.6%
063-352-3690 1
 
3.6%
063-351-1517 1
 
3.6%
070-8249-8722 1
 
3.6%
063-353-8288 1
 
3.6%
Other values (16) 16
57.1%
2024-03-30T03:11:10.054513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 79
23.4%
- 56
16.6%
5 43
12.8%
0 41
12.2%
6 34
10.1%
1 25
 
7.4%
2 23
 
6.8%
8 18
 
5.3%
7 8
 
2.4%
4 6
 
1.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 281
83.4%
Dash Punctuation 56
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 79
28.1%
5 43
15.3%
0 41
14.6%
6 34
12.1%
1 25
 
8.9%
2 23
 
8.2%
8 18
 
6.4%
7 8
 
2.8%
4 6
 
2.1%
9 4
 
1.4%
Dash Punctuation
ValueCountFrequency (%)
- 56
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 337
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 79
23.4%
- 56
16.6%
5 43
12.8%
0 41
12.2%
6 34
10.1%
1 25
 
7.4%
2 23
 
6.8%
8 18
 
5.3%
7 8
 
2.4%
4 6
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 337
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 79
23.4%
- 56
16.6%
5 43
12.8%
0 41
12.2%
6 34
10.1%
1 25
 
7.4%
2 23
 
6.8%
8 18
 
5.3%
7 8
 
2.4%
4 6
 
1.8%

1일급식인원수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct19
Distinct (%)82.6%
Missing5
Missing (%)17.9%
Infinite0
Infinite (%)0.0%
Mean202.04348
Minimum56
Maximum750
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size384.0 B
2024-03-30T03:11:10.426212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum56
5-th percentile72
Q179.5
median145
Q3285
95-th percentile476
Maximum750
Range694
Interquartile range (IQR)205.5

Descriptive statistics

Standard deviation165.18405
Coefficient of variation (CV)0.81756686
Kurtosis4.5887112
Mean202.04348
Median Absolute Deviation (MAD)71
Skewness1.9379161
Sum4647
Variance27285.771
MonotonicityNot monotonic
2024-03-30T03:11:10.843297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
72 2
 
7.1%
285 2
 
7.1%
74 2
 
7.1%
95 2
 
7.1%
350 1
 
3.6%
82 1
 
3.6%
214 1
 
3.6%
206 1
 
3.6%
141 1
 
3.6%
750 1
 
3.6%
Other values (9) 9
32.1%
(Missing) 5
17.9%
ValueCountFrequency (%)
56 1
3.6%
72 2
7.1%
74 2
7.1%
77 1
3.6%
82 1
3.6%
94 1
3.6%
95 2
7.1%
141 1
3.6%
145 1
3.6%
180 1
3.6%
ValueCountFrequency (%)
750 1
3.6%
490 1
3.6%
350 1
3.6%
310 1
3.6%
300 1
3.6%
285 2
7.1%
214 1
3.6%
206 1
3.6%
200 1
3.6%
180 1
3.6%

Interactions

2024-03-30T03:11:04.224464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-30T03:11:11.127034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명업소명소재지(도로명)전화번호1일급식인원수
업종명1.0001.0000.0001.000NaN
업소명1.0001.0001.0001.0001.000
소재지(도로명)0.0001.0001.0000.9981.000
전화번호1.0001.0000.9981.0001.000
1일급식인원수NaN1.0001.0001.0001.000
2024-03-30T03:11:11.413861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
1일급식인원수업종명
1일급식인원수1.0001.000
업종명1.0001.000

Missing values

2024-03-30T03:11:04.596048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-30T03:11:04.983113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지(도로명)전화번호1일급식인원수
0집단급식소장수초등학교급식전북특별자치도 장수군 장수읍 향교길 11-8063-351-551495
1집단급식소백화고등학교급식전북특별자치도 장수군 장계면 백화로 4-31063-351-0106750
2집단급식소산서초등학교급식전북특별자치도 장수군 산서면 동백로 7063-351-458895
3집단급식소번암초등학교급식전북특별자치도 장수군 번암면 동강길 21063-352-3567214
4집단급식소장계초등학교급식전북특별자치도 장수군 장계면 한들로 69063-351-1093350
5집단급식소장수어린이집전북특별자치도 장수군 장수읍 덕산로 14-3063-351-213572
6집단급식소승예어린이집전북특별자치도 장수군 장수읍 신천로 50063-351-032174
7집단급식소장계어린이집전북특별자치도 장수군 장계면 한들4길 16063-352-237882
8집단급식소꿈나무어린이집전북특별자치도 장수군 장계면 방천길 5063-352-020472
9집단급식소계북초등학교급식전북특별자치도 장수군 계북면 문성길 7063-352-3785285
업종명업소명소재지(도로명)전화번호1일급식인원수
18집단급식소한국마사회 장수경주마목장전북특별자치도 장수군 장계면 육십령로 764-5063-351-2231<NA>
19집단급식소산서중고등학교전북특별자치도 장수군 산서면 보산로 1852-6063-351-413674
20집단급식소장수군노인복지관전북특별자치도 장수군 장수읍 노하3길 16063-353-828894
21집단급식소(사)한국농업연수원식당전북특별자치도 장수군 장수읍 발방골길 72070-8249-8722300
22집단급식소전북유니텍고등학교전북특별자치도 장수군 장계면 장계7길 3063-351-1517490
23집단급식소장수푸른어린이집전북특별자치도 장수군 장수읍 노하3길 21063-352-369056
24집단급식소장수한사랑유치원전북특별자치도 장수군 장수읍 장천로 237063-353-8833145
25집단급식소장수군 노인복지관(장계 분관)전북특별자치도 장수군 장계면 백화로 60, 장계면 종합복지회관063-353-828677
26위탁급식영업한국마사회 새마을금고 구내식당전북특별자치도 장수군 장계면 육십령로 764-5063-350-3745<NA>
27위탁급식영업오렌지 푸드전북특별자치도 장수군 장수읍 발방골길 72, 한국농업연수원063-352-3051<NA>

Duplicate rows

Most frequently occurring

업종명업소명소재지(도로명)전화번호1일급식인원수# duplicates
0집단급식소계북초등학교급식전북특별자치도 장수군 계북면 문성길 7063-352-37852852