Overview

Dataset statistics

Number of variables4
Number of observations118
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)1.7%
Total size in memory3.9 KiB
Average record size in memory34.1 B

Variable types

DateTime1
Categorical2
Numeric1

Dataset

Description제주특별자치도개발공사 삼다수 제조 과정을 관람할 수 있는 견학로 운영 현황(견학일, 방문 유형, 방문객수) 입니다.
URLhttps://www.data.go.kr/data/15029664/fileData.do

Alerts

Dataset has 2 (1.7%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 22:57:23.959259
Analysis finished2023-12-12 22:57:24.351012
Duration0.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct66
Distinct (%)55.9%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum2023-01-04 00:00:00
Maximum2023-06-30 00:00:00
2023-12-13T07:57:24.413416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:57:24.532174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

견학유형
Categorical

Distinct3
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
방문
48 
여행
45 
교육 및 연수
25 

Length

Max length7
Median length2
Mean length3.059322
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교육 및 연수
2nd row방문
3rd row여행
4th row방문
5th row방문

Common Values

ValueCountFrequency (%)
방문 48
40.7%
여행 45
38.1%
교육 및 연수 25
21.2%

Length

2023-12-13T07:57:24.677321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:57:24.770908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
방문 48
28.6%
여행 45
26.8%
교육 25
14.9%
25
14.9%
연수 25
14.9%

견학인원
Real number (ℝ)

Distinct33
Distinct (%)28.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.220339
Minimum1
Maximum80
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-13T07:57:25.155645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q315
95-th percentile47.35
Maximum80
Range79
Interquartile range (IQR)12

Descriptive statistics

Standard deviation15.648454
Coefficient of variation (CV)1.2805254
Kurtosis5.1119759
Mean12.220339
Median Absolute Deviation (MAD)3
Skewness2.2507438
Sum1442
Variance244.87411
MonotonicityNot monotonic
2023-12-13T07:57:25.304168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
4 16
13.6%
2 15
12.7%
3 13
11.0%
5 13
11.0%
1 8
 
6.8%
15 5
 
4.2%
10 5
 
4.2%
6 4
 
3.4%
20 4
 
3.4%
9 3
 
2.5%
Other values (23) 32
27.1%
ValueCountFrequency (%)
1 8
6.8%
2 15
12.7%
3 13
11.0%
4 16
13.6%
5 13
11.0%
6 4
 
3.4%
7 2
 
1.7%
8 1
 
0.8%
9 3
 
2.5%
10 5
 
4.2%
ValueCountFrequency (%)
80 1
0.8%
70 1
0.8%
60 2
1.7%
57 1
0.8%
55 1
0.8%
46 1
0.8%
43 1
0.8%
40 1
0.8%
38 1
0.8%
32 2
1.7%

지역
Categorical

Distinct15
Distinct (%)12.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
서울
44 
제주
32 
온라인
10 
경기
경북
Other values (10)
18 

Length

Max length3
Median length2
Mean length2.0847458
Min length2

Unique

Unique5 ?
Unique (%)4.2%

Sample

1st row제주
2nd row제주
3rd row온라인
4th row서울
5th row서울

Common Values

ValueCountFrequency (%)
서울 44
37.3%
제주 32
27.1%
온라인 10
 
8.5%
경기 8
 
6.8%
경북 6
 
5.1%
부산 4
 
3.4%
해외 3
 
2.5%
대구 2
 
1.7%
전남 2
 
1.7%
인천 2
 
1.7%
Other values (5) 5
 
4.2%

Length

2023-12-13T07:57:25.430463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울 44
37.3%
제주 32
27.1%
온라인 10
 
8.5%
경기 8
 
6.8%
경북 6
 
5.1%
부산 4
 
3.4%
해외 3
 
2.5%
대구 2
 
1.7%
전남 2
 
1.7%
인천 2
 
1.7%
Other values (5) 5
 
4.2%

Interactions

2023-12-13T07:57:24.100920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:57:25.499068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
견학일자견학유형견학인원지역
견학일자1.0000.7720.8870.000
견학유형0.7721.0000.5720.578
견학인원0.8870.5721.0000.000
지역0.0000.5780.0001.000
2023-12-13T07:57:25.573920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역견학유형
지역1.0000.303
견학유형0.3031.000
2023-12-13T07:57:25.646686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
견학인원견학유형지역
견학인원1.0000.4020.000
견학유형0.4021.0000.303
지역0.0000.3031.000

Missing values

2023-12-13T07:57:24.215199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:57:24.314052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

견학일자견학유형견학인원지역
02023-01-04교육 및 연수9제주
12023-01-06방문2제주
22023-01-10여행3온라인
32023-01-12방문6서울
42023-01-13방문5서울
52023-01-16방문6제주
62023-01-18방문13서울
72023-01-31교육 및 연수11서울
82023-02-06방문20경북
92023-02-06교육 및 연수1온라인
견학일자견학유형견학인원지역
1082023-06-28방문30제주
1092023-06-28방문15제주
1102023-06-29방문5서울
1112023-06-29여행1서울
1122023-06-29여행5부산
1132023-06-29여행7인천
1142023-06-29방문1서울
1152023-06-30여행4제주
1162023-06-30여행3서울
1172023-06-30방문10서울

Duplicate rows

Most frequently occurring

견학일자견학유형견학인원지역# duplicates
02023-06-21여행4제주2
12023-06-28방문15제주2