Overview

Dataset statistics

Number of variables: 6
Number of observations: 27
Missing cells: 3
Missing cells (%): 1.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.4 KiB
Average record size in memory: 54.9 B
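
The missing-cell percentage can be checked by hand from the three counts listed in this block. The sketch below is only an arithmetic check; the variable names are illustrative and do not come from the report.

```python
# Counts copied from the dataset statistics block.
n_variables = 6
n_observations = 27
missing_cells = 3

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations

# Share of cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

# One missing value in a single 27-row column, as a percentage.
per_column_missing_pct = 100 * 1 / n_observations

print(f"{missing_pct:.1f}%")             # rounded to one decimal place
print(f"{per_column_missing_pct:.1f}%")
```

The same 1-in-27 ratio explains why every single-occurrence value in the frequency tables is reported as 3.7%.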

Variable types

Numeric: 2
Text: 1
Categorical: 3

Dataset

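Description: National competition information for Imsil-gun, Jeollabuk-do. According to the publisher, the detailed records include the serial number, competition name, venue details, registration period, competition period, host, organizer, sponsor name, website address, and contact information.
Author: Imsil-gun, Jeollabuk-do (전라북도 임실군)
URL: https://www.data.go.kr/data/15027681/fileData.do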

Alerts

순번 is highly overall correlated with 유형 [High correlation]
예산(천원) is highly overall correlated with 개최시기 and 1 other field [High correlation]
개최시기 is highly overall correlated with 예산(천원) and 2 other fields [High correlation]
유형 is highly overall correlated with 순번 and 1 other field [High correlation]
구분 is highly overall correlated with 예산(천원) and 1 other field [High correlation]
유형 is highly imbalanced (61.8%) [Imbalance]
순번 has 1 (3.7%) missing value [Missing]
대회명 has 1 (3.7%) missing value [Missing]
예산(천원) has 1 (3.7%) missing value [Missing]
예산(천원) has 1 (3.7%) zero [Zeros]
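
The 61.8% imbalance reported for 유형 is consistent with a normalized-entropy score, 1 - H / log2(k), applied to the class counts 24, 2, and 1 from that variable's Common Values table. The report does not state its formula, so treat this as an assumption that happens to reproduce the figure. The function name `imbalance_score` is hypothetical.

```python
import math

# Class counts for 유형 from its Common Values table: 생활, 전문, <NA>.
counts = [24, 2, 1]

def imbalance_score(class_counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits.

    0.0 means the classes are evenly split; values near 1.0 mean one
    class dominates. Returning 0.0 for fewer than two classes is a
    choice made for this sketch.
    """
    k = len(class_counts)
    if k < 2:
        return 0.0
    total = sum(class_counts)
    entropy = -sum(
        (c / total) * math.log2(c / total) for c in class_counts if c > 0
    )
    return 1 - entropy / math.log2(k)

score = imbalance_score(counts)
print(f"{100 * score:.1f}%")
```

With these counts the dominant class holds 24 of 27 rows, which is what pushes the score well above zero.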

Reproduction

Analysis started: 2023-12-12 17:23:09.262221
Analysis finished: 2023-12-12 17:23:10.602141
Duration: 1.34 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 26
Distinct (%): 100.0%
Missing: 1
Missing (%): 3.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.5
Minimum: 1
Maximum: 26
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 375.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.25
Q1: 7.25
Median: 13.5
Q3: 19.75
95-th percentile: 24.75
Maximum: 26
Range: 25
Interquartile range (IQR): 12.5

Descriptive statistics

Standard deviation: 7.6485293
Coefficient of variation (CV): 0.56655772
Kurtosis: -1.2
Mean: 13.5
Median Absolute Deviation (MAD): 6.5
Skewness: 0
Sum: 351
Variance: 58.5
Monotonicity: Strictly increasing
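
Because 순번 has 26 distinct, strictly increasing values running from 1 to 26, its statistics can be recomputed directly. The sketch below assumes the non-missing values are exactly the integers 1 through 26, and that quantiles use linear interpolation between the minimum and maximum (the "inclusive" method in Python's `statistics` module). Both assumptions are inferred from the reported figures, not stated by the report.

```python
import math
import statistics

# Assumption: the 26 non-missing 순번 values are the integers 1..26.
values = list(range(1, 27))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (divides by n - 1)
std_dev = math.sqrt(variance)
cv = std_dev / mean                       # coefficient of variation

# Median absolute deviation: median distance from the median.
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# Quartiles and the 5th/95th percentiles with linear interpolation.
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentiles[0], twentiles[-1]

print(total, mean, variance, round(std_dev, 7), round(cv, 8), mad)
print(p5, q1, q2, q3, p95, q3 - q1)
```

The sample variance uses the n - 1 denominator, which is what yields 58.5 here. Dividing by n instead would give 56.25.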
Histogram with fixed size bins (bins=26)

Value | Count | Frequency (%)
1 | 1 | 3.7%
15 | 1 | 3.7%
26 | 1 | 3.7%
25 | 1 | 3.7%
24 | 1 | 3.7%
23 | 1 | 3.7%
22 | 1 | 3.7%
21 | 1 | 3.7%
20 | 1 | 3.7%
19 | 1 | 3.7%
Other values (16) | 16 | 59.3%

Value | Count | Frequency (%)
1 | 1 | 3.7%
2 | 1 | 3.7%
3 | 1 | 3.7%
4 | 1 | 3.7%
5 | 1 | 3.7%
6 | 1 | 3.7%
7 | 1 | 3.7%
8 | 1 | 3.7%
9 | 1 | 3.7%
10 | 1 | 3.7%

Value | Count | Frequency (%)
26 | 1 | 3.7%
25 | 1 | 3.7%
24 | 1 | 3.7%
23 | 1 | 3.7%
22 | 1 | 3.7%
21 | 1 | 3.7%
20 | 1 | 3.7%
19 | 1 | 3.7%
18 | 1 | 3.7%
17 | 1 | 3.7%

대회명
Text

MISSING 

Distinct: 26
Distinct (%): 100.0%
Missing: 1
Missing (%): 3.7%
Memory size: 348.0 B

Length

Max length: 20
Median length: 18
Mean length: 14.961538
Min length: 10

Characters and Unicode

Total characters: 389
Distinct characters: 98
Distinct categories: 4
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26
Unique (%): 100.0%

Sample

1st row: 전라북도민체전 참가
2nd row: 전북어르신생활체육대회 참가
3rd row: 전북역전마라톤대회 참가
4th row: 전북 씨름왕 선발대회 참가
5th row: 임실군수배 체육대회

Value | Count | Frequency (%)
참가 | 9 | 11.1%
전국 | 8 | 9.9%
임실n치즈배 | 7 | 8.6%
체육대회 | 5 | 6.2%
동호인 | 3 | 3.7%
기념 | 2 | 2.5%
사격대회 | 2 | 2.5%
대회 | 2 | 2.5%
양궁대회 | 2 | 2.5%
전북여성생활체육대회 | 2 | 2.5%
Other values (38) | 39 | 48.1%

Most occurring characters

ValueCountFrequency (%)
56
 
14.4%
25
 
6.4%
24
 
6.2%
19
 
4.9%
15
 
3.9%
12
 
3.1%
12
 
3.1%
12
 
3.1%
10
 
2.6%
10
 
2.6%
Other values (88) 194
49.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 324 | 83.3%
Space Separator | 56 | 14.4%
Uppercase Letter | 8 | 2.1%
Other Punctuation | 1 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
7.7%
24
 
7.4%
19
 
5.9%
15
 
4.6%
12
 
3.7%
12
 
3.7%
12
 
3.7%
10
 
3.1%
10
 
3.1%
9
 
2.8%
Other values (85) 176
54.3%
Space Separator
ValueCountFrequency (%)
56
100.0%
Uppercase Letter
ValueCountFrequency (%)
N 8
100.0%
Other Punctuation
ValueCountFrequency (%)
· 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 324 | 83.3%
Common | 57 | 14.7%
Latin | 8 | 2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
7.7%
24
 
7.4%
19
 
5.9%
15
 
4.6%
12
 
3.7%
12
 
3.7%
12
 
3.7%
10
 
3.1%
10
 
3.1%
9
 
2.8%
Other values (85) 176
54.3%
Common
ValueCountFrequency (%)
56
98.2%
· 1
 
1.8%
Latin
ValueCountFrequency (%)
N 8
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 324 | 83.3%
ASCII | 64 | 16.5%
None | 1 | 0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
56
87.5%
N 8
 
12.5%
Hangul
ValueCountFrequency (%)
25
 
7.7%
24
 
7.4%
19
 
5.9%
15
 
4.6%
12
 
3.7%
12
 
3.7%
12
 
3.7%
10
 
3.1%
10
 
3.1%
9
 
2.8%
Other values (85) 176
54.3%
None
ValueCountFrequency (%)
· 1
100.0%

개최시기
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 25.9%
Missing: 0
Missing (%): 0.0%
Memory size: 348.0 B

Length

Max length: 4
Median length: 4
Mean length: 3.7777778
Min length: 2

Unique

Unique: 4
Unique (%): 14.8%

Sample

1st row: 9월중
2nd row: 6월중
3rd row: 11월중
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
10월중 | 11 | 40.7%
<NA> | 10 | 37.0%
6월중 | 2 | 7.4%
9월중 | 1 | 3.7%
11월중 | 1 | 3.7%
연중 | 1 | 3.7%
8월중 | 1 | 3.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
10월중 | 11 | 40.7%
na | 10 | 37.0%
6월중 | 2 | 7.4%
9월중 | 1 | 3.7%
11월중 | 1 | 3.7%
연중 | 1 | 3.7%
8월중 | 1 | 3.7%

예산(천원)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct: 16
Distinct (%): 61.5%
Missing: 1
Missing (%): 3.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 29726.538
Minimum: 0
Maximum: 252000
Zeros: 1
Zeros (%): 3.7%
Negative: 0
Negative (%): 0.0%
Memory size: 375.0 B

Quantile statistics

Minimum: 0
5-th percentile: 3450
Q1: 10000
Median: 20000
Q3: 27500
95-th percentile: 75500
Maximum: 252000
Range: 252000
Interquartile range (IQR): 17500

Descriptive statistics

Standard deviation: 48951.872
Coefficient of variation (CV): 1.6467397
Kurtosis: 18.326282
Mean: 29726.538
Median Absolute Deviation (MAD): 10000
Skewness: 4.0701258
Sum: 772890
Variance: 2.3962858 × 10^9
Monotonicity: Not monotonic
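
The 예산(천원) figures can be cross-checked by rebuilding the column from the value counts in this variable's frequency tables. The sketch assumes those tables together list every non-missing value (16 distinct values, 26 rows). The helper `percentile` is hypothetical and uses plain linear interpolation, which is an assumption about how the report computes its quantiles.

```python
import math
import statistics

# Assumption: (value, count) pairs merged from the frequency tables for
# 예산(천원); the single missing value is excluded.
value_counts = {
    0: 1, 3000: 1, 4800: 1, 5000: 1, 6000: 1, 8090: 1,
    10000: 3, 12000: 1, 15000: 2, 20000: 7, 30000: 2,
    34000: 1, 40000: 1, 71000: 1, 77000: 1, 252000: 1,
}
budgets = sorted(v for v, n in value_counts.items() for _ in range(n))

def percentile(sorted_values, q):
    """Linear-interpolation percentile for q in [0, 1] on sorted data."""
    pos = (len(sorted_values) - 1) * q
    lower = math.floor(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] * (1 - frac) + sorted_values[upper] * frac

total = sum(budgets)
mean = total / len(budgets)
std_dev = statistics.stdev(budgets)       # sample standard deviation
cv = std_dev / mean                        # coefficient of variation
median = statistics.median(budgets)
mad = statistics.median(abs(b - median) for b in budgets)
q1, q3 = percentile(budgets, 0.25), percentile(budgets, 0.75)

print(len(budgets), total, round(mean, 3), round(std_dev, 3), round(cv, 7))
print(percentile(budgets, 0.05), q1, median, q3, percentile(budgets, 0.95))
```

The single 252000 entry sits far above Q3, which accounts for the large gap between the mean and the median and for the high skewness and kurtosis.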
Histogram with fixed size bins (bins=16)

Value | Count | Frequency (%)
20000 | 7 | 25.9%
10000 | 3 | 11.1%
15000 | 2 | 7.4%
30000 | 2 | 7.4%
252000 | 1 | 3.7%
34000 | 1 | 3.7%
6000 | 1 | 3.7%
4800 | 1 | 3.7%
8090 | 1 | 3.7%
5000 | 1 | 3.7%
Other values (6) | 6 | 22.2%

Value | Count | Frequency (%)
0 | 1 | 3.7%
3000 | 1 | 3.7%
4800 | 1 | 3.7%
5000 | 1 | 3.7%
6000 | 1 | 3.7%
8090 | 1 | 3.7%
10000 | 3 | 11.1%
12000 | 1 | 3.7%
15000 | 2 | 7.4%
20000 | 7 | 25.9%

Value | Count | Frequency (%)
252000 | 1 | 3.7%
77000 | 1 | 3.7%
71000 | 1 | 3.7%
40000 | 1 | 3.7%
34000 | 1 | 3.7%
30000 | 2 | 7.4%
20000 | 7 | 25.9%
15000 | 2 | 7.4%
12000 | 1 | 3.7%
10000 | 3 | 11.1%

유형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 11.1%
Missing: 0
Missing (%): 0.0%
Memory size: 348.0 B

Length

Max length: 4
Median length: 2
Mean length: 2.0740741
Min length: 2

Unique

Unique: 1
Unique (%): 3.7%

Sample

1st row: 생활
2nd row: 생활
3rd row: 생활
4th row: 생활
5th row: 생활

Common Values

Value | Count | Frequency (%)
생활 | 24 | 88.9%
전문 | 2 | 7.4%
<NA> | 1 | 3.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
생활 | 24 | 88.9%
전문 | 2 | 7.4%
na | 1 | 3.7%

구분
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 14.8%
Missing: 0
Missing (%): 0.0%
Memory size: 348.0 B

Length

Max length: 6
Median length: 2
Mean length: 2.2222222
Min length: 2

Unique

Unique: 2
Unique (%): 7.4%

Sample

1st row: 출전
2nd row: 출전
3rd row: 출전
4th row: 출전
5th row: 개최

Common Values

Value | Count | Frequency (%)
개최 | 17 | 63.0%
출전 | 8 | 29.6%
개최, 출전 | 1 | 3.7%
<NA> | 1 | 3.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
개최 | 18 | 64.3%
출전 | 9 | 32.1%
na | 1 | 3.6%

Interactions

[Figures: four interaction plots]

Correlations

Variable | 순번 | 대회명 | 개최시기 | 예산(천원) | 유형 | 구분
순번 | 1.000 | 1.000 | 0.647 | 0.488 | 0.239 | 0.528
대회명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
개최시기 | 0.647 | 1.000 | 1.000 | 0.880 | 1.000 | 0.978
예산(천원) | 0.488 | 1.000 | 0.880 | 1.000 | 0.000 | 0.728
유형 | 0.239 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000
구분 | 0.528 | 1.000 | 0.978 | 0.728 | 0.000 | 1.000

Variable | 구분 | 유형 | 개최시기
구분 | 1.000 | 0.000 | 0.719
유형 | 0.000 | 1.000 | 0.856
개최시기 | 0.719 | 0.856 | 1.000

Variable | 순번 | 예산(천원) | 개최시기 | 유형 | 구분
순번 | 1.000 | -0.217 | 0.492 | 0.539 | 0.317
예산(천원) | -0.217 | 1.000 | 0.688 | 0.000 | 0.687
개최시기 | 0.492 | 0.688 | 1.000 | 0.856 | 0.719
유형 | 0.539 | 0.000 | 0.856 | 1.000 | 0.000
구분 | 0.317 | 0.687 | 0.719 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

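Index | 순번 | 대회명 | 개최시기 | 예산(천원) | 유형 | 구분
0 | 1 | 전라북도민체전 참가 | 9월중 | 252000 | 생활 | 출전
1 | 2 | 전북어르신생활체육대회 참가 | 6월중 | 10000 | 생활 | 출전
2 | 3 | 전북역전마라톤대회 참가 | 11월중 | 10000 | 생활 | 출전
3 | 4 | 전북 씨름왕 선발대회 참가 | <NA> | 3000 | 생활 | 출전
4 | 5 | 임실군수배 체육대회 | <NA> | 71000 | 생활 | 개최
5 | 6 | 각종체육경기 참가 및 개최 | 연중 | 77000 | 생활 | 개최, 출전
6 | 7 | 임실군민의 날 기념 체육대회 | 10월중 | 0 | 생활 | 개최
7 | 8 | 전북여성생활체육대회 참가 | 10월중 | 12000 | 생활 | 출전
8 | 9 | 전북여성생활체육대회 유치 | <NA> | 15000 | 생활 | 개최
9 | 10 | 임실치즈 그란폰도·메디오폰도 대회 | 10월중 | 40000 | 생활 | 개최

Index | 순번 | 대회명 | 개최시기 | 예산(천원) | 유형 | 구분
17 | 18 | 임실N치즈배 전국생활체육 양궁대회 | 10월중 | 15000 | 생활 | 개최
18 | 19 | 문체부장관기 전국학생 사격대회 | 8월중 | 20000 | 전문 | 개최
19 | 20 | 전국 생활체육 사격대회 | <NA> | 5000 | 생활 | 개최
20 | 21 | 화랑기 시도대항 양궁대회 | <NA> | 30000 | 전문 | 개최
21 | 22 | 연예인 초청 자선 골프대회 | 6월중 | 20000 | 생활 | 개최
22 | 23 | 전라북도장애인체육대회 참가 | <NA> | 20000 | 생활 | 출전
23 | 24 | 전국 장애인 체육대회 참가 | <NA> | 8090 | 생활 | 출전
24 | 25 | 지체장애인 체육대회 참가 | <NA> | 4800 | 생활 | 출전
25 | 26 | 임실군장애인 한마음 체육대회 | <NA> | 6000 | 생활 | 개최
26 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>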