Overview

Dataset statistics

Number of variables13
Number of observations10000
Missing cells51767
Missing cells (%)39.8%
Duplicate rows113
Duplicate rows (%)1.1%
Total size in memory1.1 MiB
Average record size in memory117.0 B

Variable types

Numeric5
DateTime4
Text3
Categorical1

Dataset

Description경상남도 도립미술관 티켓 판매 취소 내역 현황입니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15039895

Alerts

Dataset has 113 (1.1%) duplicate rowsDuplicates
티켓장수 is highly overall correlated with 원금 and 1 other fieldsHigh correlation
원금 is highly overall correlated with 티켓장수 and 1 other fieldsHigh correlation
티켓가격 is highly overall correlated with 티켓장수 and 1 other fieldsHigh correlation
티켓장수 has 3654 (36.5%) missing valuesMissing
원금 has 3654 (36.5%) missing valuesMissing
구입날짜 has 3654 (36.5%) missing valuesMissing
구입시간 has 3654 (36.5%) missing valuesMissing
구입일시 has 3654 (36.5%) missing valuesMissing
취소일시 has 9935 (99.4%) missing valuesMissing
티켓가격 has 3654 (36.5%) missing valuesMissing
내국인 has 9954 (99.5%) missing valuesMissing
외국인 has 9954 (99.5%) missing valuesMissing
원금 is highly skewed (γ1 = 23.60461111)Skewed
티켓가격 is highly skewed (γ1 = 23.60461111)Skewed
원금 has 460 (4.6%) zerosZeros
티켓가격 has 460 (4.6%) zerosZeros

Reproduction

Analysis started2023-12-10 23:31:38.095962
Analysis finished2023-12-10 23:31:41.927041
Duration3.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

티켓장수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct62
Distinct (%)1.0%
Missing3654
Missing (%)36.5%
Infinite0
Infinite (%)0.0%
Mean3.1786952
Minimum1
Maximum177
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T08:31:41.985384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q33
95-th percentile5
Maximum177
Range176
Interquartile range (IQR)1

Descriptive statistics

Standard deviation6.1493285
Coefficient of variation (CV)1.9345448
Kurtosis290.94862
Mean3.1786952
Median Absolute Deviation (MAD)1
Skewness14.221513
Sum20172
Variance37.814241
MonotonicityNot monotonic
2023-12-11T08:31:42.096502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 2845
28.4%
3 1185
 
11.8%
1 1048
 
10.5%
4 756
 
7.6%
5 205
 
2.1%
6 76
 
0.8%
8 30
 
0.3%
7 26
 
0.3%
9 15
 
0.1%
10 15
 
0.1%
Other values (52) 145
 
1.5%
(Missing) 3654
36.5%
ValueCountFrequency (%)
1 1048
 
10.5%
2 2845
28.4%
3 1185
11.8%
4 756
 
7.6%
5 205
 
2.1%
6 76
 
0.8%
7 26
 
0.3%
8 30
 
0.3%
9 15
 
0.1%
10 15
 
0.1%
ValueCountFrequency (%)
177 1
< 0.1%
165 1
< 0.1%
157 1
< 0.1%
101 1
< 0.1%
89 1
< 0.1%
86 1
< 0.1%
80 2
< 0.1%
76 1
< 0.1%
74 1
< 0.1%
71 1
< 0.1%

원금
Real number (ℝ)

HIGH CORRELATION  MISSING  SKEWED  ZEROS 

Distinct71
Distinct (%)1.1%
Missing3654
Missing (%)36.5%
Infinite0
Infinite (%)0.0%
Mean1668.0744
Minimum0
Maximum78000
Zeros460
Zeros (%)4.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T08:31:42.212920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11000
median1700
Q32000
95-th percentile3000
Maximum78000
Range78000
Interquartile range (IQR)1000

Descriptive statistics

Standard deviation1653.8592
Coefficient of variation (CV)0.99147806
Kurtosis910.30389
Mean1668.0744
Median Absolute Deviation (MAD)300
Skewness23.604611
Sum10585600
Variance2735250.1
MonotonicityNot monotonic
2023-12-11T08:31:42.333364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2000 1937
19.4%
1000 1138
 
11.4%
1400 650
 
6.5%
0 460
 
4.6%
1500 382
 
3.8%
2500 285
 
2.9%
3000 271
 
2.7%
1700 266
 
2.7%
700 262
 
2.6%
500 153
 
1.5%
Other values (61) 542
 
5.4%
(Missing) 3654
36.5%
ValueCountFrequency (%)
0 460
4.6%
250 5
 
0.1%
350 9
 
0.1%
500 153
 
1.5%
700 262
 
2.6%
750 28
 
0.3%
850 4
 
< 0.1%
1000 1138
11.4%
1050 1
 
< 0.1%
1100 6
 
0.1%
ValueCountFrequency (%)
78000 1
< 0.1%
48900 1
< 0.1%
45000 1
< 0.1%
27500 1
< 0.1%
21300 1
< 0.1%
18500 1
< 0.1%
17500 1
< 0.1%
17000 1
< 0.1%
12400 1
< 0.1%
10000 2
< 0.1%

구입날짜
Date

MISSING 

Distinct5905
Distinct (%)93.1%
Missing3654
Missing (%)36.5%
Memory size156.2 KiB
Minimum2019-02-08 15:50:00
Maximum2019-09-24 17:53:00
2023-12-11T08:31:42.452571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:42.569226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

구입시간
Date

MISSING 

Distinct5905
Distinct (%)93.1%
Missing3654
Missing (%)36.5%
Memory size156.2 KiB
Minimum2019-02-08 15:50:00
Maximum2019-09-24 17:53:00
2023-12-11T08:31:42.695660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:42.884126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

구입일시
Date

MISSING 

Distinct5905
Distinct (%)93.1%
Missing3654
Missing (%)36.5%
Memory size156.2 KiB
Minimum2019-02-08 15:50:00
Maximum2019-09-24 17:53:00
2023-12-11T08:31:43.262872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:43.425878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

취소일시
Date

MISSING 

Distinct59
Distinct (%)90.8%
Missing9935
Missing (%)99.4%
Memory size156.2 KiB
Minimum2019-02-08 15:50:00
Maximum2019-08-18 14:42:00
2023-12-11T08:31:43.551779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:43.666280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct308
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T08:31:43.887073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15
Mean length8.9623
Min length6

Characters and Unicode

Total characters89623
Distinct characters70
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique145 ?
Unique (%)1.5%

Sample

1st row유아 무료:1명,
2nd row어른 유료:2명
3rd row어른 유료:1명,
4th row어른 유료:1명,
5th row경로 무료:1명,
ValueCountFrequency (%)
어른 4494
21.9%
유료:2명 3316
16.2%
유료:1명 3223
15.7%
유아 1986
9.7%
무료:1명 1515
 
7.4%
청군경 1482
 
7.2%
어린이 1116
 
5.4%
무료:2명 598
 
2.9%
할인 528
 
2.6%
제로페이 292
 
1.4%
Other values (134) 1970
9.6%
2023-12-11T08:31:44.210174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10520
11.7%
10000
11.2%
: 10000
11.2%
9335
10.4%
8964
10.0%
6101
 
6.8%
1 5172
 
5.8%
4869
 
5.4%
2 4180
 
4.7%
, 3608
 
4.0%
Other values (60) 16874
18.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 55339
61.7%
Other Punctuation 13608
 
15.2%
Space Separator 10520
 
11.7%
Decimal Number 10156
 
11.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10000
18.1%
9335
16.9%
8964
16.2%
6101
11.0%
4869
8.8%
2399
 
4.3%
1986
 
3.6%
1672
 
3.0%
1524
 
2.8%
1516
 
2.7%
Other values (47) 6973
12.6%
Decimal Number
ValueCountFrequency (%)
1 5172
50.9%
2 4180
41.2%
3 399
 
3.9%
4 165
 
1.6%
5 90
 
0.9%
6 47
 
0.5%
7 31
 
0.3%
0 31
 
0.3%
8 22
 
0.2%
9 19
 
0.2%
Other Punctuation
ValueCountFrequency (%)
: 10000
73.5%
, 3608
 
26.5%
Space Separator
ValueCountFrequency (%)
10520
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 55339
61.7%
Common 34284
38.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10000
18.1%
9335
16.9%
8964
16.2%
6101
11.0%
4869
8.8%
2399
 
4.3%
1986
 
3.6%
1672
 
3.0%
1524
 
2.8%
1516
 
2.7%
Other values (47) 6973
12.6%
Common
ValueCountFrequency (%)
10520
30.7%
: 10000
29.2%
1 5172
15.1%
2 4180
 
12.2%
, 3608
 
10.5%
3 399
 
1.2%
4 165
 
0.5%
5 90
 
0.3%
6 47
 
0.1%
7 31
 
0.1%
Other values (3) 72
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 55339
61.7%
ASCII 34284
38.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10520
30.7%
: 10000
29.2%
1 5172
15.1%
2 4180
 
12.2%
, 3608
 
10.5%
3 399
 
1.2%
4 165
 
0.5%
5 90
 
0.3%
6 47
 
0.1%
7 31
 
0.1%
Other values (3) 72
 
0.2%
Hangul
ValueCountFrequency (%)
10000
18.1%
9335
16.9%
8964
16.2%
6101
11.0%
4869
8.8%
2399
 
4.3%
1986
 
3.6%
1672
 
3.0%
1524
 
2.8%
1516
 
2.7%
Other values (47) 6973
12.6%

티켓가격
Real number (ℝ)

HIGH CORRELATION  MISSING  SKEWED  ZEROS 

Distinct71
Distinct (%)1.1%
Missing3654
Missing (%)36.5%
Infinite0
Infinite (%)0.0%
Mean1668.0744
Minimum0
Maximum78000
Zeros460
Zeros (%)4.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T08:31:44.332802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11000
median1700
Q32000
95-th percentile3000
Maximum78000
Range78000
Interquartile range (IQR)1000

Descriptive statistics

Standard deviation1653.8592
Coefficient of variation (CV)0.99147806
Kurtosis910.30389
Mean1668.0744
Median Absolute Deviation (MAD)300
Skewness23.604611
Sum10585600
Variance2735250.1
MonotonicityNot monotonic
2023-12-11T08:31:44.472493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2000 1937
19.4%
1000 1138
 
11.4%
1400 650
 
6.5%
0 460
 
4.6%
1500 382
 
3.8%
2500 285
 
2.9%
3000 271
 
2.7%
1700 266
 
2.7%
700 262
 
2.6%
500 153
 
1.5%
Other values (61) 542
 
5.4%
(Missing) 3654
36.5%
ValueCountFrequency (%)
0 460
4.6%
250 5
 
0.1%
350 9
 
0.1%
500 153
 
1.5%
700 262
 
2.6%
750 28
 
0.3%
850 4
 
< 0.1%
1000 1138
11.4%
1050 1
 
< 0.1%
1100 6
 
0.1%
ValueCountFrequency (%)
78000 1
< 0.1%
48900 1
< 0.1%
45000 1
< 0.1%
27500 1
< 0.1%
21300 1
< 0.1%
18500 1
< 0.1%
17500 1
< 0.1%
17000 1
< 0.1%
12400 1
< 0.1%
10000 2
< 0.1%

결제구분
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
현금
4053 
<NA>
3654 
카드
2293 

Length

Max length4
Median length2
Mean length2.7308
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row카드
2nd row현금
3rd row현금
4th row현금
5th row<NA>

Common Values

ValueCountFrequency (%)
현금 4053
40.5%
<NA> 3654
36.5%
카드 2293
22.9%

Length

2023-12-11T08:31:44.605747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:31:44.699503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
현금 4053
40.5%
na 3654
36.5%
카드 2293
22.9%

내국인
Real number (ℝ)

MISSING 

Distinct44
Distinct (%)95.7%
Missing9954
Missing (%)99.5%
Infinite0
Infinite (%)0.0%
Mean443.13043
Minimum0
Maximum1104
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T08:31:44.794111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile73.75
Q1223
median354
Q3551.5
95-th percentile1019.5
Maximum1104
Range1104
Interquartile range (IQR)328.5

Descriptive statistics

Standard deviation302.22807
Coefficient of variation (CV)0.68202959
Kurtosis-0.15754635
Mean443.13043
Median Absolute Deviation (MAD)159
Skewness0.90374347
Sum20384
Variance91341.805
MonotonicityNot monotonic
2023-12-11T08:31:44.933111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=44)
ValueCountFrequency (%)
297 2
 
< 0.1%
1000 2
 
< 0.1%
506 1
 
< 0.1%
911 1
 
< 0.1%
561 1
 
< 0.1%
317 1
 
< 0.1%
188 1
 
< 0.1%
570 1
 
< 0.1%
179 1
 
< 0.1%
376 1
 
< 0.1%
Other values (34) 34
 
0.3%
(Missing) 9954
99.5%
ValueCountFrequency (%)
0 1
< 0.1%
43 1
< 0.1%
51 1
< 0.1%
142 1
< 0.1%
143 1
< 0.1%
149 1
< 0.1%
162 1
< 0.1%
179 1
< 0.1%
187 1
< 0.1%
188 1
< 0.1%
ValueCountFrequency (%)
1104 1
< 0.1%
1077 1
< 0.1%
1026 1
< 0.1%
1000 2
< 0.1%
971 1
< 0.1%
953 1
< 0.1%
911 1
< 0.1%
731 1
< 0.1%
573 1
< 0.1%
570 1
< 0.1%

외국인
Real number (ℝ)

MISSING 

Distinct6
Distinct (%)13.0%
Missing9954
Missing (%)99.5%
Infinite0
Infinite (%)0.0%
Mean6.3043478
Minimum0
Maximum162
Zeros39
Zeros (%)0.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T08:31:45.037582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile25.5
Maximum162
Range162
Interquartile range (IQR)0

Descriptive statistics

Standard deviation26.49014
Coefficient of variation (CV)4.2018843
Kurtosis28.472252
Mean6.3043478
Median Absolute Deviation (MAD)0
Skewness5.1805517
Sum290
Variance701.72754
MonotonicityNot monotonic
2023-12-11T08:31:45.152092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 39
 
0.4%
2 3
 
< 0.1%
162 1
 
< 0.1%
78 1
 
< 0.1%
29 1
 
< 0.1%
15 1
 
< 0.1%
(Missing) 9954
99.5%
ValueCountFrequency (%)
0 39
0.4%
2 3
 
< 0.1%
15 1
 
< 0.1%
29 1
 
< 0.1%
78 1
 
< 0.1%
162 1
 
< 0.1%
ValueCountFrequency (%)
162 1
 
< 0.1%
78 1
 
< 0.1%
29 1
 
< 0.1%
15 1
 
< 0.1%
2 3
 
< 0.1%
0 39
0.4%
Distinct308
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T08:31:45.368668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15
Mean length8.9623
Min length6

Characters and Unicode

Total characters89623
Distinct characters70
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique145 ?
Unique (%)1.5%

Sample

1st row유아 무료:1명,
2nd row어른 유료:2명
3rd row어른 유료:1명,
4th row어른 유료:1명,
5th row경로 무료:1명,
ValueCountFrequency (%)
어른 4494
21.9%
유료:2명 3316
16.2%
유료:1명 3223
15.7%
유아 1986
9.7%
무료:1명 1515
 
7.4%
청군경 1482
 
7.2%
어린이 1116
 
5.4%
무료:2명 598
 
2.9%
할인 528
 
2.6%
제로페이 292
 
1.4%
Other values (134) 1970
9.6%
2023-12-11T08:31:45.720910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10520
11.7%
10000
11.2%
: 10000
11.2%
9335
10.4%
8964
10.0%
6101
 
6.8%
1 5172
 
5.8%
4869
 
5.4%
2 4180
 
4.7%
, 3608
 
4.0%
Other values (60) 16874
18.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 55339
61.7%
Other Punctuation 13608
 
15.2%
Space Separator 10520
 
11.7%
Decimal Number 10156
 
11.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10000
18.1%
9335
16.9%
8964
16.2%
6101
11.0%
4869
8.8%
2399
 
4.3%
1986
 
3.6%
1672
 
3.0%
1524
 
2.8%
1516
 
2.7%
Other values (47) 6973
12.6%
Decimal Number
ValueCountFrequency (%)
1 5172
50.9%
2 4180
41.2%
3 399
 
3.9%
4 165
 
1.6%
5 90
 
0.9%
6 47
 
0.5%
7 31
 
0.3%
0 31
 
0.3%
8 22
 
0.2%
9 19
 
0.2%
Other Punctuation
ValueCountFrequency (%)
: 10000
73.5%
, 3608
 
26.5%
Space Separator
ValueCountFrequency (%)
10520
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 55339
61.7%
Common 34284
38.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10000
18.1%
9335
16.9%
8964
16.2%
6101
11.0%
4869
8.8%
2399
 
4.3%
1986
 
3.6%
1672
 
3.0%
1524
 
2.8%
1516
 
2.7%
Other values (47) 6973
12.6%
Common
ValueCountFrequency (%)
10520
30.7%
: 10000
29.2%
1 5172
15.1%
2 4180
 
12.2%
, 3608
 
10.5%
3 399
 
1.2%
4 165
 
0.5%
5 90
 
0.3%
6 47
 
0.1%
7 31
 
0.1%
Other values (3) 72
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 55339
61.7%
ASCII 34284
38.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10520
30.7%
: 10000
29.2%
1 5172
15.1%
2 4180
 
12.2%
, 3608
 
10.5%
3 399
 
1.2%
4 165
 
0.5%
5 90
 
0.3%
6 47
 
0.1%
7 31
 
0.1%
Other values (3) 72
 
0.2%
Hangul
ValueCountFrequency (%)
10000
18.1%
9335
16.9%
8964
16.2%
6101
11.0%
4869
8.8%
2399
 
4.3%
1986
 
3.6%
1672
 
3.0%
1524
 
2.8%
1516
 
2.7%
Other values (47) 6973
12.6%
Distinct308
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T08:31:45.963878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15
Mean length8.9623
Min length6

Characters and Unicode

Total characters89623
Distinct characters70
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique145 ?
Unique (%)1.5%

Sample

1st row유아 무료:1명,
2nd row어른 유료:2명
3rd row어른 유료:1명,
4th row어른 유료:1명,
5th row경로 무료:1명,
ValueCountFrequency (%)
어른 4494
21.9%
유료:2명 3316
16.2%
유료:1명 3223
15.7%
유아 1986
9.7%
무료:1명 1515
 
7.4%
청군경 1482
 
7.2%
어린이 1116
 
5.4%
무료:2명 598
 
2.9%
할인 528
 
2.6%
제로페이 292
 
1.4%
Other values (134) 1970
9.6%
2023-12-11T08:31:46.322380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10520
11.7%
10000
11.2%
: 10000
11.2%
9335
10.4%
8964
10.0%
6101
 
6.8%
1 5172
 
5.8%
4869
 
5.4%
2 4180
 
4.7%
, 3608
 
4.0%
Other values (60) 16874
18.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 55339
61.7%
Other Punctuation 13608
 
15.2%
Space Separator 10520
 
11.7%
Decimal Number 10156
 
11.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10000
18.1%
9335
16.9%
8964
16.2%
6101
11.0%
4869
8.8%
2399
 
4.3%
1986
 
3.6%
1672
 
3.0%
1524
 
2.8%
1516
 
2.7%
Other values (47) 6973
12.6%
Decimal Number
ValueCountFrequency (%)
1 5172
50.9%
2 4180
41.2%
3 399
 
3.9%
4 165
 
1.6%
5 90
 
0.9%
6 47
 
0.5%
7 31
 
0.3%
0 31
 
0.3%
8 22
 
0.2%
9 19
 
0.2%
Other Punctuation
ValueCountFrequency (%)
: 10000
73.5%
, 3608
 
26.5%
Space Separator
ValueCountFrequency (%)
10520
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 55339
61.7%
Common 34284
38.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10000
18.1%
9335
16.9%
8964
16.2%
6101
11.0%
4869
8.8%
2399
 
4.3%
1986
 
3.6%
1672
 
3.0%
1524
 
2.8%
1516
 
2.7%
Other values (47) 6973
12.6%
Common
ValueCountFrequency (%)
10520
30.7%
: 10000
29.2%
1 5172
15.1%
2 4180
 
12.2%
, 3608
 
10.5%
3 399
 
1.2%
4 165
 
0.5%
5 90
 
0.3%
6 47
 
0.1%
7 31
 
0.1%
Other values (3) 72
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 55339
61.7%
ASCII 34284
38.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10520
30.7%
: 10000
29.2%
1 5172
15.1%
2 4180
 
12.2%
, 3608
 
10.5%
3 399
 
1.2%
4 165
 
0.5%
5 90
 
0.3%
6 47
 
0.1%
7 31
 
0.1%
Other values (3) 72
 
0.2%
Hangul
ValueCountFrequency (%)
10000
18.1%
9335
16.9%
8964
16.2%
6101
11.0%
4869
8.8%
2399
 
4.3%
1986
 
3.6%
1672
 
3.0%
1524
 
2.8%
1516
 
2.7%
Other values (47) 6973
12.6%

Interactions

2023-12-11T08:31:41.053452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:38.794299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:39.696926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:40.184433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:40.694807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:41.128343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:39.311430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:39.797325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:40.291766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:40.762635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:41.204355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:39.443143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:39.913565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:40.427068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:40.835510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:41.286544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:39.542209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:40.021563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:40.540403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:40.914244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:41.402680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:39.613903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:40.098750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:40.614868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:31:40.978163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:31:46.415878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
티켓장수원금취소일시티켓가격결제구분내국인외국인
티켓장수1.0000.804NaN0.8040.0920.0000.000
원금0.8041.0001.0001.0000.052NaNNaN
취소일시NaN1.0001.0001.0001.000NaNNaN
티켓가격0.8041.0001.0001.0000.052NaNNaN
결제구분0.0920.0521.0000.0521.0000.0000.000
내국인0.000NaNNaNNaN0.0001.0000.428
외국인0.000NaNNaNNaN0.0000.4281.000
2023-12-11T08:31:46.523485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
티켓장수원금티켓가격내국인외국인결제구분
티켓장수1.0000.5320.532-0.056-0.0380.069
원금0.5321.0001.0000.327-0.0810.056
티켓가격0.5321.0001.0000.327-0.0810.056
내국인-0.0560.3270.3271.000-0.0570.000
외국인-0.038-0.081-0.081-0.0571.0000.000
결제구분0.0690.0560.0560.0000.0001.000

Missing values

2023-12-11T08:31:41.517413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:31:41.665149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T08:31:41.816716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

티켓장수원금구입날짜구입시간구입일시취소일시할인구분티켓가격결제구분내국인외국인요금분류유무료구분
15685210002019-07-04 15:432019-07-04 15:432019-07-04 15:43<NA>유아 무료:1명,1000카드<NA><NA>유아 무료:1명,유아 무료:1명,
24927220002019-05-10 17:232019-05-10 17:232019-05-10 17:23<NA>어른 유료:2명2000현금<NA><NA>어른 유료:2명어른 유료:2명
30451320002019-03-09 14:252019-03-09 14:252019-03-09 14:25<NA>어른 유료:1명,2000현금<NA><NA>어른 유료:1명,어른 유료:1명,
8746215002019-08-01 12:242019-08-01 12:242019-08-01 12:24<NA>어른 유료:1명,1500현금<NA><NA>어른 유료:1명,어른 유료:1명,
26627<NA><NA><NA><NA><NA><NA>경로 무료:1명,<NA><NA><NA><NA>경로 무료:1명,경로 무료:1명,
22017220002019-06-08 14:402019-06-08 14:402019-06-08 14:40<NA>어른 유료:2명2000현금<NA><NA>어른 유료:2명어른 유료:2명
15861425002019-07-03 17:002019-07-03 17:002019-07-03 17:00<NA>어른 유료:2명,2500카드<NA><NA>어른 유료:2명,어른 유료:2명,
13936<NA><NA><NA><NA><NA><NA>어린이 유료:1명,<NA><NA><NA><NA>어린이 유료:1명,어린이 유료:1명,
32563<NA><NA><NA><NA><NA><NA>어른 유료:1명<NA><NA><NA><NA>어른 유료:1명어른 유료:1명
24921<NA><NA><NA><NA><NA><NA>청군경 유료:1명<NA><NA><NA><NA>청군경 유료:1명청군경 유료:1명
티켓장수원금구입날짜구입시간구입일시취소일시할인구분티켓가격결제구분내국인외국인요금분류유무료구분
24638320002019-05-12 14:332019-05-12 14:332019-05-12 14:33<NA>어른 유료:2명,2000현금<NA><NA>어른 유료:2명,어른 유료:2명,
31023<NA><NA><NA><NA><NA><NA>어린이 유료:2명<NA><NA><NA><NA>어린이 유료:2명어린이 유료:2명
4224<NA><NA><NA><NA><NA><NA>유아 무료:1명<NA><NA><NA><NA>유아 무료:1명유아 무료:1명
16228220002019-06-30 16:582019-06-30 16:582019-06-30 16:58<NA>어른 유료:2명2000카드<NA><NA>어른 유료:2명어른 유료:2명
25504<NA><NA><NA><NA><NA><NA>어린이 유료:1명,<NA><NA><NA><NA>어린이 유료:1명,어린이 유료:1명,
26235430002019-04-21 16:462019-04-21 16:462019-04-21 16:46<NA>어른 유료:2명,3000현금<NA><NA>어른 유료:2명,어른 유료:2명,
13271214002019-07-13 14:512019-07-13 14:512019-07-13 14:51<NA>청군경 유료:2명1400현금<NA><NA>청군경 유료:2명청군경 유료:2명
6357<NA><NA><NA><NA><NA><NA>다누리카드 할인 어린이:2명<NA><NA><NA><NA>다누리카드 할인 어린이:2명다누리카드 할인 어린이:2명
5276320002019-08-10 13:362019-08-10 13:362019-08-10 13:36<NA>어른 유료:2명,2000카드<NA><NA>어른 유료:2명,어른 유료:2명,
20495428002019-06-15 11:262019-06-15 11:262019-06-15 11:26<NA>청군경 유료:4명2800현금<NA><NA>청군경 유료:4명청군경 유료:4명

Duplicate rows

Most frequently occurring

티켓장수원금구입날짜구입시간구입일시취소일시할인구분티켓가격결제구분내국인외국인요금분류유무료구분# duplicates
85<NA><NA><NA><NA><NA><NA>유아 무료:1명<NA><NA><NA><NA>유아 무료:1명유아 무료:1명1143
87<NA><NA><NA><NA><NA><NA>유아 무료:2명<NA><NA><NA><NA>유아 무료:2명유아 무료:2명378
74<NA><NA><NA><NA><NA><NA>어린이 유료:1명<NA><NA><NA><NA>어린이 유료:1명어린이 유료:1명353
75<NA><NA><NA><NA><NA><NA>어린이 유료:1명,<NA><NA><NA><NA>어린이 유료:1명,어린이 유료:1명,281
109<NA><NA><NA><NA><NA><NA>청군경 유료:1명<NA><NA><NA><NA>청군경 유료:1명청군경 유료:1명267
76<NA><NA><NA><NA><NA><NA>어린이 유료:2명<NA><NA><NA><NA>어린이 유료:2명어린이 유료:2명210
59<NA><NA><NA><NA><NA><NA>어른 유료:1명<NA><NA><NA><NA>어른 유료:1명어른 유료:1명190
61<NA><NA><NA><NA><NA><NA>어른 유료:2명<NA><NA><NA><NA>어른 유료:2명어른 유료:2명107
86<NA><NA><NA><NA><NA><NA>유아 무료:1명,<NA><NA><NA><NA>유아 무료:1명,유아 무료:1명,59
110<NA><NA><NA><NA><NA><NA>청군경 유료:1명,<NA><NA><NA><NA>청군경 유료:1명,청군경 유료:1명,54