Overview

Dataset statistics

Number of variables: 8
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.6 KiB
Average record size in memory: 67.3 B

Variable types

Categorical: 6
Numeric: 2

Alerts

image_id is highly overall correlated with respond_co and 1 other field (High correlation)
qustnr_nm is highly overall correlated with qustnr_id (High correlation)
qustnr_id is highly overall correlated with qustnr_nm (High correlation)
area_nm is highly overall correlated with respond_co and 2 other fields (High correlation)
image_nm is highly overall correlated with respond_co and 1 other field (High correlation)
area_id is highly overall correlated with respond_co and 2 other fields (High correlation)
respond_co is highly overall correlated with area_id and 3 other fields (High correlation)
qustnr_value is highly overall correlated with area_id and 1 other field (High correlation)

Reproduction

Analysis started: 2023-12-10 09:48:56.350011
Analysis finished: 2023-12-10 09:48:58.387448
Duration: 2.04 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
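
A report of this kind can typically be regenerated with a few lines of Python. The sketch below is illustrative only: the helper name `build_profile`, the CSV path, the output path, and the report title are assumptions, not values taken from this report, and the exact configuration used here lives in config.json. The first part simply rechecks the reported duration from the two timestamps.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section above.
started = datetime.fromisoformat("2023-12-10 09:48:56.350011")
finished = datetime.fromisoformat("2023-12-10 09:48:58.387448")
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")


def build_profile(csv_path, html_path="report.html"):
    """Profile a CSV file and write an HTML report.

    Hypothetical helper: both paths and the title are placeholders.
    """
    # Imported lazily so this module still loads where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
    return report
```

Calling `build_profile("survey.csv")` (a made-up file name) would write `report.html` next to the script, provided pandas and ydata-profiling are installed.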

Variables

area_id
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value  Count
GW     45
ALL    42
GG     10
SJ     3

Length

Max length: 3
Median length: 2
Mean length: 2.42
Min length: 2
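
The length statistics follow directly from the category counts. As a rough cross-check, the sketch below expands the area_id counts reported in this section into per-row string lengths and summarises them; the helper name `length_stats` is an assumption made for illustration.

```python
from statistics import mean, median

# Category counts copied from the area_id frequency table in this report.
area_id_counts = {"GW": 45, "ALL": 42, "GG": 10, "SJ": 3}


def length_stats(counts):
    """Expand value counts into per-row string lengths and summarise them."""
    lengths = [len(value) for value, n in counts.items() for _ in range(n)]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": round(mean(lengths), 2),
        "min": min(lengths),
    }


stats = length_stats(area_id_counts)
print(stats)
```

The same helper applies to the other categorical variables once their counts are substituted.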

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ALL
2nd row: SJ
3rd row: ALL
4th row: ALL
5th row: ALL

Common Values

Value  Count  Frequency (%)
GW     45     45.0%
ALL    42     42.0%
GG     10     10.0%
SJ     3      3.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

Value  Count  Frequency (%)
gw     45     45.0%
all    42     42.0%
gg     10     10.0%
sj     3      3.0%

area_nm
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value  Count
강원   45
전국   42
경기   10
세종   3

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전국
2nd row: 세종
3rd row: 전국
4th row: 전국
5th row: 전국

Common Values

Value              Count  Frequency (%)
강원 (Gangwon)     45     45.0%
전국 (Nationwide)  42     42.0%
경기 (Gyeonggi)    10     10.0%
세종 (Sejong)      3      3.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

respond_co
Real number (ℝ)

HIGH CORRELATION 

Distinct: 8
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21995.14
Minimum: 205
Maximum: 69612
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 205
5th percentile: 2521
Q1: 3932
Median: 9230
Q3: 36127
95th percentile: 69612
Maximum: 69612
Range: 69407
Interquartile range (IQR): 32195
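
Because respond_co has only eight distinct values, the quantile figures can be cross-checked from the value counts reported for this variable. The sketch below rebuilds the 100 observations and computes quantiles by linear interpolation between ranks. That interpolation rule is an assumption (it is the pandas default); the report itself does not say which rule was used. The names `expand` and `quantile` are illustrative.

```python
# Value counts copied from the respond_co frequency table in this report.
RESPOND_CO_COUNTS = {
    205: 3, 2521: 15, 3932: 15, 6453: 15,
    9230: 10, 33485: 15, 36127: 14, 69612: 13,
}


def expand(counts):
    """Turn {value: count} into a sorted list with one entry per observation."""
    return sorted(v for v, n in counts.items() for _ in range(n))


def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


values = expand(RESPOND_CO_COUNTS)
summary = {
    "p5": quantile(values, 0.05),
    "q1": quantile(values, 0.25),
    "median": quantile(values, 0.50),
    "q3": quantile(values, 0.75),
    "p95": quantile(values, 0.95),
}
summary["iqr"] = summary["q3"] - summary["q1"]
summary["range"] = values[-1] - values[0]
print(summary)
```

With so few distinct values, every requested rank falls inside a run of identical observations, so the interpolation never has to blend two different values.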

Descriptive statistics

Standard deviation: 22810.598
Coefficient of variation (CV): 1.0370744
Kurtosis: -0.19046856
Mean: 21995.14
Median Absolute Deviation (MAD): 6709
Skewness: 1.0060343
Sum: 2199514
Variance: 5.2032336 × 10^8
Monotonicity: Not monotonic
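
The descriptive statistics can be recomputed the same way. The sketch below assumes the sample (n - 1) definitions of variance and standard deviation, which is what the reported figures are consistent with, and takes CV as standard deviation divided by mean and MAD as the median of absolute deviations from the median.

```python
from statistics import mean, median, stdev, variance

# Value counts copied from the respond_co frequency table in this report.
RESPOND_CO_COUNTS = {
    205: 3, 2521: 15, 3932: 15, 6453: 15,
    9230: 10, 33485: 15, 36127: 14, 69612: 13,
}

# One list entry per observation (100 in total).
values = [v for v, n in RESPOND_CO_COUNTS.items() for _ in range(n)]

total = sum(values)
avg = mean(values)
sd = stdev(values)        # sample standard deviation (n - 1 denominator)
var = variance(values)    # sample variance (n - 1 denominator)
cv = sd / avg             # coefficient of variation
center = median(values)
mad = median(abs(v - center) for v in values)  # median absolute deviation

print(f"sum={total} mean={avg:.2f} sd={sd:.3f} var={var:.7e} cv={cv:.7f} mad={mad}")
```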
[Figure: Histogram with fixed size bins (bins=8)]

Common values

Value  Count  Frequency (%)
33485  15     15.0%
6453   15     15.0%
3932   15     15.0%
2521   15     15.0%
36127  14     14.0%
69612  13     13.0%
9230   10     10.0%
205    3      3.0%

Lowest values (ascending)

Value  Count  Frequency (%)
205    3      3.0%
2521   15     15.0%
3932   15     15.0%
6453   15     15.0%
9230   10     10.0%
33485  15     15.0%
36127  14     14.0%
69612  13     13.0%

Highest values (descending)

Value  Count  Frequency (%)
69612  13     13.0%
36127  14     14.0%
33485  15     15.0%
9230   10     10.0%
6453   15     15.0%
3932   15     15.0%
2521   15     15.0%
205    3      3.0%

image_id
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value          Count
ALL_IMAGE      38
LCLS_IMAGE     32
PASSNGR_IMAGE  30

Length

Max length: 13
Median length: 10
Mean length: 10.52
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ALL_IMAGE
2nd row: LCLS_IMAGE
3rd row: ALL_IMAGE
4th row: ALL_IMAGE
5th row: ALL_IMAGE

Common Values

Value          Count  Frequency (%)
ALL_IMAGE      38     38.0%
LCLS_IMAGE     32     32.0%
PASSNGR_IMAGE  30     30.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

Value          Count  Frequency (%)
all_image      38     38.0%
lcls_image     32     32.0%
passngr_image  30     30.0%

image_nm
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value             Count
전체지역이미지    38
현지인지역이미지  32
여행자지역이미지  30

Length

Max length: 8
Median length: 8
Mean length: 7.62
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전체지역이미지
2nd row: 현지인지역이미지
3rd row: 전체지역이미지
4th row: 전체지역이미지
5th row: 전체지역이미지

Common Values

Value                                        Count  Frequency (%)
전체지역이미지 (overall regional image)      38     38.0%
현지인지역이미지 (locals' regional image)    32     32.0%
여행자지역이미지 (travelers' regional image) 30     30.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

qustnr_id
Categorical

HIGH CORRELATION 

Distinct: 15
Distinct (%): 15.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

IMAGE_1
IMAGE_13
IMAGE_3
IMAGE_4
IMAGE_5
Other values (10): 65

Length

Max length: 8
Median length: 7
Mean length: 7.4
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: IMAGE_1
2nd row: IMAGE_13
3rd row: IMAGE_3
4th row: IMAGE_4
5th row: IMAGE_5

Common Values

Value             Count  Frequency (%)
IMAGE_1           7      7.0%
IMAGE_13          7      7.0%
IMAGE_3           7      7.0%
IMAGE_4           7      7.0%
IMAGE_5           7      7.0%
IMAGE_6           7      7.0%
IMAGE_7           7      7.0%
IMAGE_14          7      7.0%
IMAGE_9           7      7.0%
IMAGE_10          7      7.0%
Other values (5)  30     30.0%

Length

[Figure: Histogram of lengths of the category]

Value             Count  Frequency (%)
image_1           7      7.0%
image_13          7      7.0%
image_3           7      7.0%
image_4           7      7.0%
image_5           7      7.0%
image_6           7      7.0%
image_7           7      7.0%
image_14          7      7.0%
image_9           7      7.0%
image_10          7      7.0%
Other values (5)  30     30.0%

qustnr_nm
Categorical

HIGH CORRELATION 

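Distinct: 15
Distinct (%): 15.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

지저분한깨끗한
친구와함께가족과함께
근거리원거리
복잡한한적한
세련된촌스러운
Other values (10): 65

Length

Max length: 10
Median length: 6
Mean length: 6.96
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지저분한깨끗한
2nd row: 친구와함께가족과함께
3rd row: 근거리원거리
4th row: 복잡한한적한
5th row: 세련된촌스러운

Common Values

Value (English gloss)                                        Count  Frequency (%)
지저분한깨끗한 (dirty / clean)                               7      7.0%
친구와함께가족과함께 (with friends / with family)            7      7.0%
근거리원거리 (short distance / long distance)                7      7.0%
복잡한한적한 (crowded / quiet)                               7      7.0%
세련된촌스러운 (sophisticated / unfashionable)               7      7.0%
독특한평범한 (unique / ordinary)                             7      7.0%
화려한소박한 (fancy / modest)                                7      7.0%
활동적인힐링되는 (active / relaxing)                         7      7.0%
젊은층의중장년층 (for the young / for the middle-aged)       7      7.0%
상업적인인심좋은 (commercial / warm-hearted)                 7      7.0%
Other values (5)                                             30     30.0%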

Length

[Figure: Histogram of lengths of the category]

qustnr_value
Real number (ℝ)

HIGH CORRELATION 

Distinct: 52
Distinct (%): 52.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 52.25
Minimum: 32.4
Maximum: 64.6
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 32.4
5th percentile: 43.725
Q1: 50
Median: 50
Q3: 57.15
95th percentile: 62.455
Maximum: 64.6
Range: 32.2
Interquartile range (IQR): 7.15

Descriptive statistics

Standard deviation: 6.2007738
Coefficient of variation (CV): 0.1186751
Kurtosis: 0.35246417
Mean: 52.25
Median Absolute Deviation (MAD): 4.05
Skewness: -0.063891902
Sum: 5225
Variance: 38.449596
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value              Count  Frequency (%)
50.0               42     42.0%
45.5               3      3.0%
58.3               2      2.0%
62.3               2      2.0%
58.9               2      2.0%
57.3               2      2.0%
57.5               2      2.0%
45.7               1      1.0%
45.4               1      1.0%
45.3               1      1.0%
Other values (42)  42     42.0%

Lowest values (ascending)

Value  Count  Frequency (%)
32.4   1      1.0%
36.0   1      1.0%
41.6   1      1.0%
41.9   1      1.0%
42.3   1      1.0%
43.8   1      1.0%
44.4   1      1.0%
44.7   1      1.0%
45.3   1      1.0%
45.4   1      1.0%

Highest values (descending)

Value  Count  Frequency (%)
64.6   1      1.0%
64.5   1      1.0%
64.3   1      1.0%
63.6   1      1.0%
63.5   1      1.0%
62.4   1      1.0%
62.3   2      2.0%
62.1   1      1.0%
61.9   1      1.0%
61.2   1      1.0%

Interactions

[Figures: 4 interaction plots; images not included]

Correlations

[Figure: correlation heatmap 1 of 3]

              area_id  area_nm  respond_co  image_id  image_nm  qustnr_id  qustnr_nm  qustnr_value
area_id       1.000    1.000    1.000       0.317     0.317     0.000      0.000      0.908
area_nm       1.000    1.000    1.000       0.317     0.317     0.000      0.000      0.908
respond_co    1.000    1.000    1.000       0.723     0.723     0.000      0.000      0.780
image_id      0.317    0.317    0.723       1.000     1.000     0.000      0.000      0.000
image_nm      0.317    0.317    0.723       1.000     1.000     0.000      0.000      0.000
qustnr_id     0.000    0.000    0.000       0.000     0.000     1.000      1.000      0.535
qustnr_nm     0.000    0.000    0.000       0.000     0.000     1.000      1.000      0.535
qustnr_value  0.908    0.908    0.780       0.000     0.000     0.535      0.535      1.000

[Figure: correlation heatmap 2 of 3]

           image_id  qustnr_nm  qustnr_id  area_nm  image_nm  area_id
image_id   1.000     0.000      0.000      0.304    1.000     0.304
qustnr_nm  0.000     1.000      1.000      0.000    0.000     0.000
qustnr_id  0.000     1.000      1.000      0.000    0.000     0.000
area_nm    0.304     0.000      0.000      1.000    0.304     1.000
image_nm   1.000     0.000      0.000      0.304    1.000     0.304
area_id    0.304     0.000      0.000      1.000    0.304     1.000

[Figure: correlation heatmap 3 of 3]

              respond_co  qustnr_value  area_id  area_nm  image_id  image_nm  qustnr_id  qustnr_nm
respond_co    1.000       -0.408        0.804    0.804    0.697     0.697     0.000      0.000
qustnr_value  -0.408      1.000         0.770    0.770    0.000     0.000     0.221      0.221
area_id       0.804       0.770         1.000    1.000    0.304     0.304     0.000      0.000
area_nm       0.804       0.770         1.000    1.000    0.304     0.304     0.000      0.000
image_id      0.697       0.000         0.304    0.304    1.000     1.000     0.000      0.000
image_nm      0.697       0.000         0.304    0.304    1.000     1.000     0.000      0.000
qustnr_id     0.000       0.221         0.000    0.000    0.000     0.000      1.000      1.000
qustnr_nm     0.000       0.221         0.000    0.000    0.000     0.000      1.000      1.000
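
The 1.000 entries between each `_id` column and its `_nm` counterpart reflect a one-to-one mapping between code and name. This extract does not state which coefficient produced each matrix, so the sketch below is only an illustration using plain (uncorrected) Cramér's V, a common association measure for two categorical columns. Profilers often apply a bias correction that this sketch omits. The function name `cramers_v` is an assumption, and the data are only the 20 rows shown in the Sample section, not the full 100.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V between two equal-length label sequences."""
    n = len(xs)
    x_totals, y_totals = Counter(xs), Counter(ys)
    observed = Counter(zip(xs, ys))
    chi2 = 0.0
    for x, x_n in x_totals.items():
        for y, y_n in y_totals.items():
            expected = x_n * y_n / n
            # Counter returns 0 for pairs that never occur together.
            chi2 += (observed[(x, y)] - expected) ** 2 / expected
    k = min(len(x_totals), len(y_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


# area_id and area_nm for the 20 rows listed in the Sample section.
area_id = ["ALL", "SJ", "ALL", "ALL", "ALL", "ALL", "ALL", "SJ", "ALL", "ALL"] + ["GG"] * 10
area_nm = ["전국", "세종", "전국", "전국", "전국", "전국", "전국", "세종", "전국", "전국"] + ["경기"] * 10

v_id_nm = cramers_v(area_id, area_nm)
v_independent = cramers_v(list("aabb"), list("xyxy"))
print(round(v_id_nm, 3), round(v_independent, 3))
```

A value near 1 for `area_id` against `area_nm` suggests that one of the two columns is redundant for modelling purposes; the same reasoning applies to the other `_id` and `_nm` pairs.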

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

First rows

    area_id  area_nm  respond_co  image_id    image_nm          qustnr_id  qustnr_nm             qustnr_value
0   ALL      전국     69612       ALL_IMAGE   전체지역이미지    IMAGE_1    지저분한깨끗한        50.0
1   SJ       세종     205         LCLS_IMAGE  현지인지역이미지  IMAGE_13   친구와함께가족과함께  47.1
2   ALL      전국     69612       ALL_IMAGE   전체지역이미지    IMAGE_3    근거리원거리          50.0
3   ALL      전국     69612       ALL_IMAGE   전체지역이미지    IMAGE_4    복잡한한적한          50.0
4   ALL      전국     69612       ALL_IMAGE   전체지역이미지    IMAGE_5    세련된촌스러운        50.0
5   ALL      전국     69612       ALL_IMAGE   전체지역이미지    IMAGE_6    독특한평범한          50.0
6   ALL      전국     69612       ALL_IMAGE   전체지역이미지    IMAGE_7    화려한소박한          50.0
7   SJ       세종     205         LCLS_IMAGE  현지인지역이미지  IMAGE_14   활동적인힐링되는      44.4
8   ALL      전국     69612       ALL_IMAGE   전체지역이미지    IMAGE_9    젊은층의중장년층      50.0
9   ALL      전국     69612       ALL_IMAGE   전체지역이미지    IMAGE_10   상업적인인심좋은      50.0

Last rows

    area_id  area_nm  respond_co  image_id    image_nm          qustnr_id  qustnr_nm             qustnr_value
90  GG       경기     9230        ALL_IMAGE   전체지역이미지    IMAGE_1    지저분한깨끗한        44.7
91  GG       경기     9230        ALL_IMAGE   전체지역이미지    IMAGE_2    개발된보존된          41.6
92  GG       경기     9230        ALL_IMAGE   전체지역이미지    IMAGE_3    근거리원거리          36.0
93  GG       경기     9230        ALL_IMAGE   전체지역이미지    IMAGE_4    복잡한한적한          45.4
94  GG       경기     9230        ALL_IMAGE   전체지역이미지    IMAGE_5    세련된촌스러운        45.7
95  GG       경기     9230        ALL_IMAGE   전체지역이미지    IMAGE_6    독특한평범한          55.8
96  GG       경기     9230        ALL_IMAGE   전체지역이미지    IMAGE_7    화려한소박한          45.8
97  GG       경기     9230        ALL_IMAGE   전체지역이미지    IMAGE_8    이국적한국적          42.3
98  GG       경기     9230        ALL_IMAGE   전체지역이미지    IMAGE_9    젊은층의중장년층      43.8
99  GG       경기     9230        ALL_IMAGE   전체지역이미지    IMAGE_10   상업적인인심좋은      41.9