Amino acid dipepetide frequency for Crimean-Congo hemorrhagic fever virus (strain Nigeria/IbAr10200/1970) (CCHFV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.619AlaAla: 2.619 ± 0.946
1.309AlaCys: 1.309 ± 0.284
1.473AlaAsp: 1.473 ± 0.539
3.437AlaGlu: 3.437 ± 0.427
1.637AlaPhe: 1.637 ± 0.503
3.11AlaGly: 3.11 ± 0.746
0.327AlaHis: 0.327 ± 0.112
2.782AlaIle: 2.782 ± 0.573
2.455AlaLys: 2.455 ± 1.227
5.565AlaLeu: 5.565 ± 1.856
0.655AlaMet: 0.655 ± 0.149
2.455AlaAsn: 2.455 ± 0.922
1.473AlaPro: 1.473 ± 0.443
1.637AlaGln: 1.637 ± 1.755
2.946AlaArg: 2.946 ± 0.739
4.255AlaSer: 4.255 ± 1.285
3.11AlaThr: 3.11 ± 1.483
3.928AlaVal: 3.928 ± 0.652
0.327AlaTrp: 0.327 ± 0.35
0.655AlaTyr: 0.655 ± 0.293
0.0AlaXaa: 0.0 ± 0.0
Cys
1.146CysAla: 1.146 ± 1.023
1.146CysCys: 1.146 ± 0.298
0.982CysAsp: 0.982 ± 0.561
1.637CysGlu: 1.637 ± 0.356
1.964CysPhe: 1.964 ± 0.073
0.982CysGly: 0.982 ± 0.336
0.491CysHis: 0.491 ± 0.28
2.128CysIle: 2.128 ± 0.427
2.619CysLys: 2.619 ± 0.895
3.437CysLeu: 3.437 ± 0.711
0.327CysMet: 0.327 ± 0.175
0.982CysAsn: 0.982 ± 0.336
1.8CysPro: 1.8 ± 0.722
0.818CysGln: 0.818 ± 0.223
1.473CysArg: 1.473 ± 0.443
3.601CysSer: 3.601 ± 0.759
1.8CysThr: 1.8 ± 1.43
1.146CysVal: 1.146 ± 0.5
0.655CysTrp: 0.655 ± 0.224
0.818CysTyr: 0.818 ± 0.628
0.0CysXaa: 0.0 ± 0.0
Asp
1.8AspAla: 1.8 ± 0.495
3.273AspCys: 3.273 ± 0.745
2.619AspAsp: 2.619 ± 0.736
4.583AspGlu: 4.583 ± 1.277
1.473AspPhe: 1.473 ± 0.786
3.437AspGly: 3.437 ± 0.974
0.818AspHis: 0.818 ± 0.437
3.273AspIle: 3.273 ± 0.903
1.8AspLys: 1.8 ± 0.27
5.237AspLeu: 5.237 ± 1.049
0.982AspMet: 0.982 ± 0.655
2.455AspAsn: 2.455 ± 0.67
1.309AspPro: 1.309 ± 0.284
0.818AspGln: 0.818 ± 0.223
3.273AspArg: 3.273 ± 0.6
4.092AspSer: 4.092 ± 0.434
4.583AspThr: 4.583 ± 0.554
1.964AspVal: 1.964 ± 0.073
0.982AspTrp: 0.982 ± 0.236
2.291AspTyr: 2.291 ± 0.474
0.0AspXaa: 0.0 ± 0.0
Glu
2.946GluAla: 2.946 ± 0.698
1.637GluCys: 1.637 ± 0.183
4.583GluAsp: 4.583 ± 1.556
4.583GluGlu: 4.583 ± 0.523
3.273GluPhe: 3.273 ± 0.845
3.437GluGly: 3.437 ± 0.345
1.964GluHis: 1.964 ± 0.726
3.928GluIle: 3.928 ± 1.048
3.11GluLys: 3.11 ± 0.913
8.347GluLeu: 8.347 ± 1.534
1.8GluMet: 1.8 ± 0.886
3.273GluAsn: 3.273 ± 0.6
2.128GluPro: 2.128 ± 0.629
1.964GluGln: 1.964 ± 0.576
3.11GluArg: 3.11 ± 0.663
5.074GluSer: 5.074 ± 0.273
5.074GluThr: 5.074 ± 0.738
6.219GluVal: 6.219 ± 0.877
0.818GluTrp: 0.818 ± 0.557
0.982GluTyr: 0.982 ± 0.273
0.0GluXaa: 0.0 ± 0.0
Phe
1.8PheAla: 1.8 ± 0.248
1.146PheCys: 1.146 ± 0.769
1.8PheAsp: 1.8 ± 0.498
3.437PheGlu: 3.437 ± 1.154
2.128PhePhe: 2.128 ± 0.629
1.964PheGly: 1.964 ± 0.184
0.327PheHis: 0.327 ± 0.35
2.455PheIle: 2.455 ± 0.493
2.782PheLys: 2.782 ± 0.317
4.255PheLeu: 4.255 ± 0.623
0.982PheMet: 0.982 ± 0.197
3.11PheAsn: 3.11 ± 0.535
1.473PhePro: 1.473 ± 0.354
1.309PheGln: 1.309 ± 0.984
1.309PheArg: 1.309 ± 0.153
2.946PheSer: 2.946 ± 0.582
2.946PheThr: 2.946 ± 0.582
1.964PheVal: 1.964 ± 0.546
0.327PheTrp: 0.327 ± 0.112
2.128PheTyr: 2.128 ± 0.517
0.0PheXaa: 0.0 ± 0.0
Gly
0.982GlyAla: 0.982 ± 0.336
1.8GlyCys: 1.8 ± 0.722
3.437GlyAsp: 3.437 ± 1.375
2.619GlyGlu: 2.619 ± 0.38
1.8GlyPhe: 1.8 ± 0.495
1.964GlyGly: 1.964 ± 0.073
0.982GlyHis: 0.982 ± 0.47
3.601GlyIle: 3.601 ± 0.625
5.237GlyLys: 5.237 ± 1.104
6.71GlyLeu: 6.71 ± 0.908
0.982GlyMet: 0.982 ± 0.261
3.437GlyAsn: 3.437 ± 0.427
1.637GlyPro: 1.637 ± 0.604
1.473GlyGln: 1.473 ± 0.296
2.782GlyArg: 2.782 ± 0.429
3.601GlySer: 3.601 ± 0.706
4.255GlyThr: 4.255 ± 1.66
2.946GlyVal: 2.946 ± 0.422
0.327GlyTrp: 0.327 ± 0.112
1.637GlyTyr: 1.637 ± 0.896
0.0GlyXaa: 0.0 ± 0.0
His
1.309HisAla: 1.309 ± 0.474
1.473HisCys: 1.473 ± 0.37
0.327HisAsp: 0.327 ± 0.112
1.309HisGlu: 1.309 ± 0.294
1.146HisPhe: 1.146 ± 0.5
1.473HisGly: 1.473 ± 0.408
0.327HisHis: 0.327 ± 0.112
1.309HisIle: 1.309 ± 0.53
1.8HisLys: 1.8 ± 0.52
1.964HisLeu: 1.964 ± 0.073
0.818HisMet: 0.818 ± 0.437
1.309HisAsn: 1.309 ± 0.448
1.964HisPro: 1.964 ± 0.942
0.655HisGln: 0.655 ± 0.293
0.655HisArg: 0.655 ± 0.224
2.128HisSer: 2.128 ± 0.517
0.655HisThr: 0.655 ± 0.293
1.637HisVal: 1.637 ± 1.107
0.491HisTrp: 0.491 ± 0.262
0.327HisTyr: 0.327 ± 0.175
0.0HisXaa: 0.0 ± 0.0
Ile
2.782IleAla: 2.782 ± 0.645
1.964IleCys: 1.964 ± 0.889
2.455IleAsp: 2.455 ± 0.452
4.255IleGlu: 4.255 ± 0.963
2.128IlePhe: 2.128 ± 0.103
2.455IleGly: 2.455 ± 0.207
1.473IleHis: 1.473 ± 0.408
2.291IleIle: 2.291 ± 0.528
4.746IleLys: 4.746 ± 1.542
6.383IleLeu: 6.383 ± 0.609
1.637IleMet: 1.637 ± 0.384
3.437IleAsn: 3.437 ± 0.539
2.128IlePro: 2.128 ± 0.103
2.128IleGln: 2.128 ± 0.404
2.619IleArg: 2.619 ± 0.656
5.565IleSer: 5.565 ± 1.297
4.746IleThr: 4.746 ± 0.985
4.419IleVal: 4.419 ± 1.022
0.327IleTrp: 0.327 ± 0.175
1.964IleTyr: 1.964 ± 0.853
0.0IleXaa: 0.0 ± 0.0
Lys
3.764LysAla: 3.764 ± 0.597
1.637LysCys: 1.637 ± 0.56
5.074LysAsp: 5.074 ± 1.343
4.91LysGlu: 4.91 ± 1.169
3.273LysPhe: 3.273 ± 1.247
3.928LysGly: 3.928 ± 1.834
1.8LysHis: 1.8 ± 0.765
4.746LysIle: 4.746 ± 0.125
7.038LysLys: 7.038 ± 1.377
8.674LysLeu: 8.674 ± 0.586
1.473LysMet: 1.473 ± 0.144
3.273LysAsn: 3.273 ± 0.255
1.8LysPro: 1.8 ± 0.384
3.437LysGln: 3.437 ± 0.919
4.255LysArg: 4.255 ± 0.855
3.601LysSer: 3.601 ± 0.532
3.928LysThr: 3.928 ± 0.891
3.764LysVal: 3.764 ± 0.52
0.818LysTrp: 0.818 ± 0.68
1.473LysTyr: 1.473 ± 0.144
0.0LysXaa: 0.0 ± 0.0
Leu
4.746LeuAla: 4.746 ± 0.498
2.619LeuCys: 2.619 ± 0.706
6.547LeuAsp: 6.547 ± 0.829
5.565LeuGlu: 5.565 ± 0.774
5.401LeuPhe: 5.401 ± 1.308
4.91LeuGly: 4.91 ± 0.616
2.619LeuHis: 2.619 ± 0.178
6.219LeuIle: 6.219 ± 1.058
10.638LeuLys: 10.638 ± 1.781
12.111LeuLeu: 12.111 ± 1.92
3.273LeuMet: 3.273 ± 0.662
5.074LeuAsn: 5.074 ± 0.608
3.764LeuPro: 3.764 ± 0.769
4.092LeuGln: 4.092 ± 0.857
4.255LeuArg: 4.255 ± 0.555
12.93LeuSer: 12.93 ± 1.62
7.529LeuThr: 7.529 ± 1.569
6.219LeuVal: 6.219 ± 1.07
0.327LeuTrp: 0.327 ± 0.175
3.11LeuTyr: 3.11 ± 1.058
0.0LeuXaa: 0.0 ± 0.0
Met
1.637MetAla: 1.637 ± 0.555
0.164MetCys: 0.164 ± 0.175
0.982MetAsp: 0.982 ± 0.236
1.8MetGlu: 1.8 ± 0.27
1.146MetPhe: 1.146 ± 0.237
1.473MetGly: 1.473 ± 0.144
0.982MetHis: 0.982 ± 0.47
1.309MetIle: 1.309 ± 0.578
2.128MetLys: 2.128 ± 0.405
2.946MetLeu: 2.946 ± 0.827
0.655MetMet: 0.655 ± 0.35
0.982MetAsn: 0.982 ± 0.406
0.655MetPro: 0.655 ± 0.149
0.655MetGln: 0.655 ± 0.35
0.327MetArg: 0.327 ± 0.175
2.128MetSer: 2.128 ± 0.404
0.655MetThr: 0.655 ± 0.224
0.164MetVal: 0.164 ± 0.087
0.0MetTrp: 0.0 ± 0.0
0.491MetTyr: 0.491 ± 0.28
0.0MetXaa: 0.0 ± 0.0
Asn
1.309AsnAla: 1.309 ± 1.464
1.964AsnCys: 1.964 ± 0.394
1.637AsnAsp: 1.637 ± 0.447
1.8AsnGlu: 1.8 ± 0.527
1.637AsnPhe: 1.637 ± 0.447
1.473AsnGly: 1.473 ± 0.622
1.637AsnHis: 1.637 ± 0.447
4.092AsnIle: 4.092 ± 0.474
4.092AsnLys: 4.092 ± 0.862
7.856AsnLeu: 7.856 ± 0.093
1.637AsnMet: 1.637 ± 0.447
1.473AsnAsn: 1.473 ± 0.568
2.128AsnPro: 2.128 ± 1.235
0.655AsnGln: 0.655 ± 0.35
3.273AsnArg: 3.273 ± 0.617
5.074AsnSer: 5.074 ± 0.73
2.128AsnThr: 2.128 ± 0.405
3.764AsnVal: 3.764 ± 0.784
0.655AsnTrp: 0.655 ± 0.149
0.655AsnTyr: 0.655 ± 0.149
0.0AsnXaa: 0.0 ± 0.0
Pro
1.8ProAla: 1.8 ± 0.27
0.327ProCys: 0.327 ± 0.35
2.128ProAsp: 2.128 ± 0.151
3.11ProGlu: 3.11 ± 0.436
1.473ProPhe: 1.473 ± 0.144
2.128ProGly: 2.128 ± 1.366
0.818ProHis: 0.818 ± 0.628
1.309ProIle: 1.309 ± 0.297
2.128ProLys: 2.128 ± 0.861
2.291ProLeu: 2.291 ± 0.592
0.327ProMet: 0.327 ± 0.35
1.473ProAsn: 1.473 ± 0.354
1.473ProPro: 1.473 ± 0.69
1.8ProGln: 1.8 ± 0.722
1.964ProArg: 1.964 ± 0.472
3.273ProSer: 3.273 ± 0.926
3.764ProThr: 3.764 ± 0.876
2.455ProVal: 2.455 ± 1.39
0.655ProTrp: 0.655 ± 0.432
0.818ProTyr: 0.818 ± 0.389
0.0ProXaa: 0.0 ± 0.0
Gln
1.964GlnAla: 1.964 ± 0.829
0.655GlnCys: 0.655 ± 0.224
1.309GlnAsp: 1.309 ± 0.153
2.128GlnGlu: 2.128 ± 0.151
1.309GlnPhe: 1.309 ± 0.294
1.637GlnGly: 1.637 ± 0.356
0.982GlnHis: 0.982 ± 0.197
1.637GlnIle: 1.637 ± 0.423
2.291GlnLys: 2.291 ± 0.382
4.255GlnLeu: 4.255 ± 0.823
0.982GlnMet: 0.982 ± 0.236
1.473GlnAsn: 1.473 ± 0.568
0.655GlnPro: 0.655 ± 0.293
2.782GlnGln: 2.782 ± 0.645
0.982GlnArg: 0.982 ± 0.305
3.273GlnSer: 3.273 ± 0.829
1.8GlnThr: 1.8 ± 0.527
2.782GlnVal: 2.782 ± 0.567
0.327GlnTrp: 0.327 ± 0.175
0.982GlnTyr: 0.982 ± 0.197
0.0GlnXaa: 0.0 ± 0.0
Arg
2.128ArgAla: 2.128 ± 0.405
1.8ArgCys: 1.8 ± 0.384
2.128ArgAsp: 2.128 ± 1.136
2.946ArgGlu: 2.946 ± 0.267
1.473ArgPhe: 1.473 ± 0.354
1.8ArgGly: 1.8 ± 1.043
1.8ArgHis: 1.8 ± 0.518
3.764ArgIle: 3.764 ± 0.956
2.619ArgLys: 2.619 ± 0.38
5.892ArgLeu: 5.892 ± 1.644
0.982ArgMet: 0.982 ± 0.305
3.11ArgAsn: 3.11 ± 0.245
1.146ArgPro: 1.146 ± 0.237
2.619ArgGln: 2.619 ± 0.596
3.11ArgArg: 3.11 ± 0.318
4.583ArgSer: 4.583 ± 0.996
2.782ArgThr: 2.782 ± 0.317
2.128ArgVal: 2.128 ± 0.404
0.491ArgTrp: 0.491 ± 0.332
0.818ArgTyr: 0.818 ± 0.437
0.0ArgXaa: 0.0 ± 0.0
Ser
5.565SerAla: 5.565 ± 1.64
1.964SerCys: 1.964 ± 0.889
4.255SerAsp: 4.255 ± 1.28
7.529SerGlu: 7.529 ± 1.133
3.437SerPhe: 3.437 ± 0.207
5.728SerGly: 5.728 ± 0.638
1.8SerHis: 1.8 ± 0.27
5.728SerIle: 5.728 ± 0.803
4.419SerLys: 4.419 ± 1.68
9.165SerLeu: 9.165 ± 1.798
0.655SerMet: 0.655 ± 0.336
4.419SerAsn: 4.419 ± 0.705
3.11SerPro: 3.11 ± 0.66
1.964SerGln: 1.964 ± 0.073
3.764SerArg: 3.764 ± 0.962
9.165SerSer: 9.165 ± 1.683
8.347SerThr: 8.347 ± 2.142
6.219SerVal: 6.219 ± 1.101
1.8SerTrp: 1.8 ± 0.52
2.782SerTyr: 2.782 ± 0.817
0.0SerXaa: 0.0 ± 0.0
Thr
4.092ThrAla: 4.092 ± 0.805
1.309ThrCys: 1.309 ± 1.152
4.583ThrAsp: 4.583 ± 0.614
5.728ThrGlu: 5.728 ± 0.16
2.128ThrPhe: 2.128 ± 0.405
5.237ThrGly: 5.237 ± 1.642
1.637ThrHis: 1.637 ± 0.56
3.273ThrIle: 3.273 ± 1.022
3.928ThrLys: 3.928 ± 0.523
6.056ThrLeu: 6.056 ± 1.409
0.982ThrMet: 0.982 ± 0.336
2.455ThrAsn: 2.455 ± 1.175
3.928ThrPro: 3.928 ± 2.137
2.619ThrGln: 2.619 ± 0.568
2.291ThrArg: 2.291 ± 0.476
6.547ThrSer: 6.547 ± 1.707
3.764ThrThr: 3.764 ± 0.858
4.583ThrVal: 4.583 ± 0.307
0.818ThrTrp: 0.818 ± 0.192
1.637ThrTyr: 1.637 ± 0.384
0.0ThrXaa: 0.0 ± 0.0
Val
2.455ValAla: 2.455 ± 0.788
1.8ValCys: 1.8 ± 0.52
3.273ValAsp: 3.273 ± 1.111
4.91ValGlu: 4.91 ± 0.201
2.291ValPhe: 2.291 ± 0.236
1.964ValGly: 1.964 ± 0.184
1.146ValHis: 1.146 ± 0.237
3.928ValIle: 3.928 ± 1.039
5.565ValLys: 5.565 ± 1.093
6.71ValLeu: 6.71 ± 0.57
1.146ValMet: 1.146 ± 0.203
3.11ValAsn: 3.11 ± 0.209
2.291ValPro: 2.291 ± 0.811
1.8ValGln: 1.8 ± 0.27
3.601ValArg: 3.601 ± 0.72
6.874ValSer: 6.874 ± 0.461
3.764ValThr: 3.764 ± 0.341
2.946ValVal: 2.946 ± 0.62
0.0ValTrp: 0.0 ± 0.0
0.982ValTyr: 0.982 ± 0.197
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.818TrpCys: 0.818 ± 0.351
0.818TrpAsp: 0.818 ± 0.351
0.491TrpGlu: 0.491 ± 0.099
0.818TrpPhe: 0.818 ± 0.68
1.309TrpGly: 1.309 ± 0.586
0.0TrpHis: 0.0 ± 0.0
0.327TrpIle: 0.327 ± 0.175
1.309TrpLys: 1.309 ± 0.474
1.309TrpLeu: 1.309 ± 0.153
0.327TrpMet: 0.327 ± 0.112
0.0TrpAsn: 0.0 ± 0.0
0.491TrpPro: 0.491 ± 0.525
0.327TrpGln: 0.327 ± 0.175
0.655TrpArg: 0.655 ± 0.432
0.491TrpSer: 0.491 ± 0.099
0.491TrpThr: 0.491 ± 0.262
0.491TrpVal: 0.491 ± 0.352
0.164TrpTrp: 0.164 ± 0.087
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.309TyrAla: 1.309 ± 0.153
1.146TyrCys: 1.146 ± 0.734
0.655TyrAsp: 0.655 ± 0.35
1.473TyrGlu: 1.473 ± 0.568
0.655TyrPhe: 0.655 ± 0.149
1.964TyrGly: 1.964 ± 0.394
1.146TyrHis: 1.146 ± 0.237
1.8TyrIle: 1.8 ± 0.056
1.964TyrLys: 1.964 ± 0.88
2.782TyrLeu: 2.782 ± 0.495
0.491TyrMet: 0.491 ± 0.352
1.637TyrAsn: 1.637 ± 0.423
0.164TyrPro: 0.164 ± 0.087
0.655TyrGln: 0.655 ± 0.336
1.473TyrArg: 1.473 ± 0.408
2.619TyrSer: 2.619 ± 0.178
1.473TyrThr: 1.473 ± 0.69
0.818TyrVal: 0.818 ± 0.223
0.327TyrTrp: 0.327 ± 0.35
0.982TyrTyr: 0.982 ± 0.47
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (6111 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski