Amino acid dipepetide frequency for Crimean-Congo hemorrhagic fever orthonairovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.781AlaAla: 2.781 ± 0.739
1.308AlaCys: 1.308 ± 0.299
1.799AlaAsp: 1.799 ± 0.486
3.598AlaGlu: 3.598 ± 0.541
1.799AlaPhe: 1.799 ± 0.055
2.453AlaGly: 2.453 ± 0.406
0.654AlaHis: 0.654 ± 0.254
2.781AlaIle: 2.781 ± 0.558
2.453AlaLys: 2.453 ± 1.163
5.561AlaLeu: 5.561 ± 1.92
0.818AlaMet: 0.818 ± 0.207
2.453AlaAsn: 2.453 ± 1.284
1.963AlaPro: 1.963 ± 1.074
1.799AlaGln: 1.799 ± 1.72
3.108AlaArg: 3.108 ± 0.618
4.743AlaSer: 4.743 ± 1.547
2.29AlaThr: 2.29 ± 1.385
3.925AlaVal: 3.925 ± 0.69
0.327AlaTrp: 0.327 ± 0.351
0.491AlaTyr: 0.491 ± 0.345
0.0AlaXaa: 0.0 ± 0.0
Cys
1.145CysAla: 1.145 ± 1.011
1.145CysCys: 1.145 ± 0.329
1.145CysAsp: 1.145 ± 0.574
1.636CysGlu: 1.636 ± 0.396
2.126CysPhe: 2.126 ± 0.116
0.654CysGly: 0.654 ± 0.254
0.491CysHis: 0.491 ± 0.324
2.126CysIle: 2.126 ± 0.423
2.617CysLys: 2.617 ± 1.018
3.108CysLeu: 3.108 ± 0.618
0.327CysMet: 0.327 ± 0.191
0.654CysAsn: 0.654 ± 0.524
1.799CysPro: 1.799 ± 0.827
0.818CysGln: 0.818 ± 0.228
1.472CysArg: 1.472 ± 0.491
3.762CysSer: 3.762 ± 0.786
1.963CysThr: 1.963 ± 1.573
1.308CysVal: 1.308 ± 0.509
0.654CysTrp: 0.654 ± 0.254
0.818CysTyr: 0.818 ± 0.726
0.0CysXaa: 0.0 ± 0.0
Asp
1.963AspAla: 1.963 ± 0.437
2.944AspCys: 2.944 ± 0.952
2.944AspAsp: 2.944 ± 0.823
4.253AspGlu: 4.253 ± 1.473
1.472AspPhe: 1.472 ± 0.597
3.108AspGly: 3.108 ± 0.828
0.818AspHis: 0.818 ± 0.228
3.435AspIle: 3.435 ± 1.079
1.963AspLys: 1.963 ± 0.855
5.561AspLeu: 5.561 ± 1.117
0.981AspMet: 0.981 ± 0.646
2.617AspAsn: 2.617 ± 0.772
0.818AspPro: 0.818 ± 0.228
0.818AspGln: 0.818 ± 0.228
3.271AspArg: 3.271 ± 0.653
4.253AspSer: 4.253 ± 0.533
3.762AspThr: 3.762 ± 0.109
1.963AspVal: 1.963 ± 0.373
0.818AspTrp: 0.818 ± 0.228
2.126AspTyr: 2.126 ± 0.519
0.0AspXaa: 0.0 ± 0.0
Glu
3.108GluAla: 3.108 ± 0.744
1.636GluCys: 1.636 ± 0.212
4.58GluAsp: 4.58 ± 1.637
4.58GluGlu: 4.58 ± 0.541
3.271GluPhe: 3.271 ± 0.98
3.762GluGly: 3.762 ± 0.345
1.963GluHis: 1.963 ± 0.548
3.925GluIle: 3.925 ± 1.28
2.944GluLys: 2.944 ± 1.214
8.342GluLeu: 8.342 ± 2.05
1.963GluMet: 1.963 ± 0.865
3.435GluAsn: 3.435 ± 1.093
2.29GluPro: 2.29 ± 0.492
1.963GluGln: 1.963 ± 0.662
3.598GluArg: 3.598 ± 0.789
4.58GluSer: 4.58 ± 0.932
5.234GluThr: 5.234 ± 0.997
5.888GluVal: 5.888 ± 1.293
0.981GluTrp: 0.981 ± 0.859
1.145GluTyr: 1.145 ± 0.41
0.0GluXaa: 0.0 ± 0.0
Phe
1.799PheAla: 1.799 ± 0.055
1.472PheCys: 1.472 ± 0.805
1.799PheAsp: 1.799 ± 0.575
3.435PheGlu: 3.435 ± 1.254
1.963PhePhe: 1.963 ± 0.763
1.963PheGly: 1.963 ± 0.207
0.327PheHis: 0.327 ± 0.404
2.617PheIle: 2.617 ± 0.591
3.271PheLys: 3.271 ± 0.686
4.089PheLeu: 4.089 ± 0.784
1.145PheMet: 1.145 ± 0.329
3.108PheAsn: 3.108 ± 0.62
1.308PhePro: 1.308 ± 0.338
1.308PheGln: 1.308 ± 0.978
0.981PheArg: 0.981 ± 0.274
2.944PheSer: 2.944 ± 0.376
2.617PheThr: 2.617 ± 1.027
1.472PheVal: 1.472 ± 0.805
0.327PheTrp: 0.327 ± 0.127
2.453PheTyr: 2.453 ± 0.361
0.0PheXaa: 0.0 ± 0.0
Gly
1.308GlyAla: 1.308 ± 0.586
1.799GlyCys: 1.799 ± 0.827
3.108GlyAsp: 3.108 ± 1.455
2.126GlyGlu: 2.126 ± 0.116
2.29GlyPhe: 2.29 ± 0.821
1.963GlyGly: 1.963 ± 0.086
0.981GlyHis: 0.981 ± 0.506
3.925GlyIle: 3.925 ± 0.747
5.07GlyLys: 5.07 ± 1.068
5.888GlyLeu: 5.888 ± 0.078
1.145GlyMet: 1.145 ± 0.264
3.108GlyAsn: 3.108 ± 0.62
1.472GlyPro: 1.472 ± 0.491
1.472GlyGln: 1.472 ± 0.298
2.29GlyArg: 2.29 ± 0.394
3.925GlySer: 3.925 ± 0.68
3.925GlyThr: 3.925 ± 1.209
2.617GlyVal: 2.617 ± 0.989
0.327GlyTrp: 0.327 ± 0.127
1.308GlyTyr: 1.308 ± 0.586
0.0GlyXaa: 0.0 ± 0.0
His
1.636HisAla: 1.636 ± 0.326
1.308HisCys: 1.308 ± 0.503
0.491HisAsp: 0.491 ± 0.324
1.145HisGlu: 1.145 ± 0.409
0.981HisPhe: 0.981 ± 0.382
1.308HisGly: 1.308 ± 0.509
0.327HisHis: 0.327 ± 0.191
1.145HisIle: 1.145 ± 0.41
1.636HisLys: 1.636 ± 0.415
2.126HisLeu: 2.126 ± 0.116
1.145HisMet: 1.145 ± 0.409
1.145HisAsn: 1.145 ± 0.233
1.636HisPro: 1.636 ± 0.681
0.818HisGln: 0.818 ± 0.359
0.818HisArg: 0.818 ± 0.359
2.126HisSer: 2.126 ± 0.423
0.981HisThr: 0.981 ± 0.506
1.636HisVal: 1.636 ± 1.257
0.491HisTrp: 0.491 ± 0.287
0.164HisTyr: 0.164 ± 0.096
0.0HisXaa: 0.0 ± 0.0
Ile
3.271IleAla: 3.271 ± 0.423
2.126IleCys: 2.126 ± 1.22
2.617IleAsp: 2.617 ± 0.188
4.743IleGlu: 4.743 ± 1.125
1.472IlePhe: 1.472 ± 0.861
2.781IleGly: 2.781 ± 0.639
1.472IleHis: 1.472 ± 0.701
2.781IleIle: 2.781 ± 0.639
5.234IleLys: 5.234 ± 1.447
6.215IleLeu: 6.215 ± 0.41
1.636IleMet: 1.636 ± 0.374
3.435IleAsn: 3.435 ± 0.616
2.453IlePro: 2.453 ± 0.52
1.963IleGln: 1.963 ± 0.688
2.453IleArg: 2.453 ± 0.935
5.561IleSer: 5.561 ± 1.381
3.271IleThr: 3.271 ± 0.246
4.253IleVal: 4.253 ± 0.941
0.327IleTrp: 0.327 ± 0.191
2.29IleTyr: 2.29 ± 0.832
0.0IleXaa: 0.0 ± 0.0
Lys
3.925LysAla: 3.925 ± 0.874
1.472LysCys: 1.472 ± 0.701
4.907LysAsp: 4.907 ± 1.47
5.234LysGlu: 5.234 ± 1.984
3.271LysPhe: 3.271 ± 1.265
3.598LysGly: 3.598 ± 1.848
1.799LysHis: 1.799 ± 0.869
5.234LysIle: 5.234 ± 0.6
6.542LysLys: 6.542 ± 1.604
8.505LysLeu: 8.505 ± 1.199
1.308LysMet: 1.308 ± 0.338
2.944LysAsn: 2.944 ± 0.316
2.29LysPro: 2.29 ± 0.492
3.762LysGln: 3.762 ± 1.236
3.762LysArg: 3.762 ± 0.786
3.435LysSer: 3.435 ± 0.558
5.07LysThr: 5.07 ± 1.376
3.925LysVal: 3.925 ± 0.563
0.981LysTrp: 0.981 ± 1.051
1.472LysTyr: 1.472 ± 0.158
0.0LysXaa: 0.0 ± 0.0
Leu
4.907LeuAla: 4.907 ± 1.001
1.963LeuCys: 1.963 ± 0.397
6.052LeuAsp: 6.052 ± 1.388
5.561LeuGlu: 5.561 ± 0.765
5.234LeuPhe: 5.234 ± 1.282
5.07LeuGly: 5.07 ± 0.791
2.944LeuHis: 2.944 ± 0.178
6.052LeuIle: 6.052 ± 1.187
10.141LeuLys: 10.141 ± 1.888
12.921LeuLeu: 12.921 ± 1.478
2.781LeuMet: 2.781 ± 0.551
5.07LeuAsn: 5.07 ± 0.443
3.435LeuPro: 3.435 ± 0.813
3.762LeuGln: 3.762 ± 0.818
4.253LeuArg: 4.253 ± 0.925
12.594LeuSer: 12.594 ± 1.838
8.342LeuThr: 8.342 ± 1.783
7.033LeuVal: 7.033 ± 0.612
0.327LeuTrp: 0.327 ± 0.191
3.108LeuTyr: 3.108 ± 0.97
0.0LeuXaa: 0.0 ± 0.0
Met
1.636MetAla: 1.636 ± 0.548
0.164MetCys: 0.164 ± 0.202
0.981MetAsp: 0.981 ± 0.63
1.472MetGlu: 1.472 ± 0.158
1.145MetPhe: 1.145 ± 0.233
1.308MetGly: 1.308 ± 0.338
0.981MetHis: 0.981 ± 0.506
1.308MetIle: 1.308 ± 0.591
1.799MetLys: 1.799 ± 0.492
3.108MetLeu: 3.108 ± 0.742
0.654MetMet: 0.654 ± 0.383
0.981MetAsn: 0.981 ± 0.465
0.818MetPro: 0.818 ± 0.207
0.654MetGln: 0.654 ± 0.383
0.491MetArg: 0.491 ± 0.287
2.781MetSer: 2.781 ± 0.363
0.818MetThr: 0.818 ± 0.448
0.164MetVal: 0.164 ± 0.096
0.0MetTrp: 0.0 ± 0.0
0.327MetTyr: 0.327 ± 0.127
0.0MetXaa: 0.0 ± 0.0
Asn
0.818AsnAla: 0.818 ± 1.098
2.126AsnCys: 2.126 ± 0.505
1.308AsnAsp: 1.308 ± 0.296
2.29AsnGlu: 2.29 ± 1.071
1.472AsnPhe: 1.472 ± 0.597
1.963AsnGly: 1.963 ± 0.548
1.636AsnHis: 1.636 ± 0.326
4.089AsnIle: 4.089 ± 0.546
3.762AsnLys: 3.762 ± 1.33
7.687AsnLeu: 7.687 ± 0.273
1.472AsnMet: 1.472 ± 0.373
1.308AsnAsn: 1.308 ± 0.338
2.126AsnPro: 2.126 ± 1.23
0.654AsnGln: 0.654 ± 0.383
2.781AsnArg: 2.781 ± 0.488
4.907AsnSer: 4.907 ± 1.23
2.617AsnThr: 2.617 ± 0.722
4.089AsnVal: 4.089 ± 0.835
0.654AsnTrp: 0.654 ± 0.148
0.981AsnTyr: 0.981 ± 0.63
0.0AsnXaa: 0.0 ± 0.0
Pro
1.963ProAla: 1.963 ± 0.785
0.327ProCys: 0.327 ± 0.351
1.963ProAsp: 1.963 ± 0.373
3.271ProGlu: 3.271 ± 0.566
1.636ProPhe: 1.636 ± 0.504
1.963ProGly: 1.963 ± 1.363
0.818ProHis: 0.818 ± 0.726
0.981ProIle: 0.981 ± 0.249
2.453ProLys: 2.453 ± 1.096
2.126ProLeu: 2.126 ± 0.726
0.654ProMet: 0.654 ± 0.447
1.472ProAsn: 1.472 ± 0.41
1.145ProPro: 1.145 ± 0.41
1.636ProGln: 1.636 ± 0.636
2.29ProArg: 2.29 ± 0.578
3.108ProSer: 3.108 ± 0.693
4.253ProThr: 4.253 ± 1.609
2.126ProVal: 2.126 ± 1.183
0.654ProTrp: 0.654 ± 0.447
0.981ProTyr: 0.981 ± 0.382
0.0ProXaa: 0.0 ± 0.0
Gln
1.963GlnAla: 1.963 ± 0.951
0.654GlnCys: 0.654 ± 0.254
1.145GlnAsp: 1.145 ± 0.281
2.126GlnGlu: 2.126 ± 0.174
1.145GlnPhe: 1.145 ± 0.281
1.636GlnGly: 1.636 ± 0.717
1.145GlnHis: 1.145 ± 0.574
1.799GlnIle: 1.799 ± 0.575
2.453GlnLys: 2.453 ± 0.344
4.58GlnLeu: 4.58 ± 0.822
0.981GlnMet: 0.981 ± 0.274
1.636GlnAsn: 1.636 ± 0.612
0.654GlnPro: 0.654 ± 0.288
2.944GlnGln: 2.944 ± 0.55
0.818GlnArg: 0.818 ± 0.228
3.271GlnSer: 3.271 ± 0.791
1.963GlnThr: 1.963 ± 0.444
2.617GlnVal: 2.617 ± 0.591
0.327GlnTrp: 0.327 ± 0.191
1.145GlnTyr: 1.145 ± 0.233
0.0GlnXaa: 0.0 ± 0.0
Arg
1.636ArgAla: 1.636 ± 0.13
1.963ArgCys: 1.963 ± 0.444
2.29ArgAsp: 2.29 ± 1.132
2.944ArgGlu: 2.944 ± 0.704
1.308ArgPhe: 1.308 ± 0.503
1.799ArgGly: 1.799 ± 1.039
1.472ArgHis: 1.472 ± 0.298
3.271ArgIle: 3.271 ± 0.513
3.598ArgLys: 3.598 ± 0.553
5.561ArgLeu: 5.561 ± 1.332
1.145ArgMet: 1.145 ± 0.233
3.271ArgAsn: 3.271 ± 0.566
0.981ArgPro: 0.981 ± 0.199
2.617ArgGln: 2.617 ± 0.52
2.781ArgArg: 2.781 ± 0.488
4.253ArgSer: 4.253 ± 0.642
2.617ArgThr: 2.617 ± 0.173
2.29ArgVal: 2.29 ± 0.05
0.327ArgTrp: 0.327 ± 0.191
0.818ArgTyr: 0.818 ± 0.478
0.0ArgXaa: 0.0 ± 0.0
Ser
5.234SerAla: 5.234 ± 1.392
2.126SerCys: 2.126 ± 1.22
4.416SerAsp: 4.416 ± 1.506
7.524SerGlu: 7.524 ± 0.785
3.108SerPhe: 3.108 ± 0.642
4.416SerGly: 4.416 ± 0.78
1.799SerHis: 1.799 ± 0.395
5.561SerIle: 5.561 ± 0.725
4.907SerLys: 4.907 ± 1.521
8.996SerLeu: 8.996 ± 1.925
0.818SerMet: 0.818 ± 0.407
3.925SerAsn: 3.925 ± 0.948
3.925SerPro: 3.925 ± 1.487
2.29SerGln: 2.29 ± 0.318
4.089SerArg: 4.089 ± 0.555
9.814SerSer: 9.814 ± 0.809
9.323SerThr: 9.323 ± 2.564
6.052SerVal: 6.052 ± 0.99
1.799SerTrp: 1.799 ± 0.588
2.453SerTyr: 2.453 ± 0.647
0.0SerXaa: 0.0 ± 0.0
Thr
4.416ThrAla: 4.416 ± 0.722
1.472ThrCys: 1.472 ± 1.533
3.925ThrAsp: 3.925 ± 0.471
5.725ThrGlu: 5.725 ± 0.426
2.617ThrPhe: 2.617 ± 0.403
5.397ThrGly: 5.397 ± 1.188
1.145ThrHis: 1.145 ± 0.233
3.762ThrIle: 3.762 ± 1.576
3.598ThrLys: 3.598 ± 0.895
6.379ThrLeu: 6.379 ± 1.308
0.654ThrMet: 0.654 ± 0.148
3.435ThrAsn: 3.435 ± 0.761
3.925ThrPro: 3.925 ± 2.321
2.944ThrGln: 2.944 ± 0.908
2.29ThrArg: 2.29 ± 0.492
6.379ThrSer: 6.379 ± 1.724
4.58ThrThr: 4.58 ± 1.781
4.58ThrVal: 4.58 ± 0.252
0.818ThrTrp: 0.818 ± 0.207
1.472ThrTyr: 1.472 ± 0.298
0.0ThrXaa: 0.0 ± 0.0
Val
2.126ValAla: 2.126 ± 0.819
1.963ValCys: 1.963 ± 0.785
2.944ValAsp: 2.944 ± 0.704
5.561ValGlu: 5.561 ± 0.256
2.29ValPhe: 2.29 ± 0.394
1.636ValGly: 1.636 ± 0.212
1.145ValHis: 1.145 ± 0.409
3.598ValIle: 3.598 ± 0.891
5.725ValLys: 5.725 ± 0.93
6.706ValLeu: 6.706 ± 0.565
1.145ValMet: 1.145 ± 0.2
3.108ValAsn: 3.108 ± 0.208
1.963ValPro: 1.963 ± 1.096
1.799ValGln: 1.799 ± 0.395
3.108ValArg: 3.108 ± 0.828
6.869ValSer: 6.869 ± 0.247
4.416ValThr: 4.416 ± 0.555
2.453ValVal: 2.453 ± 0.524
0.0ValTrp: 0.0 ± 0.0
0.818ValTyr: 0.818 ± 0.228
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.818TrpCys: 0.818 ± 0.359
0.818TrpAsp: 0.818 ± 0.359
0.491TrpGlu: 0.491 ± 0.099
0.818TrpPhe: 0.818 ± 0.669
1.308TrpGly: 1.308 ± 0.577
0.0TrpHis: 0.0 ± 0.0
0.491TrpIle: 0.491 ± 0.099
1.472TrpLys: 1.472 ± 0.373
1.308TrpLeu: 1.308 ± 0.153
0.327TrpMet: 0.327 ± 0.127
0.0TrpAsn: 0.0 ± 0.0
0.491TrpPro: 0.491 ± 0.606
0.327TrpGln: 0.327 ± 0.191
0.491TrpArg: 0.491 ± 0.348
0.491TrpSer: 0.491 ± 0.099
0.491TrpThr: 0.491 ± 0.287
0.327TrpVal: 0.327 ± 0.351
0.164TrpTrp: 0.164 ± 0.096
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.472TyrAla: 1.472 ± 0.594
0.981TyrCys: 0.981 ± 0.927
0.654TyrAsp: 0.654 ± 0.383
1.472TyrGlu: 1.472 ± 0.594
0.981TyrPhe: 0.981 ± 0.382
1.963TyrGly: 1.963 ± 0.397
1.145TyrHis: 1.145 ± 0.329
1.636TyrIle: 1.636 ± 0.13
1.799TyrLys: 1.799 ± 0.88
2.781TyrLeu: 2.781 ± 0.551
0.327TyrMet: 0.327 ± 0.351
1.636TyrAsn: 1.636 ± 0.49
0.327TyrPro: 0.327 ± 0.127
0.654TyrGln: 0.654 ± 0.288
1.799TyrArg: 1.799 ± 0.377
2.781TyrSer: 2.781 ± 0.551
0.981TyrThr: 0.981 ± 0.382
0.654TyrVal: 0.654 ± 0.148
0.327TyrTrp: 0.327 ± 0.351
0.654TyrTyr: 0.654 ± 0.254
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (6115 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski