Amino acid dipepetide frequency for Chatanga virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.285AlaAla: 2.285 ± 2.364
2.031AlaCys: 2.031 ± 0.767
3.046AlaAsp: 3.046 ± 0.264
3.3AlaGlu: 3.3 ± 1.144
1.523AlaPhe: 1.523 ± 0.637
1.523AlaGly: 1.523 ± 0.987
0.762AlaHis: 0.762 ± 0.473
6.093AlaIle: 6.093 ± 0.8
3.808AlaLys: 3.808 ± 1.88
4.316AlaLeu: 4.316 ± 1.255
2.031AlaMet: 2.031 ± 0.435
2.793AlaAsn: 2.793 ± 0.538
1.269AlaPro: 1.269 ± 0.266
1.269AlaGln: 1.269 ± 0.266
3.554AlaArg: 3.554 ± 1.947
3.046AlaSer: 3.046 ± 1.711
2.285AlaThr: 2.285 ± 0.494
1.777AlaVal: 1.777 ± 0.848
0.254AlaTrp: 0.254 ± 0.158
2.539AlaTyr: 2.539 ± 0.632
0.0AlaXaa: 0.0 ± 0.0
Cys
1.015CysAla: 1.015 ± 0.312
0.254CysCys: 0.254 ± 0.158
0.508CysAsp: 0.508 ± 0.446
1.777CysGlu: 1.777 ± 0.87
2.031CysPhe: 2.031 ± 0.767
2.539CysGly: 2.539 ± 1.878
0.254CysHis: 0.254 ± 0.223
2.031CysIle: 2.031 ± 0.624
2.539CysLys: 2.539 ± 0.884
4.062CysLeu: 4.062 ± 1.246
0.508CysMet: 0.508 ± 0.126
1.015CysAsn: 1.015 ± 0.253
1.269CysPro: 1.269 ± 0.442
1.269CysGln: 1.269 ± 0.442
1.269CysArg: 1.269 ± 1.115
1.015CysSer: 1.015 ± 0.253
2.031CysThr: 2.031 ± 1.432
2.031CysVal: 2.031 ± 1.001
0.254CysTrp: 0.254 ± 0.223
0.762CysTyr: 0.762 ± 0.178
0.0CysXaa: 0.0 ± 0.0
Asp
3.554AspAla: 3.554 ± 0.884
1.015AspCys: 1.015 ± 0.253
2.793AspAsp: 2.793 ± 0.61
3.3AspGlu: 3.3 ± 1.162
3.808AspPhe: 3.808 ± 0.892
2.031AspGly: 2.031 ± 0.505
0.508AspHis: 0.508 ± 0.315
6.347AspIle: 6.347 ± 2.018
5.077AspLys: 5.077 ± 0.237
5.331AspLeu: 5.331 ± 0.794
2.031AspMet: 2.031 ± 0.497
2.793AspAsn: 2.793 ± 0.794
2.793AspPro: 2.793 ± 1.652
1.777AspGln: 1.777 ± 0.376
2.031AspArg: 2.031 ± 0.435
2.793AspSer: 2.793 ± 0.538
1.777AspThr: 1.777 ± 0.848
3.3AspVal: 3.3 ± 0.248
0.762AspTrp: 0.762 ± 0.326
3.554AspTyr: 3.554 ± 0.753
0.0AspXaa: 0.0 ± 0.0
Glu
3.554GluAla: 3.554 ± 1.232
1.015GluCys: 1.015 ± 0.253
2.539GluAsp: 2.539 ± 0.691
3.808GluGlu: 3.808 ± 0.863
4.062GluPhe: 4.062 ± 1.85
2.793GluGly: 2.793 ± 0.538
1.015GluHis: 1.015 ± 0.253
4.824GluIle: 4.824 ± 1.451
4.316GluLys: 4.316 ± 0.814
5.839GluLeu: 5.839 ± 0.352
2.539GluMet: 2.539 ± 0.425
4.316GluAsn: 4.316 ± 1.179
2.031GluPro: 2.031 ± 0.435
3.046GluGln: 3.046 ± 1.721
3.046GluArg: 3.046 ± 1.229
4.316GluSer: 4.316 ± 1.774
3.046GluThr: 3.046 ± 0.264
3.3GluVal: 3.3 ± 1.255
0.762GluTrp: 0.762 ± 0.759
2.793GluTyr: 2.793 ± 1.075
0.0GluXaa: 0.0 ± 0.0
Phe
2.539PheAla: 2.539 ± 0.632
1.523PheCys: 1.523 ± 0.58
2.793PheAsp: 2.793 ± 0.317
3.3PheGlu: 3.3 ± 0.699
1.269PhePhe: 1.269 ± 1.539
2.539PheGly: 2.539 ± 1.258
0.762PheHis: 0.762 ± 0.669
2.539PheIle: 2.539 ± 0.658
5.331PheLys: 5.331 ± 0.474
4.316PheLeu: 4.316 ± 1.774
0.762PheMet: 0.762 ± 0.473
2.539PheAsn: 2.539 ± 1.238
2.031PhePro: 2.031 ± 1.36
0.508PheGln: 0.508 ± 0.446
2.539PheArg: 2.539 ± 1.238
3.554PheSer: 3.554 ± 1.539
2.793PheThr: 2.793 ± 1.075
2.539PheVal: 2.539 ± 0.658
0.762PheTrp: 0.762 ± 0.178
2.031PheTyr: 2.031 ± 1.36
0.0PheXaa: 0.0 ± 0.0
Gly
1.523GlyAla: 1.523 ± 0.73
3.046GlyCys: 3.046 ± 0.758
3.046GlyAsp: 3.046 ± 0.714
2.539GlyGlu: 2.539 ± 0.621
0.762GlyPhe: 0.762 ± 0.752
1.269GlyGly: 1.269 ± 0.64
1.269GlyHis: 1.269 ± 0.266
3.046GlyIle: 3.046 ± 1.711
2.285GlyLys: 2.285 ± 0.535
5.077GlyLeu: 5.077 ± 0.264
1.269GlyMet: 1.269 ± 0.798
3.046GlyAsn: 3.046 ± 1.268
2.285GlyPro: 2.285 ± 1.177
1.269GlyGln: 1.269 ± 0.266
1.269GlyArg: 1.269 ± 0.798
3.046GlySer: 3.046 ± 0.639
3.3GlyThr: 3.3 ± 2.472
1.523GlyVal: 1.523 ± 0.999
0.762GlyTrp: 0.762 ± 0.326
2.031GlyTyr: 2.031 ± 1.418
0.0GlyXaa: 0.0 ± 0.0
His
0.762HisAla: 0.762 ± 0.906
1.015HisCys: 1.015 ± 0.545
2.539HisAsp: 2.539 ± 0.884
1.269HisGlu: 1.269 ± 0.266
1.777HisPhe: 1.777 ± 0.603
1.269HisGly: 1.269 ± 0.461
0.762HisHis: 0.762 ± 0.178
1.777HisIle: 1.777 ± 0.376
2.285HisLys: 2.285 ± 0.771
1.523HisLeu: 1.523 ± 0.379
1.015HisMet: 1.015 ± 0.312
1.269HisAsn: 1.269 ± 0.788
0.508HisPro: 0.508 ± 0.126
0.762HisGln: 0.762 ± 0.178
1.269HisArg: 1.269 ± 1.151
1.523HisSer: 1.523 ± 0.945
0.762HisThr: 0.762 ± 0.326
0.508HisVal: 0.508 ± 0.126
0.254HisTrp: 0.254 ± 0.158
0.254HisTyr: 0.254 ± 0.158
0.0HisXaa: 0.0 ± 0.0
Ile
5.331IleAla: 5.331 ± 0.943
2.031IleCys: 2.031 ± 1.089
4.824IleAsp: 4.824 ± 1.19
4.824IleGlu: 4.824 ± 1.044
3.808IlePhe: 3.808 ± 0.233
3.554IleGly: 3.554 ± 1.425
2.031IleHis: 2.031 ± 0.925
4.57IleIle: 4.57 ± 1.137
7.108IleLys: 7.108 ± 1.476
9.901IleLeu: 9.901 ± 2.096
2.285IleMet: 2.285 ± 1.082
5.839IleAsn: 5.839 ± 0.924
2.031IlePro: 2.031 ± 0.435
1.269IleGln: 1.269 ± 0.266
3.3IleArg: 3.3 ± 0.299
6.093IleSer: 6.093 ± 2.154
6.601IleThr: 6.601 ± 1.496
5.077IleVal: 5.077 ± 0.946
0.762IleTrp: 0.762 ± 0.473
3.808IleTyr: 3.808 ± 1.023
0.0IleXaa: 0.0 ± 0.0
Lys
3.554LysAla: 3.554 ± 2.825
2.539LysCys: 2.539 ± 0.884
4.062LysAsp: 4.062 ± 0.87
8.632LysGlu: 8.632 ± 1.263
3.3LysPhe: 3.3 ± 2.086
2.793LysGly: 2.793 ± 0.61
2.539LysHis: 2.539 ± 0.425
5.331LysIle: 5.331 ± 1.129
5.585LysLys: 5.585 ± 0.651
6.093LysLeu: 6.093 ± 1.624
2.285LysMet: 2.285 ± 1.082
5.077LysAsn: 5.077 ± 1.317
3.046LysPro: 3.046 ± 0.388
3.3LysGln: 3.3 ± 0.619
2.031LysArg: 2.031 ± 0.624
6.601LysSer: 6.601 ± 0.913
5.331LysThr: 5.331 ± 0.814
4.062LysVal: 4.062 ± 0.667
1.015LysTrp: 1.015 ± 0.781
3.046LysTyr: 3.046 ± 0.758
0.254LysXaa: 0.254 ± 0.845
Leu
5.585LeuAla: 5.585 ± 1.56
2.031LeuCys: 2.031 ± 0.767
5.331LeuAsp: 5.331 ± 1.996
6.093LeuGlu: 6.093 ± 0.72
4.062LeuPhe: 4.062 ± 1.248
1.777LeuGly: 1.777 ± 0.562
2.793LeuHis: 2.793 ± 0.616
8.378LeuIle: 8.378 ± 1.318
7.362LeuLys: 7.362 ± 1.778
6.855LeuLeu: 6.855 ± 1.8
2.031LeuMet: 2.031 ± 0.435
4.57LeuAsn: 4.57 ± 0.601
2.793LeuPro: 2.793 ± 0.538
2.539LeuGln: 2.539 ± 1.02
3.046LeuArg: 3.046 ± 0.388
8.124LeuSer: 8.124 ± 1.225
7.87LeuThr: 7.87 ± 1.176
3.808LeuVal: 3.808 ± 0.233
0.254LeuTrp: 0.254 ± 0.158
3.3LeuTyr: 3.3 ± 0.835
0.0LeuXaa: 0.0 ± 0.0
Met
1.523MetAla: 1.523 ± 0.58
1.269MetCys: 1.269 ± 0.266
1.777MetAsp: 1.777 ± 0.672
1.015MetGlu: 1.015 ± 0.63
0.762MetPhe: 0.762 ± 0.759
0.762MetGly: 0.762 ± 0.473
0.254MetHis: 0.254 ± 0.158
3.046MetIle: 3.046 ± 0.714
3.808MetLys: 3.808 ± 0.863
2.285MetLeu: 2.285 ± 0.535
1.269MetMet: 1.269 ± 0.64
1.015MetAsn: 1.015 ± 0.63
1.015MetPro: 1.015 ± 0.312
1.015MetGln: 1.015 ± 0.63
0.254MetArg: 0.254 ± 0.158
2.539MetSer: 2.539 ± 1.399
2.285MetThr: 2.285 ± 0.535
1.015MetVal: 1.015 ± 0.253
0.0MetTrp: 0.0 ± 0.0
1.777MetTyr: 1.777 ± 0.562
0.0MetXaa: 0.0 ± 0.0
Asn
3.3AsnAla: 3.3 ± 0.619
1.523AsnCys: 1.523 ± 0.987
4.062AsnAsp: 4.062 ± 0.304
3.046AsnGlu: 3.046 ± 0.841
2.793AsnPhe: 2.793 ± 0.794
3.046AsnGly: 3.046 ± 1.16
1.523AsnHis: 1.523 ± 0.73
4.57AsnIle: 4.57 ± 1.277
2.793AsnLys: 2.793 ± 0.61
5.585AsnLeu: 5.585 ± 0.538
1.015AsnMet: 1.015 ± 0.63
3.808AsnAsn: 3.808 ± 2.024
3.046AsnPro: 3.046 ± 1.274
1.777AsnGln: 1.777 ± 0.376
2.539AsnArg: 2.539 ± 0.942
3.3AsnSer: 3.3 ± 0.739
1.777AsnThr: 1.777 ± 0.483
3.3AsnVal: 3.3 ± 1.208
1.269AsnTrp: 1.269 ± 0.266
2.285AsnTyr: 2.285 ± 0.771
0.0AsnXaa: 0.0 ± 0.0
Pro
2.031ProAla: 2.031 ± 0.435
0.508ProCys: 0.508 ± 0.446
2.539ProAsp: 2.539 ± 0.942
2.285ProGlu: 2.285 ± 1.39
1.523ProPhe: 1.523 ± 0.379
3.808ProGly: 3.808 ± 0.992
0.508ProHis: 0.508 ± 0.126
5.331ProIle: 5.331 ± 1.228
1.777ProLys: 1.777 ± 0.524
1.523ProLeu: 1.523 ± 0.637
1.523ProMet: 1.523 ± 0.462
1.523ProAsn: 1.523 ± 0.653
1.015ProPro: 1.015 ± 0.312
0.762ProGln: 0.762 ± 0.759
1.015ProArg: 1.015 ± 0.545
2.539ProSer: 2.539 ± 0.632
1.269ProThr: 1.269 ± 0.266
2.285ProVal: 2.285 ± 0.494
0.508ProTrp: 0.508 ± 0.315
0.508ProTyr: 0.508 ± 0.315
0.0ProXaa: 0.0 ± 0.0
Gln
1.269GlnAla: 1.269 ± 0.442
0.762GlnCys: 0.762 ± 0.326
1.777GlnAsp: 1.777 ± 0.376
1.269GlnGlu: 1.269 ± 0.461
2.031GlnPhe: 2.031 ± 0.435
1.015GlnGly: 1.015 ± 0.762
1.269GlnHis: 1.269 ± 0.766
2.285GlnIle: 2.285 ± 1.082
3.808GlnLys: 3.808 ± 0.992
1.777GlnLeu: 1.777 ± 0.524
0.254GlnMet: 0.254 ± 0.158
2.031GlnAsn: 2.031 ± 0.767
0.254GlnPro: 0.254 ± 0.158
0.762GlnGln: 0.762 ± 0.178
2.793GlnArg: 2.793 ± 1.426
1.523GlnSer: 1.523 ± 0.357
3.046GlnThr: 3.046 ± 1.003
1.777GlnVal: 1.777 ± 0.603
0.0GlnTrp: 0.0 ± 0.0
1.523GlnTyr: 1.523 ± 1.624
0.254GlnXaa: 0.254 ± 0.158
Arg
1.015ArgAla: 1.015 ± 0.312
1.269ArgCys: 1.269 ± 0.442
3.3ArgAsp: 3.3 ± 0.835
3.554ArgGlu: 3.554 ± 1.109
2.285ArgPhe: 2.285 ± 0.535
1.015ArgGly: 1.015 ± 0.68
1.777ArgHis: 1.777 ± 1.103
4.57ArgIle: 4.57 ± 1.329
3.808ArgLys: 3.808 ± 0.585
3.554ArgLeu: 3.554 ± 1.344
1.015ArgMet: 1.015 ± 0.489
2.793ArgAsn: 2.793 ± 1.064
0.508ArgPro: 0.508 ± 0.126
1.269ArgGln: 1.269 ± 0.67
1.269ArgArg: 1.269 ± 0.788
3.046ArgSer: 3.046 ± 0.639
1.269ArgThr: 1.269 ± 0.67
2.031ArgVal: 2.031 ± 0.492
1.015ArgTrp: 1.015 ± 1.595
2.285ArgTyr: 2.285 ± 0.557
0.0ArgXaa: 0.0 ± 0.0
Ser
3.3SerAla: 3.3 ± 0.932
2.793SerCys: 2.793 ± 1.414
4.316SerAsp: 4.316 ± 1.027
3.046SerGlu: 3.046 ± 0.714
1.269SerPhe: 1.269 ± 0.798
4.062SerGly: 4.062 ± 1.689
1.523SerHis: 1.523 ± 0.357
7.108SerIle: 7.108 ± 1.152
7.108SerLys: 7.108 ± 1.571
7.362SerLeu: 7.362 ± 1.974
1.269SerMet: 1.269 ± 0.266
3.808SerAsn: 3.808 ± 0.18
3.3SerPro: 3.3 ± 0.248
1.523SerGln: 1.523 ± 1.504
4.062SerArg: 4.062 ± 1.248
5.331SerSer: 5.331 ± 1.249
4.062SerThr: 4.062 ± 1.031
4.824SerVal: 4.824 ± 1.015
0.508SerTrp: 0.508 ± 0.126
3.046SerTyr: 3.046 ± 1.306
0.0SerXaa: 0.0 ± 0.0
Thr
3.046ThrAla: 3.046 ± 2.343
0.762ThrCys: 0.762 ± 0.326
2.793ThrAsp: 2.793 ± 0.61
4.57ThrGlu: 4.57 ± 0.501
4.57ThrPhe: 4.57 ± 0.828
3.554ThrGly: 3.554 ± 0.741
1.015ThrHis: 1.015 ± 0.253
4.316ThrIle: 4.316 ± 0.281
3.046ThrLys: 3.046 ± 0.714
4.316ThrLeu: 4.316 ± 0.281
1.269ThrMet: 1.269 ± 0.266
2.793ThrAsn: 2.793 ± 0.808
2.539ThrPro: 2.539 ± 0.532
1.777ThrGln: 1.777 ± 0.562
2.793ThrArg: 2.793 ± 1.395
4.57ThrSer: 4.57 ± 0.964
3.554ThrThr: 3.554 ± 0.148
3.554ThrVal: 3.554 ± 1.205
1.015ThrTrp: 1.015 ± 0.545
4.062ThrTyr: 4.062 ± 0.87
0.0ThrXaa: 0.0 ± 0.0
Val
2.031ValAla: 2.031 ± 0.7
1.777ValCys: 1.777 ± 0.87
2.031ValAsp: 2.031 ± 0.505
2.031ValGlu: 2.031 ± 1.36
3.046ValPhe: 3.046 ± 0.506
1.777ValGly: 1.777 ± 0.483
1.523ValHis: 1.523 ± 0.58
3.808ValIle: 3.808 ± 1.311
4.316ValLys: 4.316 ± 1.847
4.062ValLeu: 4.062 ± 0.077
1.269ValMet: 1.269 ± 0.461
2.285ValAsn: 2.285 ± 0.547
1.269ValPro: 1.269 ± 0.266
2.539ValGln: 2.539 ± 0.884
2.539ValArg: 2.539 ± 1.507
6.093ValSer: 6.093 ± 1.428
3.808ValThr: 3.808 ± 1.311
1.015ValVal: 1.015 ± 0.253
0.0ValTrp: 0.0 ± 0.0
3.3ValTyr: 3.3 ± 1.208
0.0ValXaa: 0.0 ± 0.0
Trp
0.254TrpAla: 0.254 ± 0.158
0.254TrpCys: 0.254 ± 0.158
0.762TrpAsp: 0.762 ± 0.906
0.254TrpGlu: 0.254 ± 0.158
0.508TrpPhe: 0.508 ± 0.126
0.508TrpGly: 0.508 ± 0.126
0.254TrpHis: 0.254 ± 0.223
0.254TrpIle: 0.254 ± 0.223
0.762TrpLys: 0.762 ± 0.473
1.015TrpLeu: 1.015 ± 0.253
0.762TrpMet: 0.762 ± 0.752
1.015TrpAsn: 1.015 ± 0.312
0.254TrpPro: 0.254 ± 0.223
1.015TrpGln: 1.015 ± 0.68
0.254TrpArg: 0.254 ± 0.223
1.015TrpSer: 1.015 ± 0.312
0.0TrpThr: 0.0 ± 0.0
1.015TrpVal: 1.015 ± 0.68
0.0TrpTrp: 0.0 ± 0.0
0.254TrpTyr: 0.254 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.031TyrAla: 2.031 ± 1.089
1.015TyrCys: 1.015 ± 0.892
2.793TyrAsp: 2.793 ± 0.317
2.793TyrGlu: 2.793 ± 1.395
1.777TyrPhe: 1.777 ± 0.769
1.777TyrGly: 1.777 ± 0.524
0.762TyrHis: 0.762 ± 0.326
4.824TyrIle: 4.824 ± 1.308
3.554TyrLys: 3.554 ± 0.741
3.808TyrLeu: 3.808 ± 1.919
2.031TyrMet: 2.031 ± 0.492
2.031TyrAsn: 2.031 ± 0.435
1.523TyrPro: 1.523 ± 0.58
2.031TyrGln: 2.031 ± 0.435
1.777TyrArg: 1.777 ± 0.603
3.554TyrSer: 3.554 ± 1.124
3.046TyrThr: 3.046 ± 0.714
1.777TyrVal: 1.777 ± 0.376
0.254TyrTrp: 0.254 ± 0.158
1.777TyrTyr: 1.777 ± 0.562
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.254XaaGlu: 0.254 ± 0.158
0.0XaaPhe: 0.0 ± 0.0
0.254XaaGly: 0.254 ± 0.845
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3940 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski