Amino acid dipepetide frequency for Leanyer virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.789AlaAla: 1.789 ± 2.094
0.256AlaCys: 0.256 ± 0.241
2.044AlaAsp: 2.044 ± 2.052
3.067AlaGlu: 3.067 ± 1.842
2.556AlaPhe: 2.556 ± 1.039
1.533AlaGly: 1.533 ± 0.966
0.511AlaHis: 0.511 ± 0.129
3.578AlaIle: 3.578 ± 1.742
5.622AlaLys: 5.622 ± 4.61
2.044AlaLeu: 2.044 ± 0.665
0.767AlaMet: 0.767 ± 0.474
2.556AlaAsn: 2.556 ± 0.485
0.767AlaPro: 0.767 ± 0.827
1.278AlaGln: 1.278 ± 0.791
0.511AlaArg: 0.511 ± 0.481
3.322AlaSer: 3.322 ± 0.752
1.789AlaThr: 1.789 ± 0.44
3.067AlaVal: 3.067 ± 1.074
0.256AlaTrp: 0.256 ± 0.158
2.044AlaTyr: 2.044 ± 0.995
0.0AlaXaa: 0.0 ± 0.0
Cys
1.789CysAla: 1.789 ± 0.596
0.511CysCys: 0.511 ± 0.316
0.767CysAsp: 0.767 ± 0.352
1.022CysGlu: 1.022 ± 0.589
1.278CysPhe: 1.278 ± 0.597
2.3CysGly: 2.3 ± 1.788
1.278CysHis: 1.278 ± 0.828
1.789CysIle: 1.789 ± 0.94
2.811CysLys: 2.811 ± 0.881
3.833CysLeu: 3.833 ± 0.727
0.767CysMet: 0.767 ± 0.352
1.533CysAsn: 1.533 ± 0.704
1.533CysPro: 1.533 ± 0.386
1.278CysGln: 1.278 ± 0.828
0.767CysArg: 0.767 ± 0.722
1.533CysSer: 1.533 ± 0.704
2.556CysThr: 2.556 ± 1.292
0.767CysVal: 0.767 ± 0.352
0.0CysTrp: 0.0 ± 0.0
1.278CysTyr: 1.278 ± 0.242
0.0CysXaa: 0.0 ± 0.0
Asp
1.022AspAla: 1.022 ± 0.633
1.022AspCys: 1.022 ± 0.258
3.067AspAsp: 3.067 ± 1.074
2.556AspGlu: 2.556 ± 0.997
4.6AspPhe: 4.6 ± 1.856
2.556AspGly: 2.556 ± 1.129
1.278AspHis: 1.278 ± 0.597
6.9AspIle: 6.9 ± 0.695
3.833AspLys: 3.833 ± 1.317
5.878AspLeu: 5.878 ± 0.364
1.789AspMet: 1.789 ± 0.596
3.578AspAsn: 3.578 ± 1.742
1.789AspPro: 1.789 ± 0.44
1.278AspGln: 1.278 ± 0.791
2.811AspArg: 2.811 ± 0.759
2.044AspSer: 2.044 ± 0.439
2.811AspThr: 2.811 ± 0.599
2.556AspVal: 2.556 ± 0.595
0.767AspTrp: 0.767 ± 0.67
3.578AspTyr: 3.578 ± 0.901
0.0AspXaa: 0.0 ± 0.0
Glu
4.089GluAla: 4.089 ± 1.648
2.044GluCys: 2.044 ± 0.389
4.344GluAsp: 4.344 ± 0.857
5.622GluGlu: 5.622 ± 0.426
3.322GluPhe: 3.322 ± 1.407
2.556GluGly: 2.556 ± 1.292
2.556GluHis: 2.556 ± 0.595
6.9GluIle: 6.9 ± 2.021
5.111GluLys: 5.111 ± 1.301
4.856GluLeu: 4.856 ± 1.067
3.067GluMet: 3.067 ± 0.888
4.089GluAsn: 4.089 ± 0.909
1.789GluPro: 1.789 ± 0.354
2.556GluGln: 2.556 ± 0.878
4.344GluArg: 4.344 ± 1.624
1.278GluSer: 1.278 ± 0.439
3.833GluThr: 3.833 ± 0.851
3.833GluVal: 3.833 ± 1.705
0.256GluTrp: 0.256 ± 0.158
2.044GluTyr: 2.044 ± 1.265
0.0GluXaa: 0.0 ± 0.0
Phe
2.556PheAla: 2.556 ± 0.388
2.3PheCys: 2.3 ± 0.722
3.578PheAsp: 3.578 ± 1.167
3.833PheGlu: 3.833 ± 1.313
1.278PhePhe: 1.278 ± 0.81
3.833PheGly: 3.833 ± 0.248
0.767PheHis: 0.767 ± 0.159
3.833PheIle: 3.833 ± 1.16
3.833PheLys: 3.833 ± 0.851
4.6PheLeu: 4.6 ± 1.856
1.789PheMet: 1.789 ± 0.545
2.811PheAsn: 2.811 ± 1.204
1.278PhePro: 1.278 ± 0.439
1.022PheGln: 1.022 ± 0.29
1.278PheArg: 1.278 ± 0.791
4.344PheSer: 4.344 ± 0.978
1.789PheThr: 1.789 ± 0.479
1.533PheVal: 1.533 ± 0.386
0.767PheTrp: 0.767 ± 0.474
2.811PheTyr: 2.811 ± 0.282
0.0PheXaa: 0.0 ± 0.0
Gly
0.256GlyAla: 0.256 ± 0.758
2.556GlyCys: 2.556 ± 1.411
2.811GlyAsp: 2.811 ± 0.881
3.578GlyGlu: 3.578 ± 0.699
1.278GlyPhe: 1.278 ± 0.596
1.022GlyGly: 1.022 ± 0.258
0.256GlyHis: 0.256 ± 0.158
2.044GlyIle: 2.044 ± 1.178
2.811GlyLys: 2.811 ± 0.282
4.6GlyLeu: 4.6 ± 1.159
1.022GlyMet: 1.022 ± 0.704
3.578GlyAsn: 3.578 ± 0.901
1.278GlyPro: 1.278 ± 0.597
1.533GlyGln: 1.533 ± 0.672
0.767GlyArg: 0.767 ± 0.159
2.556GlySer: 2.556 ± 0.682
3.067GlyThr: 3.067 ± 1.933
0.767GlyVal: 0.767 ± 0.716
0.767GlyTrp: 0.767 ± 0.352
2.044GlyTyr: 2.044 ± 1.632
0.0GlyXaa: 0.0 ± 0.0
His
0.256HisAla: 0.256 ± 0.241
0.256HisCys: 0.256 ± 0.241
1.022HisAsp: 1.022 ± 0.258
1.789HisGlu: 1.789 ± 0.749
1.022HisPhe: 1.022 ± 0.29
1.789HisGly: 1.789 ± 0.479
0.256HisHis: 0.256 ± 0.158
1.533HisIle: 1.533 ± 0.386
1.533HisLys: 1.533 ± 0.619
0.767HisLeu: 0.767 ± 0.159
0.256HisMet: 0.256 ± 0.158
2.811HisAsn: 2.811 ± 0.759
0.256HisPro: 0.256 ± 0.241
0.0HisGln: 0.0 ± 0.0
1.022HisArg: 1.022 ± 0.955
1.789HisSer: 1.789 ± 0.354
1.789HisThr: 1.789 ± 0.44
1.278HisVal: 1.278 ± 0.828
0.767HisTrp: 0.767 ± 0.352
1.278HisTyr: 1.278 ± 0.472
0.0HisXaa: 0.0 ± 0.0
Ile
3.833IleAla: 3.833 ± 1.017
3.067IleCys: 3.067 ± 1.408
5.111IleAsp: 5.111 ± 0.973
5.111IleGlu: 5.111 ± 0.919
4.856IlePhe: 4.856 ± 0.114
1.789IleGly: 1.789 ± 0.44
1.278IleHis: 1.278 ± 0.439
6.9IleIle: 6.9 ± 1.432
10.733IleLys: 10.733 ± 2.098
8.433IleLeu: 8.433 ± 2.545
2.044IleMet: 2.044 ± 0.515
5.878IleAsn: 5.878 ± 0.651
4.6IlePro: 4.6 ± 0.944
3.833IleGln: 3.833 ± 0.887
2.556IleArg: 2.556 ± 0.485
7.922IleSer: 7.922 ± 0.468
5.111IleThr: 5.111 ± 1.468
2.811IleVal: 2.811 ± 0.848
0.767IleTrp: 0.767 ± 0.159
4.089IleTyr: 4.089 ± 0.857
0.0IleXaa: 0.0 ± 0.0
Lys
3.322LysAla: 3.322 ± 0.985
1.789LysCys: 1.789 ± 1.308
5.622LysAsp: 5.622 ± 1.433
6.9LysGlu: 6.9 ± 1.629
4.344LysPhe: 4.344 ± 0.453
3.833LysGly: 3.833 ± 1.791
2.044LysHis: 2.044 ± 0.58
7.922LysIle: 7.922 ± 1.874
5.367LysLys: 5.367 ± 1.076
9.456LysLeu: 9.456 ± 1.06
3.322LysMet: 3.322 ± 1.079
4.856LysAsn: 4.856 ± 0.72
3.322LysPro: 3.322 ± 1.016
1.533LysGln: 1.533 ± 0.672
2.556LysArg: 2.556 ± 0.644
4.856LysSer: 4.856 ± 0.762
6.389LysThr: 6.389 ± 0.418
5.367LysVal: 5.367 ± 3.035
2.044LysTrp: 2.044 ± 1.311
2.044LysTyr: 2.044 ± 0.58
0.0LysXaa: 0.0 ± 0.0
Leu
5.367LeuAla: 5.367 ± 3.121
2.556LeuCys: 2.556 ± 0.945
5.622LeuAsp: 5.622 ± 1.456
8.433LeuGlu: 8.433 ± 1.654
4.089LeuPhe: 4.089 ± 1.47
1.789LeuGly: 1.789 ± 1.308
2.044LeuHis: 2.044 ± 0.439
8.178LeuIle: 8.178 ± 1.655
7.411LeuLys: 7.411 ± 1.263
7.411LeuLeu: 7.411 ± 1.427
2.3LeuMet: 2.3 ± 0.557
4.6LeuAsn: 4.6 ± 0.181
3.322LeuPro: 3.322 ± 0.296
2.044LeuGln: 2.044 ± 0.439
2.044LeuArg: 2.044 ± 1.111
8.945LeuSer: 8.945 ± 0.889
7.156LeuThr: 7.156 ± 1.399
2.811LeuVal: 2.811 ± 1.377
0.0LeuTrp: 0.0 ± 0.0
4.6LeuTyr: 4.6 ± 0.559
0.0LeuXaa: 0.0 ± 0.0
Met
0.511MetAla: 0.511 ± 0.316
0.511MetCys: 0.511 ± 0.481
2.811MetAsp: 2.811 ± 1.107
1.278MetGlu: 1.278 ± 0.791
1.278MetPhe: 1.278 ± 0.596
0.256MetGly: 0.256 ± 0.158
0.256MetHis: 0.256 ± 0.158
3.067MetIle: 3.067 ± 1.096
2.556MetLys: 2.556 ± 0.485
2.3MetLeu: 2.3 ± 0.727
1.022MetMet: 1.022 ± 0.29
1.278MetAsn: 1.278 ± 0.439
0.511MetPro: 0.511 ± 0.316
0.767MetGln: 0.767 ± 0.159
2.3MetArg: 2.3 ± 0.876
2.044MetSer: 2.044 ± 0.767
1.789MetThr: 1.789 ± 0.596
1.278MetVal: 1.278 ± 0.242
0.0MetTrp: 0.0 ± 0.0
0.256MetTyr: 0.256 ± 0.158
0.0MetXaa: 0.0 ± 0.0
Asn
2.556AsnAla: 2.556 ± 1.191
2.044AsnCys: 2.044 ± 1.924
3.067AsnAsp: 3.067 ± 0.87
4.089AsnGlu: 4.089 ± 0.909
3.322AsnPhe: 3.322 ± 1.025
2.556AsnGly: 2.556 ± 1.656
1.278AsnHis: 1.278 ± 1.37
6.645AsnIle: 6.645 ± 0.592
4.6AsnLys: 4.6 ± 0.87
4.089AsnLeu: 4.089 ± 1.16
1.278AsnMet: 1.278 ± 0.439
3.578AsnAsn: 3.578 ± 0.129
3.833AsnPro: 3.833 ± 1.705
2.3AsnGln: 2.3 ± 0.477
2.3AsnArg: 2.3 ± 0.838
5.111AsnSer: 5.111 ± 0.97
2.044AsnThr: 2.044 ± 0.58
2.3AsnVal: 2.3 ± 0.372
0.511AsnTrp: 0.511 ± 0.129
3.322AsnTyr: 3.322 ± 0.724
0.0AsnXaa: 0.0 ± 0.0
Pro
1.533ProAla: 1.533 ± 0.672
0.256ProCys: 0.256 ± 0.241
1.789ProAsp: 1.789 ± 0.479
3.322ProGlu: 3.322 ± 0.232
1.533ProPhe: 1.533 ± 0.318
1.533ProGly: 1.533 ± 0.892
0.767ProHis: 0.767 ± 0.159
2.811ProIle: 2.811 ± 0.599
3.067ProLys: 3.067 ± 0.723
2.556ProLeu: 2.556 ± 1.194
0.511ProMet: 0.511 ± 0.129
2.3ProAsn: 2.3 ± 0.722
0.0ProPro: 0.0 ± 0.0
0.511ProGln: 0.511 ± 0.129
1.533ProArg: 1.533 ± 0.672
2.044ProSer: 2.044 ± 0.905
2.044ProThr: 2.044 ± 0.389
3.067ProVal: 3.067 ± 0.723
1.022ProTrp: 1.022 ± 0.746
1.278ProTyr: 1.278 ± 0.791
0.0ProXaa: 0.0 ± 0.0
Gln
1.278GlnAla: 1.278 ± 0.242
0.511GlnCys: 0.511 ± 0.481
2.044GlnAsp: 2.044 ± 0.58
2.3GlnGlu: 2.3 ± 0.477
2.044GlnPhe: 2.044 ± 0.439
1.533GlnGly: 1.533 ± 0.386
0.256GlnHis: 0.256 ± 0.241
3.322GlnIle: 3.322 ± 0.296
3.067GlnLys: 3.067 ± 0.87
2.556GlnLeu: 2.556 ± 0.595
0.767GlnMet: 0.767 ± 0.159
2.556GlnAsn: 2.556 ± 0.644
0.0GlnPro: 0.0 ± 0.0
1.533GlnGln: 1.533 ± 0.318
2.556GlnArg: 2.556 ± 1.191
2.556GlnSer: 2.556 ± 0.365
1.022GlnThr: 1.022 ± 0.258
0.511GlnVal: 0.511 ± 0.129
0.511GlnTrp: 0.511 ± 0.316
0.767GlnTyr: 0.767 ± 0.159
0.0GlnXaa: 0.0 ± 0.0
Arg
0.767ArgAla: 0.767 ± 0.159
1.278ArgCys: 1.278 ± 0.242
1.789ArgAsp: 1.789 ± 0.679
3.067ArgGlu: 3.067 ± 1.186
2.556ArgPhe: 2.556 ± 0.644
1.022ArgGly: 1.022 ± 0.258
1.278ArgHis: 1.278 ± 0.242
3.067ArgIle: 3.067 ± 0.888
2.556ArgLys: 2.556 ± 0.365
5.367ArgLeu: 5.367 ± 0.351
0.256ArgMet: 0.256 ± 0.158
2.3ArgAsn: 2.3 ± 0.372
1.278ArgPro: 1.278 ± 0.597
0.256ArgGln: 0.256 ± 0.241
1.278ArgArg: 1.278 ± 0.439
3.578ArgSer: 3.578 ± 0.699
2.044ArgThr: 2.044 ± 0.389
2.3ArgVal: 2.3 ± 1.187
0.256ArgTrp: 0.256 ± 0.241
2.811ArgTyr: 2.811 ± 0.423
0.0ArgXaa: 0.0 ± 0.0
Ser
2.3SerAla: 2.3 ± 0.876
2.3SerCys: 2.3 ± 1.788
3.322SerAsp: 3.322 ± 0.752
4.344SerGlu: 4.344 ± 0.068
2.556SerPhe: 2.556 ± 0.945
1.789SerGly: 1.789 ± 0.545
0.767SerHis: 0.767 ± 0.159
7.156SerIle: 7.156 ± 1.542
7.411SerLys: 7.411 ± 1.252
6.133SerLeu: 6.133 ± 0.496
0.767SerMet: 0.767 ± 0.159
3.578SerAsn: 3.578 ± 0.708
2.3SerPro: 2.3 ± 0.551
3.578SerGln: 3.578 ± 0.901
5.622SerArg: 5.622 ± 1.743
5.622SerSer: 5.622 ± 0.426
6.133SerThr: 6.133 ± 1.047
4.856SerVal: 4.856 ± 0.762
0.511SerTrp: 0.511 ± 0.316
2.556SerTyr: 2.556 ± 0.485
0.0SerXaa: 0.0 ± 0.0
Thr
2.556ThrAla: 2.556 ± 2.741
1.789ThrCys: 1.789 ± 0.354
3.322ThrAsp: 3.322 ± 0.629
3.578ThrGlu: 3.578 ± 0.88
2.556ThrPhe: 2.556 ± 1.129
3.067ThrGly: 3.067 ± 1.096
1.533ThrHis: 1.533 ± 0.386
5.622ThrIle: 5.622 ± 1.697
5.111ThrLys: 5.111 ± 0.301
6.389ThrLeu: 6.389 ± 0.604
1.789ThrMet: 1.789 ± 0.749
2.811ThrAsn: 2.811 ± 0.728
2.044ThrPro: 2.044 ± 0.58
2.3ThrGln: 2.3 ± 0.477
1.789ThrArg: 1.789 ± 0.354
5.622ThrSer: 5.622 ± 1.456
2.556ThrThr: 2.556 ± 0.644
3.322ThrVal: 3.322 ± 1.643
0.511ThrTrp: 0.511 ± 0.481
2.811ThrTyr: 2.811 ± 0.543
0.0ThrXaa: 0.0 ± 0.0
Val
3.067ValAla: 3.067 ± 1.857
2.044ValCys: 2.044 ± 0.389
1.278ValAsp: 1.278 ± 0.596
2.3ValGlu: 2.3 ± 0.372
1.022ValPhe: 1.022 ± 0.29
1.533ValGly: 1.533 ± 0.386
2.044ValHis: 2.044 ± 0.823
2.556ValIle: 2.556 ± 0.388
6.133ValLys: 6.133 ± 2.273
3.322ValLeu: 3.322 ± 0.296
1.022ValMet: 1.022 ± 0.51
2.3ValAsn: 2.3 ± 0.551
2.044ValPro: 2.044 ± 0.823
0.511ValGln: 0.511 ± 0.316
1.022ValArg: 1.022 ± 0.614
4.344ValSer: 4.344 ± 0.453
3.578ValThr: 3.578 ± 0.408
4.089ValVal: 4.089 ± 2.461
0.511ValTrp: 0.511 ± 0.129
3.322ValTyr: 3.322 ± 0.752
0.0ValXaa: 0.0 ± 0.0
Trp
0.256TrpAla: 0.256 ± 0.758
0.511TrpCys: 0.511 ± 0.72
0.256TrpAsp: 0.256 ± 0.158
0.256TrpGlu: 0.256 ± 0.158
1.533TrpPhe: 1.533 ± 0.386
0.511TrpGly: 0.511 ± 0.756
0.511TrpHis: 0.511 ± 0.481
1.022TrpIle: 1.022 ± 0.258
0.511TrpLys: 0.511 ± 0.129
1.533TrpLeu: 1.533 ± 0.318
0.256TrpMet: 0.256 ± 0.758
0.767TrpAsn: 0.767 ± 0.159
0.256TrpPro: 0.256 ± 0.241
0.511TrpGln: 0.511 ± 0.316
0.256TrpArg: 0.256 ± 0.241
1.278TrpSer: 1.278 ± 0.791
0.256TrpThr: 0.256 ± 0.158
0.256TrpVal: 0.256 ± 0.158
0.256TrpTrp: 0.256 ± 0.158
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.256TyrAla: 0.256 ± 0.241
1.789TyrCys: 1.789 ± 0.354
2.044TyrAsp: 2.044 ± 0.439
2.044TyrGlu: 2.044 ± 0.58
2.811TyrPhe: 2.811 ± 1.031
1.789TyrGly: 1.789 ± 1.275
0.511TyrHis: 0.511 ± 0.316
5.878TyrIle: 5.878 ± 1.914
2.811TyrLys: 2.811 ± 1.175
4.856TyrLeu: 4.856 ± 1.518
1.022TyrMet: 1.022 ± 0.746
3.067TyrAsn: 3.067 ± 0.593
1.278TyrPro: 1.278 ± 0.597
3.067TyrGln: 3.067 ± 0.636
1.789TyrArg: 1.789 ± 0.596
2.811TyrSer: 2.811 ± 0.599
3.322TyrThr: 3.322 ± 0.976
1.278TyrVal: 1.278 ± 0.439
0.511TyrTrp: 0.511 ± 0.481
1.022TyrTyr: 1.022 ± 0.258
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3914 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski