Amino acid dipeptide frequency for Pidgey virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
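As a rough illustration of how such a frequency could be computed, the sketch below counts overlapping pairs of adjacent residues and normalises by the total number of pairs. The function name, the use of one-letter codes, and the normalisation are assumptions made for this example; the page does not state exactly how Proteome-pI computes its values.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping windows (residues i and i+1) are counted, and
    pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences (one-letter codes), not the Pidgey virus proteome.
toy = ["MKKLL", "AKK"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

In the toy input, the pair KK occurs twice among six dipeptides, so it accounts for one third of all pairs.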

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
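The arithmetic behind these numbers can be checked with a small sketch. The function name is hypothetical, and the uniform model (every amino acid equally likely at both positions) is the simplifying assumption described above.

```python
def uniform_dipeptide_expectation(alphabet_size=20):
    """Return (number of dipeptides, expected fraction of each one)."""
    n_dipeptides = alphabet_size ** 2
    return n_dipeptides, 1.0 / n_dipeptides


n20, f20 = uniform_dipeptide_expectation(20)  # 20 standard amino acids
n21, f21 = uniform_dipeptide_expectation(21)  # with Xaa as a 21st symbol

print(f"{n20} dipeptides -> {f20:.2%} each")  # 400 dipeptides -> 0.25% each
print(f"{n21} dipeptides -> {f21:.2%} each")  # 441 dipeptides -> 0.23% each
```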

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
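As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 0.25% (2.5‰). The helper names are hypothetical, and using 2.5‰ as the cutoff is an assumption based on the description above, not a documented rule of the database.

```python
import math

UNIFORM_PER_MILLE = 2.5  # 1/400 expressed in per mille (0.25%)


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def representation(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a value as over- or underrepresented relative to `expected`."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: LysLeu (10.315) and AlaAla (1.629).
print(representation(10.315))  # over
print(representation(1.629))   # under
print(math.isclose(per_mille_to_fraction(1.629), 0.001629))  # True
```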

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.629 ± 1.688
AlaCys: 1.086 ± 0.26
AlaAsp: 1.357 ± 0.813
AlaGlu: 1.629 ± 0.875
AlaPhe: 0.814 ± 0.488
AlaGly: 1.086 ± 0.724
AlaHis: 0.814 ± 0.488
AlaIle: 1.9 ± 0.748
AlaLys: 3.8 ± 0.831
AlaLeu: 2.714 ± 0.891
AlaMet: 0.543 ± 0.159
AlaAsn: 2.172 ± 0.814
AlaPro: 0.543 ± 0.159
AlaGln: 0.814 ± 0.141
AlaArg: 1.357 ± 1.078
AlaSer: 2.714 ± 1.043
AlaThr: 2.443 ± 0.905
AlaVal: 2.986 ± 1.442
AlaTrp: 0.814 ± 0.556
AlaTyr: 1.9 ± 0.327
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.271 ± 0.163
CysCys: 0.271 ± 0.289
CysAsp: 1.357 ± 0.41
CysGlu: 0.543 ± 0.325
CysPhe: 2.172 ± 0.637
CysGly: 0.271 ± 0.289
CysHis: 1.086 ± 1.156
CysIle: 1.9 ± 0.976
CysLys: 1.086 ± 0.724
CysLeu: 3.529 ± 1.31
CysMet: 1.086 ± 0.318
CysAsn: 0.814 ± 0.437
CysPro: 1.086 ± 0.724
CysGln: 1.086 ± 0.26
CysArg: 0.271 ± 0.163
CysSer: 2.443 ± 2.167
CysThr: 1.629 ± 0.478
CysVal: 1.086 ± 1.156
CysTrp: 0.271 ± 0.289
CysTyr: 1.357 ± 0.592
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.357 ± 0.428
AspCys: 0.814 ± 0.437
AspAsp: 5.157 ± 1.219
AspGlu: 2.986 ± 0.201
AspPhe: 3.8 ± 1.114
AspGly: 1.629 ± 0.478
AspHis: 0.271 ± 0.163
AspIle: 4.886 ± 1.333
AspLys: 4.886 ± 0.719
AspLeu: 5.429 ± 2.019
AspMet: 1.629 ± 0.924
AspAsn: 3.8 ± 0.171
AspPro: 2.714 ± 0.82
AspGln: 1.086 ± 0.318
AspArg: 0.814 ± 0.437
AspSer: 4.886 ± 0.477
AspThr: 1.629 ± 0.782
AspVal: 4.615 ± 1.375
AspTrp: 0.271 ± 0.289
AspTyr: 2.986 ± 0.201
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.086 ± 0.26
GluCys: 3.257 ± 1.75
GluAsp: 5.7 ± 2.179
GluGlu: 4.615 ± 2.764
GluPhe: 1.086 ± 0.26
GluGly: 1.9 ± 0.579
GluHis: 1.086 ± 0.469
GluIle: 5.429 ± 0.72
GluLys: 5.972 ± 1.637
GluLeu: 7.6 ± 0.783
GluMet: 1.629 ± 0.976
GluAsn: 3.8 ± 0.771
GluPro: 1.086 ± 0.26
GluGln: 2.443 ± 0.667
GluArg: 0.814 ± 0.141
GluSer: 8.415 ± 1.955
GluThr: 4.343 ± 1.273
GluVal: 3.529 ± 1.221
GluTrp: 0.0 ± 0.0
GluTyr: 2.714 ± 0.82
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.271 ± 0.289
PheCys: 1.357 ± 1.012
PheAsp: 2.986 ± 0.201
PheGlu: 1.086 ± 0.724
PhePhe: 1.629 ± 0.283
PheGly: 1.9 ± 0.399
PheHis: 1.629 ± 0.478
PheIle: 4.886 ± 1.249
PheLys: 4.072 ± 0.707
PheLeu: 4.072 ± 1.136
PheMet: 1.357 ± 0.677
PheAsn: 3.8 ± 1.305
PhePro: 1.9 ± 0.518
PheGln: 1.086 ± 0.26
PheArg: 2.443 ± 0.359
PheSer: 5.157 ± 0.847
PheThr: 2.443 ± 1.048
PheVal: 4.886 ± 1.027
PheTrp: 0.543 ± 0.325
PheTyr: 2.443 ± 0.552
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.9 ± 0.748
GlyCys: 1.086 ± 0.318
GlyAsp: 0.543 ± 0.159
GlyGlu: 2.986 ± 0.751
GlyPhe: 2.172 ± 0.303
GlyGly: 1.629 ± 0.283
GlyHis: 0.543 ± 0.325
GlyIle: 3.257 ± 0.838
GlyLys: 4.343 ± 1.286
GlyLeu: 3.257 ± 0.838
GlyMet: 0.814 ± 0.141
GlyAsn: 1.629 ± 0.419
GlyPro: 0.543 ± 0.325
GlyGln: 1.629 ± 0.283
GlyArg: 1.629 ± 0.976
GlySer: 3.257 ± 0.169
GlyThr: 1.9 ± 0.327
GlyVal: 1.086 ± 0.318
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.086 ± 0.26
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.814 ± 0.437
HisCys: 0.543 ± 0.159
HisAsp: 1.357 ± 0.41
HisGlu: 1.9 ± 0.726
HisPhe: 0.814 ± 0.488
HisGly: 0.814 ± 0.437
HisHis: 0.0 ± 0.0
HisIle: 1.357 ± 0.253
HisLys: 0.543 ± 0.325
HisLeu: 1.629 ± 1.021
HisMet: 0.271 ± 0.61
HisAsn: 1.9 ± 0.385
HisPro: 0.814 ± 0.556
HisGln: 0.271 ± 0.163
HisArg: 0.814 ± 0.437
HisSer: 1.357 ± 0.41
HisThr: 0.543 ± 0.159
HisVal: 2.443 ± 0.652
HisTrp: 0.0 ± 0.0
HisTyr: 1.629 ± 0.567
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.9 ± 0.987
IleCys: 1.357 ± 0.592
IleAsp: 3.8 ± 1.158
IleGlu: 4.615 ± 1.184
IlePhe: 3.257 ± 0.838
IleGly: 2.443 ± 1.229
IleHis: 2.172 ± 0.52
IleIle: 6.786 ± 1.896
IleLys: 9.772 ± 1.831
IleLeu: 9.772 ± 0.909
IleMet: 0.814 ± 0.141
IleAsn: 5.7 ± 1.394
IlePro: 1.629 ± 0.446
IleGln: 3.257 ± 1.468
IleArg: 3.257 ± 0.893
IleSer: 6.243 ± 0.757
IleThr: 2.714 ± 0.468
IleVal: 4.886 ± 1.303
IleTrp: 0.271 ± 0.289
IleTyr: 2.443 ± 1.048
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.257 ± 1.414
LysCys: 1.9 ± 1.59
LysAsp: 2.986 ± 0.796
LysGlu: 5.7 ± 0.498
LysPhe: 2.986 ± 0.512
LysGly: 2.443 ± 0.359
LysHis: 0.814 ± 0.488
LysIle: 4.072 ± 1.383
LysLys: 4.886 ± 1.249
LysLeu: 10.315 ± 2.438
LysMet: 2.443 ± 0.999
LysAsn: 4.615 ± 0.901
LysPro: 3.8 ± 0.171
LysGln: 2.986 ± 0.512
LysArg: 4.072 ± 0.9
LysSer: 5.972 ± 1.072
LysThr: 5.972 ± 0.782
LysVal: 3.8 ± 0.649
LysTrp: 1.086 ± 0.26
LysTyr: 4.072 ± 0.76
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.8 ± 1.036
LeuCys: 3.529 ± 0.656
LeuAsp: 4.886 ± 1.339
LeuGlu: 8.143 ± 2.861
LeuPhe: 3.8 ± 1.036
LeuGly: 3.529 ± 0.459
LeuHis: 2.443 ± 0.287
LeuIle: 7.6 ± 1.424
LeuLys: 7.6 ± 0.982
LeuLeu: 8.686 ± 0.201
LeuMet: 4.072 ± 1.356
LeuAsn: 7.058 ± 1.693
LeuPro: 2.986 ± 0.512
LeuGln: 1.9 ± 0.385
LeuArg: 5.7 ± 1.734
LeuSer: 7.6 ± 0.874
LeuThr: 9.229 ± 0.969
LeuVal: 3.8 ± 0.746
LeuTrp: 1.357 ± 0.41
LeuTyr: 3.257 ± 0.59
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.357 ± 0.41
MetCys: 0.271 ± 0.163
MetAsp: 2.172 ± 0.814
MetGlu: 3.8 ± 1.053
MetPhe: 2.172 ± 0.427
MetGly: 0.543 ± 0.159
MetHis: 0.814 ± 0.488
MetIle: 3.257 ± 0.649
MetLys: 2.714 ± 0.891
MetLeu: 1.9 ± 0.518
MetMet: 0.814 ± 0.141
MetAsn: 2.443 ± 0.287
MetPro: 1.357 ± 0.253
MetGln: 0.814 ± 0.141
MetArg: 0.814 ± 0.141
MetSer: 3.257 ± 1.414
MetThr: 1.086 ± 1.121
MetVal: 0.543 ± 0.159
MetTrp: 0.543 ± 0.325
MetTyr: 0.271 ± 0.163
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.986 ± 0.642
AsnCys: 2.714 ± 1.414
AsnAsp: 3.529 ± 0.05
AsnGlu: 4.072 ± 0.998
AsnPhe: 4.343 ± 0.1
AsnGly: 2.714 ± 0.875
AsnHis: 1.629 ± 0.446
AsnIle: 2.986 ± 0.201
AsnLys: 6.786 ± 0.522
AsnLeu: 7.872 ± 1.017
AsnMet: 2.714 ± 0.519
AsnAsn: 5.7 ± 0.498
AsnPro: 2.443 ± 0.652
AsnGln: 3.257 ± 0.78
AsnArg: 1.9 ± 0.905
AsnSer: 4.615 ± 0.514
AsnThr: 3.8 ± 0.612
AsnVal: 4.886 ± 1.862
AsnTrp: 0.271 ± 0.61
AsnTyr: 2.172 ± 0.378
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.814 ± 0.556
ProCys: 0.271 ± 0.289
ProAsp: 2.172 ± 0.887
ProGlu: 2.714 ± 1.09
ProPhe: 2.443 ± 0.552
ProGly: 0.814 ± 0.488
ProHis: 0.0 ± 0.0
ProIle: 2.443 ± 0.974
ProLys: 2.986 ± 0.201
ProLeu: 2.443 ± 0.667
ProMet: 0.543 ± 0.325
ProAsn: 2.443 ± 1.737
ProPro: 0.271 ± 0.289
ProGln: 1.086 ± 0.598
ProArg: 1.357 ± 0.677
ProSer: 1.9 ± 0.399
ProThr: 2.172 ± 0.378
ProVal: 1.9 ± 0.976
ProTrp: 0.0 ± 0.0
ProTyr: 1.086 ± 0.724
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.629 ± 0.446
GlnCys: 0.543 ± 0.578
GlnAsp: 1.357 ± 0.41
GlnGlu: 2.714 ± 0.519
GlnPhe: 2.443 ± 0.359
GlnGly: 1.357 ± 0.677
GlnHis: 0.543 ± 0.325
GlnIle: 3.529 ± 0.899
GlnLys: 0.814 ± 0.141
GlnLeu: 1.9 ± 0.748
GlnMet: 1.086 ± 0.724
GlnAsn: 3.257 ± 0.59
GlnPro: 0.271 ± 0.289
GlnGln: 0.543 ± 0.159
GlnArg: 0.814 ± 0.867
GlnSer: 2.986 ± 0.708
GlnThr: 2.172 ± 0.637
GlnVal: 2.172 ± 1.301
GlnTrp: 0.543 ± 0.578
GlnTyr: 0.543 ± 0.159
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.9 ± 0.976
ArgCys: 0.814 ± 0.867
ArgAsp: 3.8 ± 0.655
ArgGlu: 2.986 ± 0.642
ArgPhe: 1.9 ± 0.726
ArgGly: 1.086 ± 0.469
ArgHis: 1.086 ± 0.26
ArgIle: 2.986 ± 0.328
ArgLys: 1.629 ± 0.419
ArgLeu: 3.529 ± 1.781
ArgMet: 1.629 ± 0.446
ArgAsn: 2.714 ± 0.82
ArgPro: 1.086 ± 0.469
ArgGln: 0.814 ± 0.488
ArgArg: 2.172 ± 1.301
ArgSer: 3.8 ± 1.453
ArgThr: 2.443 ± 1.18
ArgVal: 1.357 ± 0.428
ArgTrp: 0.271 ± 0.163
ArgTyr: 1.086 ± 0.65
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.172 ± 0.378
SerCys: 1.086 ± 1.156
SerAsp: 4.072 ± 1.439
SerGlu: 5.429 ± 0.356
SerPhe: 5.429 ± 1.14
SerGly: 3.8 ± 0.712
SerHis: 0.543 ± 0.578
SerIle: 6.786 ± 0.874
SerLys: 4.615 ± 0.24
SerLeu: 10.315 ± 1.664
SerMet: 4.343 ± 0.1
SerAsn: 5.157 ± 0.847
SerPro: 2.443 ± 1.075
SerGln: 2.714 ± 1.043
SerArg: 5.157 ± 2.286
SerSer: 9.772 ± 1.976
SerThr: 6.786 ± 0.793
SerVal: 4.343 ± 1.774
SerTrp: 1.086 ± 0.318
SerTyr: 3.529 ± 1.619
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.357 ± 0.253
ThrCys: 0.543 ± 0.578
ThrAsp: 4.615 ± 0.787
ThrGlu: 3.529 ± 1.696
ThrPhe: 4.615 ± 1.088
ThrGly: 2.172 ± 0.637
ThrHis: 1.9 ± 0.518
ThrIle: 4.343 ± 0.594
ThrLys: 3.257 ± 0.893
ThrLeu: 6.786 ± 0.189
ThrMet: 1.357 ± 0.592
ThrAsn: 5.972 ± 1.204
ThrPro: 1.357 ± 0.545
ThrGln: 2.714 ± 2.025
ThrArg: 1.357 ± 0.895
ThrSer: 5.429 ± 1.038
ThrThr: 5.157 ± 2.633
ThrVal: 2.172 ± 0.52
ThrTrp: 0.814 ± 0.488
ThrTyr: 2.443 ± 0.882
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.443 ± 0.652
ValCys: 1.086 ± 0.26
ValAsp: 2.443 ± 0.359
ValGlu: 4.072 ± 0.9
ValPhe: 1.629 ± 0.736
ValGly: 3.8 ± 0.771
ValHis: 1.357 ± 0.428
ValIle: 4.072 ± 0.123
ValLys: 6.515 ± 1.921
ValLeu: 4.615 ± 0.24
ValMet: 1.9 ± 0.353
ValAsn: 4.343 ± 0.41
ValPro: 1.9 ± 0.976
ValGln: 1.086 ± 0.724
ValArg: 2.443 ± 0.287
ValSer: 5.157 ± 0.456
ValThr: 3.257 ± 0.649
ValVal: 2.172 ± 0.303
ValTrp: 1.086 ± 0.318
ValTyr: 1.9 ± 0.579
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.543 ± 0.159
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.271 ± 0.163
TrpPhe: 0.543 ± 0.325
TrpGly: 0.0 ± 0.0
TrpHis: 0.543 ± 0.159
TrpIle: 1.086 ± 0.686
TrpLys: 0.543 ± 0.325
TrpLeu: 0.814 ± 0.488
TrpMet: 0.0 ± 0.0
TrpAsn: 1.086 ± 0.598
TrpPro: 0.543 ± 0.159
TrpGln: 0.543 ± 0.578
TrpArg: 0.0 ± 0.0
TrpSer: 0.543 ± 0.325
TrpThr: 0.543 ± 0.159
TrpVal: 1.086 ± 0.318
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.814 ± 0.437
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.629 ± 0.736
TyrCys: 1.086 ± 0.318
TyrAsp: 1.357 ± 0.41
TyrGlu: 2.714 ± 0.519
TyrPhe: 1.9 ± 0.399
TyrGly: 1.357 ± 0.41
TyrHis: 0.543 ± 0.159
TyrIle: 3.529 ± 0.869
TyrLys: 1.357 ± 0.41
TyrLeu: 4.072 ± 2.187
TyrMet: 1.629 ± 0.283
TyrAsn: 2.986 ± 0.602
TyrPro: 1.086 ± 0.26
TyrGln: 1.086 ± 0.598
TyrArg: 1.9 ± 0.385
TyrSer: 4.072 ± 0.76
TyrThr: 1.629 ± 0.567
TyrVal: 3.529 ± 0.734
TyrTrp: 0.271 ± 0.289
TyrTyr: 1.357 ± 0.253
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3685 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
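One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the replicates is reported as the error. The function names, the toy sequences, and the use of the sample standard deviation are assumptions for illustration; the page does not specify the exact procedure.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping windows."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences (one-letter codes), not the real proteome.
toy = ["MKKLLKK", "AKLMSS", "KKAAKL"]
mean = pair_frequency(toy, "KK")
err = bootstrap_error(toy, "KK")
print(f"KK: {mean:.3f} ± {err:.3f}")
```

With only a few proteins per proteome, each resample contains many repeats, which may explain why some errors in the table are as large as the values themselves.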

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski