Amino acid dipepetide frequency for Pacui virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.284AlaAla: 3.284 ± 0.872
1.516AlaCys: 1.516 ± 1.042
1.768AlaAsp: 1.768 ± 0.792
3.031AlaGlu: 3.031 ± 0.677
2.273AlaPhe: 2.273 ± 0.448
2.526AlaGly: 2.526 ± 1.448
1.01AlaHis: 1.01 ± 0.332
7.83AlaIle: 7.83 ± 5.428
2.273AlaLys: 2.273 ± 0.586
3.789AlaLeu: 3.789 ± 0.562
2.526AlaMet: 2.526 ± 1.799
2.526AlaAsn: 2.526 ± 0.962
1.263AlaPro: 1.263 ± 1.149
2.021AlaGln: 2.021 ± 0.587
2.273AlaArg: 2.273 ± 0.552
2.021AlaSer: 2.021 ± 1.28
2.273AlaThr: 2.273 ± 1.108
2.526AlaVal: 2.526 ± 1.434
0.253AlaTrp: 0.253 ± 0.16
2.021AlaTyr: 2.021 ± 0.587
0.0AlaXaa: 0.0 ± 0.0
Cys
0.505CysAla: 0.505 ± 0.32
0.0CysCys: 0.0 ± 0.0
0.758CysAsp: 0.758 ± 0.354
2.273CysGlu: 2.273 ± 1.392
2.273CysPhe: 2.273 ± 0.774
2.273CysGly: 2.273 ± 2.093
1.01CysHis: 1.01 ± 0.294
3.284CysIle: 3.284 ± 1.332
3.031CysLys: 3.031 ± 1.418
1.516CysLeu: 1.516 ± 0.634
0.758CysMet: 0.758 ± 0.354
1.516CysAsn: 1.516 ± 0.709
1.768CysPro: 1.768 ± 0.631
1.768CysGln: 1.768 ± 0.623
1.01CysArg: 1.01 ± 0.294
2.273CysSer: 2.273 ± 0.586
1.263CysThr: 1.263 ± 0.49
0.505CysVal: 0.505 ± 0.147
0.0CysTrp: 0.0 ± 0.0
1.516CysTyr: 1.516 ± 0.401
0.0CysXaa: 0.0 ± 0.0
Asp
3.536AspAla: 3.536 ± 0.138
2.021AspCys: 2.021 ± 0.843
3.536AspAsp: 3.536 ± 2.152
4.041AspGlu: 4.041 ± 0.038
3.789AspPhe: 3.789 ± 2.301
2.273AspGly: 2.273 ± 0.827
0.253AspHis: 0.253 ± 0.233
5.304AspIle: 5.304 ± 1.321
2.526AspLys: 2.526 ± 0.734
6.062AspLeu: 6.062 ± 0.635
1.516AspMet: 1.516 ± 0.957
2.526AspAsn: 2.526 ± 0.422
1.768AspPro: 1.768 ± 1.12
2.778AspGln: 2.778 ± 0.467
2.021AspArg: 2.021 ± 0.518
4.799AspSer: 4.799 ± 0.383
2.021AspThr: 2.021 ± 0.518
5.557AspVal: 5.557 ± 0.644
0.253AspTrp: 0.253 ± 0.233
3.031AspTyr: 3.031 ± 1.271
0.0AspXaa: 0.0 ± 0.0
Glu
3.536GluAla: 3.536 ± 1.297
1.516GluCys: 1.516 ± 0.709
4.294GluAsp: 4.294 ± 1.1
3.536GluGlu: 3.536 ± 0.432
3.536GluPhe: 3.536 ± 0.432
2.021GluGly: 2.021 ± 0.587
1.01GluHis: 1.01 ± 0.332
5.557GluIle: 5.557 ± 1.404
4.294GluLys: 4.294 ± 1.069
5.052GluLeu: 5.052 ± 0.955
2.526GluMet: 2.526 ± 1.158
3.536GluAsn: 3.536 ± 0.9
1.768GluPro: 1.768 ± 0.792
1.768GluGln: 1.768 ± 0.446
2.778GluArg: 2.778 ± 0.467
4.799GluSer: 4.799 ± 1.028
3.536GluThr: 3.536 ± 1.167
3.536GluVal: 3.536 ± 0.715
1.01GluTrp: 1.01 ± 0.76
2.526GluTyr: 2.526 ± 0.722
0.0GluXaa: 0.0 ± 0.0
Phe
2.778PheAla: 2.778 ± 0.702
2.021PheCys: 2.021 ± 0.664
2.526PheAsp: 2.526 ± 0.429
2.526PheGlu: 2.526 ± 0.682
2.526PhePhe: 2.526 ± 0.997
2.021PheGly: 2.021 ± 0.694
1.768PheHis: 1.768 ± 0.631
3.536PheIle: 3.536 ± 0.432
5.052PheLys: 5.052 ± 1.34
5.557PheLeu: 5.557 ± 1.029
1.01PheMet: 1.01 ± 0.332
2.526PheAsn: 2.526 ± 1.361
0.758PhePro: 0.758 ± 0.201
1.768PheGln: 1.768 ± 0.446
2.778PheArg: 2.778 ± 1.129
4.041PheSer: 4.041 ± 0.705
2.778PheThr: 2.778 ± 1.437
2.526PheVal: 2.526 ± 0.626
0.253PheTrp: 0.253 ± 0.16
2.273PheTyr: 2.273 ± 0.586
0.0PheXaa: 0.0 ± 0.0
Gly
1.768GlyAla: 1.768 ± 1.613
2.778GlyCys: 2.778 ± 0.837
2.778GlyAsp: 2.778 ± 0.855
3.536GlyGlu: 3.536 ± 1.291
2.273GlyPhe: 2.273 ± 0.774
1.768GlyGly: 1.768 ± 0.843
1.01GlyHis: 1.01 ± 0.581
2.273GlyIle: 2.273 ± 1.691
2.273GlyLys: 2.273 ± 0.586
4.547GlyLeu: 4.547 ± 0.499
1.516GlyMet: 1.516 ± 1.108
2.021GlyAsn: 2.021 ± 0.518
1.768GlyPro: 1.768 ± 0.525
2.021GlyGln: 2.021 ± 0.505
3.031GlyArg: 3.031 ± 0.757
2.526GlySer: 2.526 ± 1.97
2.273GlyThr: 2.273 ± 0.552
2.526GlyVal: 2.526 ± 0.429
0.758GlyTrp: 0.758 ± 0.354
1.516GlyTyr: 1.516 ± 0.709
0.0GlyXaa: 0.0 ± 0.0
His
1.263HisAla: 1.263 ± 0.811
0.758HisCys: 0.758 ± 0.201
1.768HisAsp: 1.768 ± 0.525
1.263HisGlu: 1.263 ± 0.313
0.758HisPhe: 0.758 ± 0.354
1.01HisGly: 1.01 ± 0.332
0.253HisHis: 0.253 ± 0.16
1.263HisIle: 1.263 ± 0.811
1.01HisLys: 1.01 ± 0.294
1.263HisLeu: 1.263 ± 0.481
1.263HisMet: 1.263 ± 0.811
1.01HisAsn: 1.01 ± 0.332
0.758HisPro: 0.758 ± 0.201
0.0HisGln: 0.0 ± 0.0
1.01HisArg: 1.01 ± 0.581
1.01HisSer: 1.01 ± 0.868
1.768HisThr: 1.768 ± 0.446
1.516HisVal: 1.516 ± 0.401
0.0HisTrp: 0.0 ± 0.0
0.505HisTyr: 0.505 ± 0.465
0.0HisXaa: 0.0 ± 0.0
Ile
2.778IleAla: 2.778 ± 1.437
2.021IleCys: 2.021 ± 0.843
7.578IleAsp: 7.578 ± 1.042
5.052IleGlu: 5.052 ± 0.955
2.778IlePhe: 2.778 ± 1.437
4.294IleGly: 4.294 ± 1.069
3.031IleHis: 3.031 ± 1.12
5.557IleIle: 5.557 ± 1.404
7.83IleLys: 7.83 ± 2.464
6.82IleLeu: 6.82 ± 1.968
2.273IleMet: 2.273 ± 0.573
4.041IleAsn: 4.041 ± 0.408
2.526IlePro: 2.526 ± 1.448
3.031IleGln: 3.031 ± 1.586
5.557IleArg: 5.557 ± 0.601
6.82IleSer: 6.82 ± 0.734
3.789IleThr: 3.789 ± 1.298
5.304IleVal: 5.304 ± 1.498
0.758IleTrp: 0.758 ± 0.48
2.021IleTyr: 2.021 ± 0.843
0.0IleXaa: 0.0 ± 0.0
Lys
3.031LysAla: 3.031 ± 1.494
1.768LysCys: 1.768 ± 0.934
4.799LysAsp: 4.799 ± 1.497
4.799LysGlu: 4.799 ± 0.862
2.526LysPhe: 2.526 ± 0.722
5.304LysGly: 5.304 ± 0.882
3.031LysHis: 3.031 ± 0.757
6.567LysIle: 6.567 ± 0.748
8.083LysLys: 8.083 ± 1.097
6.567LysLeu: 6.567 ± 1.589
2.778LysMet: 2.778 ± 0.702
3.789LysAsn: 3.789 ± 0.56
3.536LysPro: 3.536 ± 0.432
3.536LysGln: 3.536 ± 0.138
3.284LysArg: 3.284 ± 0.223
6.82LysSer: 6.82 ± 0.734
5.304LysThr: 5.304 ± 2.175
5.052LysVal: 5.052 ± 2.979
1.01LysTrp: 1.01 ± 0.64
1.768LysTyr: 1.768 ± 0.446
0.0LysXaa: 0.0 ± 0.0
Leu
4.799LeuAla: 4.799 ± 2.112
2.526LeuCys: 2.526 ± 0.734
3.536LeuAsp: 3.536 ± 1.509
7.072LeuGlu: 7.072 ± 1.801
5.304LeuPhe: 5.304 ± 0.309
5.052LeuGly: 5.052 ± 0.309
1.516LeuHis: 1.516 ± 0.635
6.315LeuIle: 6.315 ± 1.56
8.588LeuLys: 8.588 ± 2.09
6.062LeuLeu: 6.062 ± 0.725
3.031LeuMet: 3.031 ± 0.363
4.799LeuAsn: 4.799 ± 1.028
1.263LeuPro: 1.263 ± 0.717
3.284LeuGln: 3.284 ± 0.921
2.021LeuArg: 2.021 ± 0.575
6.062LeuSer: 6.062 ± 0.565
5.557LeuThr: 5.557 ± 1.381
3.789LeuVal: 3.789 ± 1.471
0.758LeuTrp: 0.758 ± 0.201
3.789LeuTyr: 3.789 ± 1.079
0.0LeuXaa: 0.0 ± 0.0
Met
3.284MetAla: 3.284 ± 0.328
0.758MetCys: 0.758 ± 0.201
2.526MetAsp: 2.526 ± 0.682
1.516MetGlu: 1.516 ± 1.652
1.01MetPhe: 1.01 ± 1.786
0.253MetGly: 0.253 ± 0.16
0.253MetHis: 0.253 ± 0.233
3.031MetIle: 3.031 ± 1.335
3.031MetLys: 3.031 ± 1.268
2.273MetLeu: 2.273 ± 0.602
0.505MetMet: 0.505 ± 0.32
1.263MetAsn: 1.263 ± 0.313
1.01MetPro: 1.01 ± 0.294
1.263MetGln: 1.263 ± 0.717
1.263MetArg: 1.263 ± 0.8
2.021MetSer: 2.021 ± 0.587
1.516MetThr: 1.516 ± 0.96
2.021MetVal: 2.021 ± 0.587
0.0MetTrp: 0.0 ± 0.0
1.768MetTyr: 1.768 ± 0.525
0.0MetXaa: 0.0 ± 0.0
Asn
2.021AsnAla: 2.021 ± 1.132
0.253AsnCys: 0.253 ± 0.16
3.284AsnAsp: 3.284 ± 0.921
2.273AsnGlu: 2.273 ± 0.602
2.526AsnPhe: 2.526 ± 0.682
0.758AsnGly: 0.758 ± 0.354
0.505AsnHis: 0.505 ± 0.32
3.789AsnIle: 3.789 ± 1.003
3.789AsnLys: 3.789 ± 0.197
6.315AsnLeu: 6.315 ± 1.939
1.263AsnMet: 1.263 ± 0.313
3.284AsnAsn: 3.284 ± 0.872
3.536AsnPro: 3.536 ± 1.509
2.021AsnGln: 2.021 ± 0.664
2.526AsnArg: 2.526 ± 0.682
2.778AsnSer: 2.778 ± 0.78
3.284AsnThr: 3.284 ± 1.208
3.284AsnVal: 3.284 ± 0.223
1.516AsnTrp: 1.516 ± 0.709
2.526AsnTyr: 2.526 ± 0.962
0.0AsnXaa: 0.0 ± 0.0
Pro
2.021ProAla: 2.021 ± 0.505
0.505ProCys: 0.505 ± 0.147
2.273ProAsp: 2.273 ± 0.448
2.526ProGlu: 2.526 ± 1.361
0.758ProPhe: 0.758 ± 0.48
1.768ProGly: 1.768 ± 0.754
0.0ProHis: 0.0 ± 0.0
2.778ProIle: 2.778 ± 0.728
1.768ProLys: 1.768 ± 0.446
3.789ProLeu: 3.789 ± 3.113
1.516ProMet: 1.516 ± 0.44
2.273ProAsn: 2.273 ± 0.586
0.253ProPro: 0.253 ± 0.16
0.505ProGln: 0.505 ± 0.147
0.758ProArg: 0.758 ± 0.48
2.021ProSer: 2.021 ± 0.575
2.021ProThr: 2.021 ± 0.587
1.01ProVal: 1.01 ± 0.581
0.505ProTrp: 0.505 ± 0.32
1.01ProTyr: 1.01 ± 0.332
0.0ProXaa: 0.0 ± 0.0
Gln
1.516GlnAla: 1.516 ± 0.722
1.263GlnCys: 1.263 ± 1.163
1.263GlnAsp: 1.263 ± 0.313
2.526GlnGlu: 2.526 ± 0.962
2.273GlnPhe: 2.273 ± 0.811
2.021GlnGly: 2.021 ± 0.587
0.758GlnHis: 0.758 ± 0.354
3.284GlnIle: 3.284 ± 1.427
5.557GlnLys: 5.557 ± 1.339
2.273GlnLeu: 2.273 ± 1.063
0.505GlnMet: 0.505 ± 0.465
2.021GlnAsn: 2.021 ± 0.664
0.505GlnPro: 0.505 ± 0.32
0.758GlnGln: 0.758 ± 0.201
2.778GlnArg: 2.778 ± 0.467
3.284GlnSer: 3.284 ± 1.427
2.778GlnThr: 2.778 ± 1.341
1.768GlnVal: 1.768 ± 0.446
0.253GlnTrp: 0.253 ± 0.16
1.263GlnTyr: 1.263 ± 0.8
0.0GlnXaa: 0.0 ± 0.0
Arg
2.273ArgAla: 2.273 ± 0.774
1.263ArgCys: 1.263 ± 0.811
3.536ArgAsp: 3.536 ± 0.138
2.273ArgGlu: 2.273 ± 0.811
2.778ArgPhe: 2.778 ± 0.855
1.01ArgGly: 1.01 ± 0.332
1.263ArgHis: 1.263 ± 0.724
3.789ArgIle: 3.789 ± 1.443
4.041ArgLys: 4.041 ± 0.408
4.799ArgLeu: 4.799 ± 2.01
1.516ArgMet: 1.516 ± 1.732
2.526ArgAsn: 2.526 ± 1.267
0.505ArgPro: 0.505 ± 0.917
1.768ArgGln: 1.768 ± 0.525
4.041ArgArg: 4.041 ± 1.364
1.516ArgSer: 1.516 ± 0.44
3.031ArgThr: 3.031 ± 0.881
2.526ArgVal: 2.526 ± 0.682
0.0ArgTrp: 0.0 ± 0.0
1.263ArgTyr: 1.263 ± 0.9
0.0ArgXaa: 0.0 ± 0.0
Ser
3.031SerAla: 3.031 ± 0.283
3.536SerCys: 3.536 ± 0.715
4.547SerAsp: 4.547 ± 2.165
3.284SerGlu: 3.284 ± 0.921
3.031SerPhe: 3.031 ± 0.757
3.031SerGly: 3.031 ± 1.393
0.758SerHis: 0.758 ± 0.201
6.82SerIle: 6.82 ± 0.828
7.072SerLys: 7.072 ± 1.883
4.799SerLeu: 4.799 ± 1.202
1.768SerMet: 1.768 ± 0.792
3.536SerAsn: 3.536 ± 0.715
2.021SerPro: 2.021 ± 0.95
3.284SerGln: 3.284 ± 1.142
3.284SerArg: 3.284 ± 1.142
5.557SerSer: 5.557 ± 0.644
3.284SerThr: 3.284 ± 0.328
5.304SerVal: 5.304 ± 1.196
0.505SerTrp: 0.505 ± 0.917
3.031SerTyr: 3.031 ± 1.271
0.0SerXaa: 0.0 ± 0.0
Thr
3.789ThrAla: 3.789 ± 0.939
2.021ThrCys: 2.021 ± 1.162
4.547ThrAsp: 4.547 ± 1.902
3.031ThrGlu: 3.031 ± 0.363
3.536ThrPhe: 3.536 ± 1.435
2.778ThrGly: 2.778 ± 1.515
0.253ThrHis: 0.253 ± 0.16
4.799ThrIle: 4.799 ± 0.383
4.547ThrLys: 4.547 ± 1.007
4.294ThrLeu: 4.294 ± 2.155
0.253ThrMet: 0.253 ± 0.16
2.778ThrAsn: 2.778 ± 0.855
2.526ThrPro: 2.526 ± 0.734
2.273ThrGln: 2.273 ± 0.905
2.273ThrArg: 2.273 ± 0.602
4.294ThrSer: 4.294 ± 1.246
2.273ThrThr: 2.273 ± 0.586
3.284ThrVal: 3.284 ± 0.872
1.01ThrTrp: 1.01 ± 0.332
2.021ThrTyr: 2.021 ± 0.575
0.0ThrXaa: 0.0 ± 0.0
Val
1.768ValAla: 1.768 ± 0.623
1.516ValCys: 1.516 ± 0.44
1.516ValAsp: 1.516 ± 1.042
4.547ValGlu: 4.547 ± 1.104
2.778ValPhe: 2.778 ± 0.918
2.778ValGly: 2.778 ± 0.728
1.01ValHis: 1.01 ± 0.581
4.294ValIle: 4.294 ± 1.004
4.294ValLys: 4.294 ± 2.187
6.062ValLeu: 6.062 ± 0.794
1.516ValMet: 1.516 ± 0.401
2.778ValAsn: 2.778 ± 1.437
2.021ValPro: 2.021 ± 1.162
3.031ValGln: 3.031 ± 2.085
1.516ValArg: 1.516 ± 0.747
5.304ValSer: 5.304 ± 0.309
5.052ValThr: 5.052 ± 0.309
3.031ValVal: 3.031 ± 0.551
0.253ValTrp: 0.253 ± 0.16
3.536ValTyr: 3.536 ± 1.245
0.0ValXaa: 0.0 ± 0.0
Trp
0.253TrpAla: 0.253 ± 0.16
0.505TrpCys: 0.505 ± 0.147
0.0TrpAsp: 0.0 ± 0.0
0.505TrpGlu: 0.505 ± 0.32
1.516TrpPhe: 1.516 ± 0.635
0.505TrpGly: 0.505 ± 0.465
0.253TrpHis: 0.253 ± 0.233
0.253TrpIle: 0.253 ± 0.233
0.0TrpLys: 0.0 ± 0.0
1.516TrpLeu: 1.516 ± 0.44
0.0TrpMet: 0.0 ± 0.0
0.253TrpAsn: 0.253 ± 0.16
0.0TrpPro: 0.0 ± 0.0
0.505TrpGln: 0.505 ± 0.893
0.505TrpArg: 0.505 ± 0.893
1.263TrpSer: 1.263 ± 0.8
0.253TrpThr: 0.253 ± 0.16
0.758TrpVal: 0.758 ± 0.201
0.0TrpTrp: 0.0 ± 0.0
0.758TrpTyr: 0.758 ± 0.354
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.768TyrAla: 1.768 ± 0.623
0.758TyrCys: 0.758 ± 0.354
2.778TyrAsp: 2.778 ± 0.728
2.526TyrGlu: 2.526 ± 0.422
2.778TyrPhe: 2.778 ± 0.855
1.768TyrGly: 1.768 ± 0.754
0.505TyrHis: 0.505 ± 0.147
3.536TyrIle: 3.536 ± 1.028
4.294TyrLys: 4.294 ± 1.069
1.768TyrLeu: 1.768 ± 0.446
2.273TyrMet: 2.273 ± 1.582
2.526TyrAsn: 2.526 ± 1.267
0.505TyrPro: 0.505 ± 0.147
1.263TyrGln: 1.263 ± 0.313
1.01TyrArg: 1.01 ± 0.332
2.273TyrSer: 2.273 ± 0.811
2.526TyrThr: 2.526 ± 0.962
2.778TyrVal: 2.778 ± 0.322
0.505TyrTrp: 0.505 ± 0.147
1.263TyrTyr: 1.263 ± 0.313
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3960 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski