Amino acid dipepetide frequency for Erythrura gouldiae polyomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.86AlaAla: 5.86 ± 1.585
1.598AlaCys: 1.598 ± 0.799
2.664AlaAsp: 2.664 ± 1.025
8.524AlaGlu: 8.524 ± 2.356
0.0AlaPhe: 0.0 ± 0.0
3.729AlaGly: 3.729 ± 0.868
1.598AlaHis: 1.598 ± 0.626
6.393AlaIle: 6.393 ± 2.09
2.664AlaLys: 2.664 ± 1.407
8.524AlaLeu: 8.524 ± 3.128
1.066AlaMet: 1.066 ± 0.456
3.197AlaAsn: 3.197 ± 0.642
5.328AlaPro: 5.328 ± 2.147
1.598AlaGln: 1.598 ± 0.742
4.262AlaArg: 4.262 ± 1.742
10.123AlaSer: 10.123 ± 1.535
4.795AlaThr: 4.795 ± 2.075
6.393AlaVal: 6.393 ± 1.74
0.533AlaTrp: 0.533 ± 0.465
3.729AlaTyr: 3.729 ± 2.241
0.0AlaXaa: 0.0 ± 0.0
Cys
1.598CysAla: 1.598 ± 0.684
1.066CysCys: 1.066 ± 0.733
0.0CysAsp: 0.0 ± 0.0
1.598CysGlu: 1.598 ± 0.799
0.533CysPhe: 0.533 ± 0.367
1.066CysGly: 1.066 ± 0.456
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
2.664CysLys: 2.664 ± 1.103
1.598CysLeu: 1.598 ± 0.799
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.598CysPro: 1.598 ± 0.505
0.533CysGln: 0.533 ± 0.367
0.533CysArg: 0.533 ± 0.367
0.533CysSer: 0.533 ± 0.367
2.131CysThr: 2.131 ± 1.224
0.533CysVal: 0.533 ± 0.367
0.0CysTrp: 0.0 ± 0.0
0.533CysTyr: 0.533 ± 0.62
0.0CysXaa: 0.0 ± 0.0
Asp
4.795AspAla: 4.795 ± 1.388
0.533AspCys: 0.533 ± 0.367
0.533AspAsp: 0.533 ± 0.367
2.131AspGlu: 2.131 ± 0.696
2.664AspPhe: 2.664 ± 0.71
2.131AspGly: 2.131 ± 1.142
1.066AspHis: 1.066 ± 0.733
2.131AspIle: 2.131 ± 0.884
2.664AspLys: 2.664 ± 1.261
4.795AspLeu: 4.795 ± 0.991
1.066AspMet: 1.066 ± 0.456
3.197AspAsn: 3.197 ± 0.978
2.664AspPro: 2.664 ± 0.903
1.066AspGln: 1.066 ± 0.733
0.533AspArg: 0.533 ± 0.465
3.197AspSer: 3.197 ± 1.31
4.262AspThr: 4.262 ± 1.721
3.729AspVal: 3.729 ± 1.572
1.066AspTrp: 1.066 ± 0.63
3.197AspTyr: 3.197 ± 0.969
0.0AspXaa: 0.0 ± 0.0
Glu
6.393GluAla: 6.393 ± 2.859
0.0GluCys: 0.0 ± 0.0
6.926GluAsp: 6.926 ± 1.697
11.721GluGlu: 11.721 ± 2.759
1.598GluPhe: 1.598 ± 0.799
6.926GluGly: 6.926 ± 1.577
1.066GluHis: 1.066 ± 0.839
1.066GluIle: 1.066 ± 0.456
4.262GluLys: 4.262 ± 1.374
8.524GluLeu: 8.524 ± 2.078
0.533GluMet: 0.533 ± 0.347
2.664GluAsn: 2.664 ± 0.803
3.197GluPro: 3.197 ± 0.866
1.598GluGln: 1.598 ± 0.799
4.262GluArg: 4.262 ± 1.891
3.729GluSer: 3.729 ± 1.137
2.664GluThr: 2.664 ± 0.909
4.795GluVal: 4.795 ± 1.093
1.066GluTrp: 1.066 ± 0.63
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.598PheAla: 1.598 ± 0.599
0.533PheCys: 0.533 ± 0.367
1.598PheAsp: 1.598 ± 0.684
3.729PheGlu: 3.729 ± 1.62
0.533PhePhe: 0.533 ± 0.465
1.598PheGly: 1.598 ± 1.075
0.0PheHis: 0.0 ± 0.0
0.533PheIle: 0.533 ± 0.367
1.066PheLys: 1.066 ± 0.733
3.197PheLeu: 3.197 ± 1.369
1.066PheMet: 1.066 ± 0.572
3.729PheAsn: 3.729 ± 1.129
2.664PhePro: 2.664 ± 1.287
1.598PheGln: 1.598 ± 0.889
1.598PheArg: 1.598 ± 0.684
4.262PheSer: 4.262 ± 1.277
2.131PheThr: 2.131 ± 0.999
0.533PheVal: 0.533 ± 0.367
0.0PheTrp: 0.0 ± 0.0
0.533PheTyr: 0.533 ± 0.367
0.0PheXaa: 0.0 ± 0.0
Gly
9.59GlyAla: 9.59 ± 2.944
0.533GlyCys: 0.533 ± 0.367
2.131GlyAsp: 2.131 ± 0.884
3.729GlyGlu: 3.729 ± 1.002
3.197GlyPhe: 3.197 ± 0.785
5.86GlyGly: 5.86 ± 1.165
1.598GlyHis: 1.598 ± 0.785
3.197GlyIle: 3.197 ± 1.306
1.598GlyLys: 1.598 ± 0.475
8.524GlyLeu: 8.524 ± 2.819
1.066GlyMet: 1.066 ± 0.733
1.598GlyAsn: 1.598 ± 0.684
8.524GlyPro: 8.524 ± 2.007
2.664GlyGln: 2.664 ± 0.93
3.729GlyArg: 3.729 ± 1.826
6.393GlySer: 6.393 ± 1.054
2.131GlyThr: 2.131 ± 0.707
4.262GlyVal: 4.262 ± 1.201
1.066GlyTrp: 1.066 ± 0.63
2.664GlyTyr: 2.664 ± 0.937
0.0GlyXaa: 0.0 ± 0.0
His
1.066HisAla: 1.066 ± 0.839
2.131HisCys: 2.131 ± 0.618
0.0HisAsp: 0.0 ± 0.0
1.066HisGlu: 1.066 ± 0.63
2.131HisPhe: 2.131 ± 0.558
1.598HisGly: 1.598 ± 0.931
0.533HisHis: 0.533 ± 0.367
0.533HisIle: 0.533 ± 0.367
1.066HisLys: 1.066 ± 0.733
1.598HisLeu: 1.598 ± 0.785
0.0HisMet: 0.0 ± 0.0
1.066HisAsn: 1.066 ± 0.605
2.664HisPro: 2.664 ± 0.59
1.598HisGln: 1.598 ± 0.684
1.598HisArg: 1.598 ± 0.931
1.598HisSer: 1.598 ± 0.887
0.0HisThr: 0.0 ± 0.0
0.533HisVal: 0.533 ± 0.367
0.0HisTrp: 0.0 ± 0.0
0.533HisTyr: 0.533 ± 0.367
0.0HisXaa: 0.0 ± 0.0
Ile
3.729IleAla: 3.729 ± 1.124
0.533IleCys: 0.533 ± 0.465
1.598IleAsp: 1.598 ± 1.1
2.131IleGlu: 2.131 ± 1.286
2.131IlePhe: 2.131 ± 0.813
4.262IleGly: 4.262 ± 2.198
0.0IleHis: 0.0 ± 0.0
2.664IleIle: 2.664 ± 0.909
2.131IleLys: 2.131 ± 0.696
5.328IleLeu: 5.328 ± 0.798
0.0IleMet: 0.0 ± 0.0
2.664IleAsn: 2.664 ± 0.882
2.131IlePro: 2.131 ± 1.286
2.131IleGln: 2.131 ± 0.618
1.598IleArg: 1.598 ± 0.505
2.664IleSer: 2.664 ± 0.936
3.729IleThr: 3.729 ± 0.594
2.131IleVal: 2.131 ± 0.419
1.066IleTrp: 1.066 ± 0.63
0.533IleTyr: 0.533 ± 0.367
0.0IleXaa: 0.0 ± 0.0
Lys
5.328LysAla: 5.328 ± 1.711
0.0LysCys: 0.0 ± 0.0
3.197LysAsp: 3.197 ± 1.368
1.598LysGlu: 1.598 ± 0.505
0.533LysPhe: 0.533 ± 0.367
3.729LysGly: 3.729 ± 1.258
1.598LysHis: 1.598 ± 1.1
2.664LysIle: 2.664 ± 1.207
3.197LysLys: 3.197 ± 1.748
5.86LysLeu: 5.86 ± 1.158
1.598LysMet: 1.598 ± 0.692
3.197LysAsn: 3.197 ± 1.241
1.598LysPro: 1.598 ± 0.505
3.197LysGln: 3.197 ± 1.509
6.393LysArg: 6.393 ± 1.194
2.131LysSer: 2.131 ± 0.912
5.328LysThr: 5.328 ± 0.809
1.598LysVal: 1.598 ± 0.505
0.533LysTrp: 0.533 ± 0.367
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.926LeuAla: 6.926 ± 1.787
1.598LeuCys: 1.598 ± 0.684
5.86LeuAsp: 5.86 ± 2.092
7.459LeuGlu: 7.459 ± 2.18
4.262LeuPhe: 4.262 ± 0.545
4.795LeuGly: 4.795 ± 1.661
2.664LeuHis: 2.664 ± 0.667
4.262LeuIle: 4.262 ± 1.018
3.197LeuLys: 3.197 ± 0.833
9.59LeuLeu: 9.59 ± 1.744
3.197LeuMet: 3.197 ± 1.209
6.393LeuAsn: 6.393 ± 1.949
10.123LeuPro: 10.123 ± 2.447
4.795LeuGln: 4.795 ± 1.527
5.86LeuArg: 5.86 ± 0.584
2.664LeuSer: 2.664 ± 0.615
6.393LeuThr: 6.393 ± 2.177
4.262LeuVal: 4.262 ± 1.313
0.0LeuTrp: 0.0 ± 0.0
6.393LeuTyr: 6.393 ± 1.117
0.0LeuXaa: 0.0 ± 0.0
Met
2.664MetAla: 2.664 ± 0.728
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
2.664MetGlu: 2.664 ± 0.59
0.0MetPhe: 0.0 ± 0.0
1.598MetGly: 1.598 ± 0.475
0.533MetHis: 0.533 ± 0.465
0.0MetIle: 0.0 ± 0.0
2.131MetLys: 2.131 ± 0.696
2.664MetLeu: 2.664 ± 0.882
0.533MetMet: 0.533 ± 0.367
0.533MetAsn: 0.533 ± 0.367
0.533MetPro: 0.533 ± 0.367
1.066MetGln: 1.066 ± 0.743
0.0MetArg: 0.0 ± 0.0
1.066MetSer: 1.066 ± 0.608
1.066MetThr: 1.066 ± 0.456
0.533MetVal: 0.533 ± 0.367
0.533MetTrp: 0.533 ± 0.465
1.066MetTyr: 1.066 ± 0.608
0.0MetXaa: 0.0 ± 0.0
Asn
3.197AsnAla: 3.197 ± 1.225
0.533AsnCys: 0.533 ± 0.367
0.533AsnAsp: 0.533 ± 0.465
3.197AsnGlu: 3.197 ± 1.006
0.533AsnPhe: 0.533 ± 0.367
1.598AsnGly: 1.598 ± 0.684
1.066AsnHis: 1.066 ± 0.456
3.197AsnIle: 3.197 ± 0.642
1.066AsnLys: 1.066 ± 0.456
5.86AsnLeu: 5.86 ± 0.877
1.066AsnMet: 1.066 ± 0.573
1.066AsnAsn: 1.066 ± 0.605
7.459AsnPro: 7.459 ± 1.793
2.131AsnGln: 2.131 ± 0.939
2.131AsnArg: 2.131 ± 0.816
1.598AsnSer: 1.598 ± 0.799
2.131AsnThr: 2.131 ± 0.707
0.533AsnVal: 0.533 ± 0.465
0.533AsnTrp: 0.533 ± 0.465
0.533AsnTyr: 0.533 ± 0.367
0.0AsnXaa: 0.0 ± 0.0
Pro
3.729ProAla: 3.729 ± 1.757
1.066ProCys: 1.066 ± 0.605
6.393ProAsp: 6.393 ± 1.162
6.393ProGlu: 6.393 ± 1.972
0.533ProPhe: 0.533 ± 0.62
7.459ProGly: 7.459 ± 1.132
0.533ProHis: 0.533 ± 0.62
4.262ProIle: 4.262 ± 1.332
5.328ProLys: 5.328 ± 0.891
6.926ProLeu: 6.926 ± 1.894
0.533ProMet: 0.533 ± 0.465
1.066ProAsn: 1.066 ± 1.239
5.86ProPro: 5.86 ± 2.347
2.664ProGln: 2.664 ± 0.845
4.795ProArg: 4.795 ± 2.066
3.197ProSer: 3.197 ± 0.785
4.262ProThr: 4.262 ± 1.332
3.729ProVal: 3.729 ± 1.657
0.533ProTrp: 0.533 ± 0.609
3.729ProTyr: 3.729 ± 1.064
0.0ProXaa: 0.0 ± 0.0
Gln
3.197GlnAla: 3.197 ± 1.132
0.533GlnCys: 0.533 ± 0.609
1.598GlnAsp: 1.598 ± 0.799
0.0GlnGlu: 0.0 ± 0.0
1.066GlnPhe: 1.066 ± 0.733
3.729GlnGly: 3.729 ± 1.175
1.598GlnHis: 1.598 ± 0.889
1.066GlnIle: 1.066 ± 0.733
2.131GlnLys: 2.131 ± 1.066
5.86GlnLeu: 5.86 ± 0.94
0.533GlnMet: 0.533 ± 0.367
3.729GlnAsn: 3.729 ± 1.007
1.598GlnPro: 1.598 ± 0.599
0.0GlnGln: 0.0 ± 0.0
4.262GlnArg: 4.262 ± 1.432
0.533GlnSer: 0.533 ± 0.367
4.262GlnThr: 4.262 ± 2.371
1.066GlnVal: 1.066 ± 0.931
1.066GlnTrp: 1.066 ± 1.239
0.533GlnTyr: 0.533 ± 0.465
0.0GlnXaa: 0.0 ± 0.0
Arg
6.926ArgAla: 6.926 ± 2.224
0.533ArgCys: 0.533 ± 0.367
1.066ArgAsp: 1.066 ± 0.608
4.795ArgGlu: 4.795 ± 1.972
3.197ArgPhe: 3.197 ± 0.525
3.729ArgGly: 3.729 ± 0.613
2.131ArgHis: 2.131 ± 1.261
1.598ArgIle: 1.598 ± 0.684
5.328ArgLys: 5.328 ± 0.957
3.729ArgLeu: 3.729 ± 0.89
2.131ArgMet: 2.131 ± 0.722
1.066ArgAsn: 1.066 ± 0.931
3.729ArgPro: 3.729 ± 1.656
2.131ArgGln: 2.131 ± 0.813
5.328ArgArg: 5.328 ± 2.164
8.524ArgSer: 8.524 ± 4.083
1.598ArgThr: 1.598 ± 0.845
1.598ArgVal: 1.598 ± 0.845
1.066ArgTrp: 1.066 ± 0.733
2.664ArgTyr: 2.664 ± 0.728
0.0ArgXaa: 0.0 ± 0.0
Ser
7.459SerAla: 7.459 ± 1.626
3.197SerCys: 3.197 ± 1.024
2.131SerAsp: 2.131 ± 0.618
2.664SerGlu: 2.664 ± 0.59
5.328SerPhe: 5.328 ± 2.451
3.197SerGly: 3.197 ± 1.044
0.533SerHis: 0.533 ± 0.609
0.533SerIle: 0.533 ± 0.367
5.328SerLys: 5.328 ± 2.389
5.328SerLeu: 5.328 ± 0.98
1.066SerMet: 1.066 ± 0.456
1.598SerAsn: 1.598 ± 0.931
5.328SerPro: 5.328 ± 2.177
3.729SerGln: 3.729 ± 0.56
3.729SerArg: 3.729 ± 2.053
4.795SerSer: 4.795 ± 1.385
4.262SerThr: 4.262 ± 1.113
3.197SerVal: 3.197 ± 0.444
0.533SerTrp: 0.533 ± 0.465
1.598SerTyr: 1.598 ± 0.679
0.0SerXaa: 0.0 ± 0.0
Thr
3.197ThrAla: 3.197 ± 1.731
0.533ThrCys: 0.533 ± 0.62
2.131ThrAsp: 2.131 ± 1.224
3.729ThrGlu: 3.729 ± 0.709
0.533ThrPhe: 0.533 ± 0.62
11.188ThrGly: 11.188 ± 1.449
2.131ThrHis: 2.131 ± 0.756
3.729ThrIle: 3.729 ± 1.007
1.598ThrLys: 1.598 ± 1.238
4.795ThrLeu: 4.795 ± 1.625
1.066ThrMet: 1.066 ± 0.733
0.533ThrAsn: 0.533 ± 0.367
3.729ThrPro: 3.729 ± 0.657
0.533ThrGln: 0.533 ± 0.448
4.262ThrArg: 4.262 ± 0.926
2.664ThrSer: 2.664 ± 1.374
5.86ThrThr: 5.86 ± 0.848
5.328ThrVal: 5.328 ± 1.555
1.066ThrTrp: 1.066 ± 0.63
2.131ThrTyr: 2.131 ± 0.884
0.0ThrXaa: 0.0 ± 0.0
Val
4.262ValAla: 4.262 ± 1.58
0.533ValCys: 0.533 ± 0.367
4.795ValAsp: 4.795 ± 0.823
3.729ValGlu: 3.729 ± 1.007
1.066ValPhe: 1.066 ± 0.733
2.131ValGly: 2.131 ± 1.372
0.533ValHis: 0.533 ± 0.367
2.131ValIle: 2.131 ± 0.618
3.197ValLys: 3.197 ± 1.368
4.795ValLeu: 4.795 ± 1.671
0.0ValMet: 0.0 ± 0.0
1.066ValAsn: 1.066 ± 0.733
1.598ValPro: 1.598 ± 0.599
3.197ValGln: 3.197 ± 0.642
3.197ValArg: 3.197 ± 0.642
4.262ValSer: 4.262 ± 1.401
3.197ValThr: 3.197 ± 1.691
2.131ValVal: 2.131 ± 0.419
1.066ValTrp: 1.066 ± 0.63
1.598ValTyr: 1.598 ± 0.679
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
2.131TrpAsp: 2.131 ± 0.979
1.066TrpGlu: 1.066 ± 0.456
0.0TrpPhe: 0.0 ± 0.0
1.066TrpGly: 1.066 ± 0.63
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.598TrpLys: 1.598 ± 0.931
0.533TrpLeu: 0.533 ± 0.367
1.066TrpMet: 1.066 ± 0.63
1.066TrpAsn: 1.066 ± 0.839
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.598TrpArg: 1.598 ± 0.505
1.066TrpSer: 1.066 ± 0.931
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.066TrpTyr: 1.066 ± 0.63
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.066TyrAla: 1.066 ± 0.733
1.066TyrCys: 1.066 ± 0.608
2.131TyrAsp: 2.131 ± 0.716
1.066TyrGlu: 1.066 ± 0.605
2.664TyrPhe: 2.664 ± 1.279
2.664TyrGly: 2.664 ± 0.59
2.131TyrHis: 2.131 ± 0.765
3.197TyrIle: 3.197 ± 1.891
1.066TyrLys: 1.066 ± 0.733
2.664TyrLeu: 2.664 ± 1.103
1.066TyrMet: 1.066 ± 0.63
0.533TyrAsn: 0.533 ± 0.367
2.664TyrPro: 2.664 ± 1.374
2.131TyrGln: 2.131 ± 0.877
3.729TyrArg: 3.729 ± 1.572
0.533TyrSer: 0.533 ± 0.465
0.533TyrThr: 0.533 ± 0.465
1.598TyrVal: 1.598 ± 0.889
0.533TyrTrp: 0.533 ± 0.465
2.131TyrTyr: 2.131 ± 1.261
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1878 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski