Amino acid dipeptide frequency for Akodon montensis polyomavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as a 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
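The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function names, the pooling of all proteins into a single count, and the normalization by the total number of counted pairs are assumptions.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard one-letter codes
EXPECTED_PER_MILLE = 1000.0 / 400     # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: overlapping adjacent pairs are counted inside each protein
    (never across protein boundaries) and normalized by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    pairs = ("".join(p) for p in product(AMINO_ACIDS, repeat=2))
    if total == 0:
        return {pair: 0.0 for pair in pairs}
    return {pair: 1000.0 * counts[pair] / total for pair in pairs}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"
```

For example, `classify(dipeptide_per_mille(["AAC", "CA"])["AA"])` labels AlaAla as overrepresented in that toy input, because one of the three counted pairs is AA.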

All values are given in per mille (‰); multiply them by 10⁻³ to convert them to fractions.
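As a small worked example of the unit conversion, the helpers below (hypothetical names, not part of any Proteome-pI tool) convert a per mille value to a fraction or a percentage.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0
```

With the AlaAla value of 11.345‰ from the table, this gives a fraction of about 0.011345, or about 1.1345%.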

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 11.345 ± 5.053 | 0.54 ± 0.508 | 2.701 ± 1.191 | 2.161 ± 1.515 | 0.54 ± 0.508 | 7.023 ± 4.874 | 1.08 ± 0.762 | 7.023 ± 2.708 | 2.161 ± 1.591 | 7.563 ± 3.153 | 0.54 ± 0.508 | 0.54 ± 0.508 | 0.54 ± 0.508 | 3.782 ± 0.523 | 2.701 ± 1.316 | 2.701 ± 0.79 | 3.782 ± 1.197 | 3.241 ± 0.812 | 2.701 ± 1.261 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Cys | 1.08 ± 0.493 | 0.54 ± 0.534 | 0.0 ± 0.0 | 0.54 ± 0.398 | 2.701 ± 1.398 | 0.54 ± 0.508 | 0.0 ± 0.0 | 2.701 ± 1.017 | 3.241 ± 1.541 | 1.621 ± 0.77 | 0.54 ± 0.398 | 1.621 ± 0.663 | 0.0 ± 0.0 | 1.621 ± 0.738 | 1.621 ± 0.663 | 0.54 ± 0.398 | 1.621 ± 0.77 | 1.621 ± 0.987 | 1.621 ± 0.77 | 3.241 ± 1.326 | 0.0 ± 0.0 |
| Asp | 1.08 ± 0.493 | 2.161 ± 1.496 | 3.782 ± 2.784 | 4.322 ± 1.697 | 2.701 ± 1.989 | 6.483 ± 3.416 | 0.0 ± 0.0 | 4.322 ± 0.876 | 3.241 ± 1.852 | 2.161 ± 0.681 | 0.0 ± 0.0 | 1.08 ± 0.493 | 2.701 ± 0.557 | 2.161 ± 0.631 | 1.621 ± 0.77 | 1.621 ± 0.738 | 1.621 ± 0.604 | 2.701 ± 0.36 | 1.08 ± 0.762 | 1.621 ± 1.193 | 0.0 ± 0.0 |
| Glu | 6.483 ± 2.03 | 2.701 ± 1.191 | 3.241 ± 1.477 | 5.402 ± 2.944 | 1.08 ± 0.796 | 1.08 ± 1.025 | 1.08 ± 0.493 | 2.161 ± 0.484 | 1.621 ± 1.193 | 8.104 ± 1.266 | 2.161 ± 0.631 | 5.402 ± 2.035 | 3.241 ± 1.83 | 4.862 ± 0.776 | 3.241 ± 0.881 | 2.161 ± 0.986 | 3.241 ± 1.54 | 4.862 ± 2.303 | 0.0 ± 0.0 | 1.621 ± 0.663 | 0.0 ± 0.0 |
| Phe | 1.08 ± 0.796 | 0.0 ± 0.0 | 2.161 ± 0.45 | 3.782 ± 0.936 | 0.54 ± 0.508 | 3.241 ± 1.326 | 1.621 ± 0.882 | 0.54 ± 0.398 | 2.161 ± 1.591 | 2.161 ± 0.635 | 0.54 ± 0.398 | 0.54 ± 0.398 | 1.621 ± 0.738 | 0.54 ± 0.508 | 2.161 ± 1.4 | 1.08 ± 0.796 | 2.161 ± 1.105 | 1.621 ± 1.193 | 0.54 ± 0.508 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Gly | 5.943 ± 3.765 | 1.08 ± 0.796 | 3.241 ± 0.881 | 3.782 ± 2.324 | 1.621 ± 0.604 | 12.966 ± 4.543 | 1.621 ± 0.406 | 5.402 ± 0.924 | 3.241 ± 1.344 | 8.644 ± 1.682 | 0.54 ± 0.457 | 2.161 ± 1.055 | 3.782 ± 0.574 | 4.862 ± 1.863 | 6.483 ± 2.211 | 4.322 ± 0.478 | 4.322 ± 2.232 | 5.943 ± 2.043 | 0.0 ± 0.0 | 1.621 ± 0.77 | 0.0 ± 0.0 |
| His | 1.08 ± 0.53 | 1.621 ± 0.987 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.54 ± 0.508 | 0.54 ± 0.508 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.621 ± 0.77 | 1.08 ± 0.493 | 2.701 ± 1.261 | 2.161 ± 0.995 | 1.08 ± 0.53 | 0.54 ± 0.398 | 3.241 ± 1.207 | 3.241 ± 1.54 | 2.701 ± 0.79 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.701 ± 0.578 | 0.0 ± 0.0 |
| Ile | 3.782 ± 2.078 | 1.08 ± 0.796 | 2.701 ± 0.938 | 1.08 ± 0.762 | 2.701 ± 1.472 | 0.54 ± 0.508 | 0.0 ± 0.0 | 3.782 ± 2.065 | 2.161 ± 0.45 | 3.782 ± 1.702 | 1.621 ± 0.738 | 2.161 ± 1.079 | 5.402 ± 1.993 | 1.621 ± 0.406 | 1.621 ± 0.77 | 3.241 ± 1.151 | 4.322 ± 1.705 | 6.483 ± 1.625 | 0.54 ± 0.534 | 1.621 ± 1.193 | 0.0 ± 0.0 |
| Lys | 3.241 ± 0.61 | 2.161 ± 1.061 | 1.08 ± 0.53 | 5.943 ± 2.351 | 1.621 ± 1.193 | 5.402 ± 1.001 | 2.161 ± 1.591 | 2.161 ± 1.079 | 10.265 ± 3.149 | 3.241 ± 1.852 | 1.621 ± 0.52 | 4.322 ± 0.486 | 1.621 ± 1.244 | 1.08 ± 0.493 | 5.402 ± 1.515 | 3.241 ± 1.207 | 2.161 ± 0.681 | 2.701 ± 1.017 | 0.0 ± 0.0 | 2.701 ± 1.261 | 0.0 ± 0.0 |
| Leu | 3.241 ± 1.09 | 4.322 ± 2.074 | 4.862 ± 2.028 | 3.782 ± 1.047 | 2.161 ± 0.681 | 9.184 ± 2.591 | 4.862 ± 1.533 | 3.782 ± 1.047 | 1.08 ± 0.762 | 10.265 ± 0.943 | 7.023 ± 2.572 | 3.241 ± 1.23 | 4.862 ± 1.787 | 5.943 ± 2.556 | 3.782 ± 0.936 | 8.104 ± 1.848 | 2.161 ± 0.986 | 2.161 ± 0.635 | 0.54 ± 0.534 | 4.862 ± 0.679 | 0.0 ± 0.0 |
| Met | 2.701 ± 1.446 | 1.08 ± 0.493 | 2.701 ± 0.578 | 3.782 ± 1.371 | 0.54 ± 0.398 | 1.621 ± 0.568 | 0.54 ± 0.398 | 2.161 ± 0.45 | 2.701 ± 1.261 | 1.08 ± 0.53 | 0.0 ± 0.0 | 1.621 ± 0.738 | 0.54 ± 0.398 | 1.621 ± 0.77 | 0.54 ± 0.508 | 1.08 ± 0.53 | 0.54 ± 0.508 | 0.54 ± 0.398 | 1.08 ± 0.796 | 1.621 ± 0.918 | 0.0 ± 0.0 |
| Asn | 2.701 ± 0.79 | 1.08 ± 0.493 | 1.621 ± 0.738 | 2.701 ± 1.039 | 0.0 ± 0.0 | 2.161 ± 0.81 | 1.08 ± 0.493 | 5.943 ± 2.494 | 0.54 ± 0.398 | 4.862 ± 0.679 | 1.08 ± 0.493 | 2.701 ± 0.529 | 4.322 ± 1.048 | 1.08 ± 0.796 | 2.161 ± 0.761 | 4.322 ± 1.623 | 1.08 ± 0.748 | 5.402 ± 2.187 | 0.0 ± 0.0 | 2.701 ± 1.017 | 0.0 ± 0.0 |
| Pro | 1.621 ± 0.604 | 0.0 ± 0.0 | 4.322 ± 1.43 | 4.322 ± 0.81 | 0.54 ± 0.398 | 5.943 ± 1.377 | 1.08 ± 0.762 | 1.08 ± 0.493 | 4.322 ± 1.623 | 6.483 ± 2.371 | 1.621 ± 0.77 | 3.782 ± 1.159 | 6.483 ± 2.341 | 0.54 ± 0.398 | 4.862 ± 1.123 | 3.782 ± 1.139 | 0.54 ± 0.398 | 3.782 ± 1.004 | 1.08 ± 0.762 | 1.08 ± 0.493 | 0.0 ± 0.0 |
| Gln | 2.701 ± 0.788 | 1.621 ± 0.663 | 2.701 ± 1.316 | 4.862 ± 1.301 | 1.08 ± 0.796 | 5.943 ± 1.652 | 0.54 ± 0.398 | 2.161 ± 0.681 | 3.241 ± 0.557 | 0.54 ± 0.508 | 2.161 ± 0.45 | 0.54 ± 0.398 | 3.241 ± 0.767 | 2.701 ± 1.384 | 2.161 ± 1.058 | 3.782 ± 0.574 | 2.161 ± 1.116 | 5.943 ± 0.789 | 1.08 ± 0.762 | 1.08 ± 1.015 | 0.0 ± 0.0 |
| Arg | 3.782 ± 2.065 | 1.08 ± 1.015 | 3.241 ± 0.767 | 2.161 ± 0.681 | 1.08 ± 0.493 | 2.701 ± 0.79 | 2.161 ± 0.631 | 2.701 ± 0.746 | 6.483 ± 0.544 | 8.104 ± 1.017 | 1.621 ± 0.663 | 0.0 ± 0.0 | 2.161 ± 1.079 | 2.701 ± 1.446 | 4.862 ± 1.881 | 0.54 ± 0.398 | 4.862 ± 1.818 | 2.701 ± 1.384 | 0.0 ± 0.0 | 2.161 ± 0.761 | 0.0 ± 0.0 |
| Ser | 3.782 ± 1.848 | 2.161 ± 1.061 | 2.701 ± 1.316 | 3.241 ± 1.477 | 1.08 ± 0.796 | 6.483 ± 1.143 | 1.08 ± 0.493 | 0.0 ± 0.0 | 1.08 ± 0.493 | 1.621 ± 0.663 | 0.0 ± 0.0 | 5.943 ± 1.517 | 2.161 ± 0.986 | 6.483 ± 2.349 | 3.782 ± 1.202 | 3.782 ± 1.901 | 4.322 ± 1.9 | 3.241 ± 0.932 | 0.54 ± 0.398 | 2.701 ± 1.894 | 0.0 ± 0.0 |
| Thr | 1.621 ± 0.918 | 1.621 ± 0.663 | 0.54 ± 0.508 | 1.621 ± 0.406 | 2.161 ± 0.45 | 4.322 ± 1.508 | 0.0 ± 0.0 | 1.08 ± 0.493 | 4.862 ± 1.811 | 5.402 ± 1.35 | 2.701 ± 0.79 | 2.161 ± 0.681 | 7.023 ± 2.268 | 3.782 ± 1.004 | 1.621 ± 0.882 | 1.08 ± 0.762 | 3.782 ± 0.936 | 4.862 ± 1.661 | 0.0 ± 0.0 | 1.08 ± 0.493 | 0.0 ± 0.0 |
| Val | 4.322 ± 1.752 | 0.54 ± 0.398 | 3.241 ± 1.207 | 7.023 ± 2.867 | 3.241 ± 0.859 | 3.241 ± 1.421 | 3.241 ± 0.881 | 1.621 ± 0.663 | 2.701 ± 1.448 | 5.943 ± 2.341 | 1.08 ± 0.493 | 4.322 ± 1.044 | 5.402 ± 0.677 | 3.782 ± 2.012 | 2.161 ± 1.525 | 4.862 ± 0.952 | 3.782 ± 0.965 | 4.862 ± 0.913 | 0.54 ± 0.508 | 1.08 ± 0.493 | 0.0 ± 0.0 |
| Trp | 0.54 ± 0.508 | 0.0 ± 0.0 | 0.54 ± 0.534 | 1.621 ± 0.738 | 0.54 ± 0.534 | 1.08 ± 0.53 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.621 ± 0.77 | 2.701 ± 0.578 | 0.0 ± 0.0 | 1.621 ± 0.663 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.701 ± 1.446 | 0.54 ± 0.398 | 0.54 ± 0.398 | 0.0 ± 0.0 |
| Tyr | 0.54 ± 0.398 | 1.621 ± 0.987 | 1.08 ± 0.493 | 1.621 ± 0.77 | 1.621 ± 1.037 | 1.621 ± 1.193 | 2.701 ± 0.578 | 1.621 ± 0.738 | 3.782 ± 1.578 | 4.862 ± 2.028 | 0.0 ± 0.0 | 1.621 ± 0.77 | 1.08 ± 0.762 | 0.54 ± 0.398 | 1.08 ± 0.493 | 2.701 ± 0.746 | 2.161 ± 0.986 | 1.621 ± 0.604 | 2.161 ± 1.055 | 2.161 ± 0.681 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 5 proteins (1852 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
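A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the database's implementation: proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the standard deviation of the resampled values is taken as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille, pooled over proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins.

    Assumption: the error is the standard deviation over the resampled estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)
```

With only 5 proteins, each resample can differ a lot from the original set, which may explain why some errors in the table are large relative to the values themselves.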

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski