Amino acid dipepetide frequency for Molossus molossus papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.131AlaAla: 5.131 ± 1.393
0.466AlaCys: 0.466 ± 0.405
3.731AlaAsp: 3.731 ± 1.651
6.063AlaGlu: 6.063 ± 1.246
2.332AlaPhe: 2.332 ± 1.2
3.265AlaGly: 3.265 ± 1.367
0.0AlaHis: 0.0 ± 0.0
1.399AlaIle: 1.399 ± 0.571
4.664AlaLys: 4.664 ± 1.902
4.198AlaLeu: 4.198 ± 2.004
0.466AlaMet: 0.466 ± 0.545
1.399AlaAsn: 1.399 ± 0.641
4.198AlaPro: 4.198 ± 1.289
3.265AlaGln: 3.265 ± 0.882
4.664AlaArg: 4.664 ± 2.117
3.731AlaSer: 3.731 ± 1.191
2.799AlaThr: 2.799 ± 1.175
4.198AlaVal: 4.198 ± 1.542
0.933AlaTrp: 0.933 ± 0.629
3.731AlaTyr: 3.731 ± 1.272
0.0AlaXaa: 0.0 ± 0.0
Cys
1.399CysAla: 1.399 ± 1.897
0.933CysCys: 0.933 ± 0.625
1.399CysAsp: 1.399 ± 0.641
0.466CysGlu: 0.466 ± 0.545
0.933CysPhe: 0.933 ± 1.091
0.466CysGly: 0.466 ± 0.405
0.466CysHis: 0.466 ± 0.632
1.399CysIle: 1.399 ± 0.674
1.866CysLys: 1.866 ± 0.451
2.799CysLeu: 2.799 ± 1.096
0.0CysMet: 0.0 ± 0.0
0.466CysAsn: 0.466 ± 0.366
2.799CysPro: 2.799 ± 1.166
1.399CysGln: 1.399 ± 1.052
1.866CysArg: 1.866 ± 0.82
2.799CysSer: 2.799 ± 1.576
1.866CysThr: 1.866 ± 1.465
0.466CysVal: 0.466 ± 0.632
0.933CysTrp: 0.933 ± 0.392
0.933CysTyr: 0.933 ± 0.392
0.0CysXaa: 0.0 ± 0.0
Asp
6.063AspAla: 6.063 ± 1.301
2.332AspCys: 2.332 ± 0.997
3.265AspAsp: 3.265 ± 1.1
3.731AspGlu: 3.731 ± 1.09
3.265AspPhe: 3.265 ± 1.868
3.731AspGly: 3.731 ± 1.034
0.933AspHis: 0.933 ± 0.787
3.265AspIle: 3.265 ± 0.754
1.866AspLys: 1.866 ± 1.083
6.53AspLeu: 6.53 ± 1.769
0.466AspMet: 0.466 ± 0.366
2.332AspAsn: 2.332 ± 0.616
3.265AspPro: 3.265 ± 0.757
1.866AspGln: 1.866 ± 0.765
2.332AspArg: 2.332 ± 1.221
3.731AspSer: 3.731 ± 0.799
4.664AspThr: 4.664 ± 0.752
2.799AspVal: 2.799 ± 1.286
1.399AspTrp: 1.399 ± 1.099
1.866AspTyr: 1.866 ± 0.533
0.0AspXaa: 0.0 ± 0.0
Glu
3.265GluAla: 3.265 ± 1.901
0.466GluCys: 0.466 ± 0.632
6.53GluAsp: 6.53 ± 0.763
6.063GluGlu: 6.063 ± 2.096
3.265GluPhe: 3.265 ± 0.95
3.265GluGly: 3.265 ± 1.073
0.933GluHis: 0.933 ± 0.885
4.664GluIle: 4.664 ± 1.021
2.332GluLys: 2.332 ± 1.264
6.53GluLeu: 6.53 ± 1.446
1.866GluMet: 1.866 ± 0.704
5.597GluAsn: 5.597 ± 1.784
5.597GluPro: 5.597 ± 2.711
3.265GluGln: 3.265 ± 0.953
5.131GluArg: 5.131 ± 1.22
3.731GluSer: 3.731 ± 0.902
2.332GluThr: 2.332 ± 0.68
6.996GluVal: 6.996 ± 3.438
0.0GluTrp: 0.0 ± 0.0
2.332GluTyr: 2.332 ± 1.657
0.0GluXaa: 0.0 ± 0.0
Phe
3.731PheAla: 3.731 ± 1.885
0.466PheCys: 0.466 ± 0.591
3.265PheAsp: 3.265 ± 1.199
3.731PheGlu: 3.731 ± 1.264
3.731PhePhe: 3.731 ± 1.259
5.597PheGly: 5.597 ± 2.188
0.0PheHis: 0.0 ± 0.0
2.332PheIle: 2.332 ± 0.997
1.866PheLys: 1.866 ± 1.465
6.996PheLeu: 6.996 ± 1.949
0.933PheMet: 0.933 ± 0.59
3.265PheAsn: 3.265 ± 0.862
0.933PhePro: 0.933 ± 0.647
3.265PheGln: 3.265 ± 0.785
1.866PheArg: 1.866 ± 0.51
3.265PheSer: 3.265 ± 0.926
2.332PheThr: 2.332 ± 0.7
2.799PheVal: 2.799 ± 0.649
1.866PheTrp: 1.866 ± 0.783
3.265PheTyr: 3.265 ± 1.414
0.0PheXaa: 0.0 ± 0.0
Gly
1.399GlyAla: 1.399 ± 0.747
0.0GlyCys: 0.0 ± 0.0
3.731GlyAsp: 3.731 ± 0.872
6.996GlyGlu: 6.996 ± 1.789
2.799GlyPhe: 2.799 ± 1.205
6.996GlyGly: 6.996 ± 2.456
0.466GlyHis: 0.466 ± 0.405
3.731GlyIle: 3.731 ± 1.115
3.731GlyLys: 3.731 ± 1.719
3.265GlyLeu: 3.265 ± 0.862
0.933GlyMet: 0.933 ± 0.611
1.866GlyAsn: 1.866 ± 0.783
3.731GlyPro: 3.731 ± 0.584
1.399GlyGln: 1.399 ± 0.91
4.198GlyArg: 4.198 ± 1.514
4.664GlySer: 4.664 ± 0.887
5.131GlyThr: 5.131 ± 2.448
3.265GlyVal: 3.265 ± 1.675
0.0GlyTrp: 0.0 ± 0.0
1.866GlyTyr: 1.866 ± 0.802
0.0GlyXaa: 0.0 ± 0.0
His
0.933HisAla: 0.933 ± 0.392
0.933HisCys: 0.933 ± 0.677
0.933HisAsp: 0.933 ± 0.647
1.399HisGlu: 1.399 ± 0.683
0.933HisPhe: 0.933 ± 0.392
0.0HisGly: 0.0 ± 0.0
0.466HisHis: 0.466 ± 0.366
0.0HisIle: 0.0 ± 0.0
1.399HisLys: 1.399 ± 0.607
1.399HisLeu: 1.399 ± 1.259
0.0HisMet: 0.0 ± 0.0
0.466HisAsn: 0.466 ± 0.366
1.399HisPro: 1.399 ± 0.641
0.466HisGln: 0.466 ± 0.366
1.399HisArg: 1.399 ± 0.729
0.466HisSer: 0.466 ± 0.366
0.933HisThr: 0.933 ± 0.392
2.799HisVal: 2.799 ± 0.65
0.466HisTrp: 0.466 ± 0.366
0.933HisTyr: 0.933 ± 0.677
0.0HisXaa: 0.0 ± 0.0
Ile
2.799IleAla: 2.799 ± 0.789
1.866IleCys: 1.866 ± 0.857
3.265IleAsp: 3.265 ± 0.912
5.597IleGlu: 5.597 ± 0.815
2.332IlePhe: 2.332 ± 0.529
2.332IleGly: 2.332 ± 1.101
0.933IleHis: 0.933 ± 0.625
1.866IleIle: 1.866 ± 0.533
1.866IleLys: 1.866 ± 0.51
5.131IleLeu: 5.131 ± 0.926
0.466IleMet: 0.466 ± 0.364
0.933IleAsn: 0.933 ± 0.809
4.198IlePro: 4.198 ± 1.881
1.866IleGln: 1.866 ± 1.258
1.399IleArg: 1.399 ± 0.66
3.731IleSer: 3.731 ± 0.846
2.799IleThr: 2.799 ± 0.847
2.799IleVal: 2.799 ± 1.527
0.0IleTrp: 0.0 ± 0.0
2.799IleTyr: 2.799 ± 0.952
0.0IleXaa: 0.0 ± 0.0
Lys
2.332LysAla: 2.332 ± 0.781
2.799LysCys: 2.799 ± 1.351
3.265LysAsp: 3.265 ± 1.778
2.332LysGlu: 2.332 ± 1.349
3.731LysPhe: 3.731 ± 1.515
3.265LysGly: 3.265 ± 0.789
1.866LysHis: 1.866 ± 0.826
3.731LysIle: 3.731 ± 1.089
3.265LysLys: 3.265 ± 1.499
1.866LysLeu: 1.866 ± 0.857
1.399LysMet: 1.399 ± 0.84
1.866LysAsn: 1.866 ± 0.783
1.866LysPro: 1.866 ± 0.783
1.866LysGln: 1.866 ± 0.451
5.597LysArg: 5.597 ± 0.289
4.198LysSer: 4.198 ± 1.373
1.866LysThr: 1.866 ± 0.811
2.332LysVal: 2.332 ± 1.142
0.466LysTrp: 0.466 ± 0.394
0.933LysTyr: 0.933 ± 0.413
0.0LysXaa: 0.0 ± 0.0
Leu
3.265LeuAla: 3.265 ± 1.096
4.664LeuCys: 4.664 ± 2.564
5.131LeuAsp: 5.131 ± 1.477
5.597LeuGlu: 5.597 ± 1.831
8.862LeuPhe: 8.862 ± 1.107
6.53LeuGly: 6.53 ± 0.777
1.399LeuHis: 1.399 ± 0.931
2.799LeuIle: 2.799 ± 0.739
5.131LeuLys: 5.131 ± 1.097
7.929LeuLeu: 7.929 ± 1.853
1.399LeuMet: 1.399 ± 0.707
4.198LeuAsn: 4.198 ± 1.439
4.198LeuPro: 4.198 ± 0.778
3.731LeuGln: 3.731 ± 0.991
3.731LeuArg: 3.731 ± 1.567
6.996LeuSer: 6.996 ± 1.606
5.597LeuThr: 5.597 ± 1.109
3.731LeuVal: 3.731 ± 0.828
0.466LeuTrp: 0.466 ± 0.405
4.664LeuTyr: 4.664 ± 1.121
0.0LeuXaa: 0.0 ± 0.0
Met
2.799MetAla: 2.799 ± 1.045
1.866MetCys: 1.866 ± 0.765
1.399MetAsp: 1.399 ± 1.214
0.933MetGlu: 0.933 ± 0.629
0.0MetPhe: 0.0 ± 0.0
0.466MetGly: 0.466 ± 0.366
0.0MetHis: 0.0 ± 0.0
1.399MetIle: 1.399 ± 0.682
1.399MetLys: 1.399 ± 0.253
1.399MetLeu: 1.399 ± 0.683
0.0MetMet: 0.0 ± 0.0
0.466MetAsn: 0.466 ± 0.405
0.0MetPro: 0.0 ± 0.0
0.933MetGln: 0.933 ± 0.732
0.933MetArg: 0.933 ± 0.732
0.933MetSer: 0.933 ± 0.413
0.933MetThr: 0.933 ± 0.809
2.332MetVal: 2.332 ± 0.622
0.0MetTrp: 0.0 ± 0.0
0.466MetTyr: 0.466 ± 0.405
0.0MetXaa: 0.0 ± 0.0
Asn
2.799AsnAla: 2.799 ± 0.725
0.933AsnCys: 0.933 ± 0.732
3.265AsnAsp: 3.265 ± 0.736
4.198AsnGlu: 4.198 ± 1.289
1.399AsnPhe: 1.399 ± 0.607
0.933AsnGly: 0.933 ± 0.809
0.933AsnHis: 0.933 ± 0.732
2.332AsnIle: 2.332 ± 1.137
2.332AsnLys: 2.332 ± 1.507
3.265AsnLeu: 3.265 ± 0.78
0.933AsnMet: 0.933 ± 0.652
2.332AsnAsn: 2.332 ± 0.529
3.265AsnPro: 3.265 ± 1.582
3.265AsnGln: 3.265 ± 1.373
2.332AsnArg: 2.332 ± 1.069
2.799AsnSer: 2.799 ± 0.737
2.799AsnThr: 2.799 ± 0.705
2.332AsnVal: 2.332 ± 1.157
0.466AsnTrp: 0.466 ± 0.405
1.866AsnTyr: 1.866 ± 0.968
0.0AsnXaa: 0.0 ± 0.0
Pro
3.265ProAla: 3.265 ± 1.039
1.866ProCys: 1.866 ± 0.51
5.131ProAsp: 5.131 ± 1.473
4.664ProGlu: 4.664 ± 0.737
4.198ProPhe: 4.198 ± 0.892
2.332ProGly: 2.332 ± 0.974
0.933ProHis: 0.933 ± 0.413
1.399ProIle: 1.399 ± 0.253
3.265ProLys: 3.265 ± 0.683
5.597ProLeu: 5.597 ± 1.293
0.0ProMet: 0.0 ± 0.0
2.799ProAsn: 2.799 ± 1.902
6.996ProPro: 6.996 ± 0.811
1.399ProGln: 1.399 ± 0.682
2.799ProArg: 2.799 ± 1.054
4.664ProSer: 4.664 ± 1.612
2.332ProThr: 2.332 ± 1.463
5.597ProVal: 5.597 ± 3.709
0.933ProTrp: 0.933 ± 0.439
3.265ProTyr: 3.265 ± 1.323
0.0ProXaa: 0.0 ± 0.0
Gln
1.399GlnAla: 1.399 ± 0.707
0.466GlnCys: 0.466 ± 0.366
0.933GlnAsp: 0.933 ± 0.787
2.332GlnGlu: 2.332 ± 1.079
1.866GlnPhe: 1.866 ± 0.451
2.332GlnGly: 2.332 ± 0.94
0.466GlnHis: 0.466 ± 0.366
4.198GlnIle: 4.198 ± 1.851
2.332GlnLys: 2.332 ± 1.468
4.198GlnLeu: 4.198 ± 1.813
1.866GlnMet: 1.866 ± 1.083
2.799GlnAsn: 2.799 ± 0.725
0.933GlnPro: 0.933 ± 0.392
0.933GlnGln: 0.933 ± 0.413
1.866GlnArg: 1.866 ± 1.035
3.265GlnSer: 3.265 ± 1.934
2.332GlnThr: 2.332 ± 0.674
1.866GlnVal: 1.866 ± 0.909
0.466GlnTrp: 0.466 ± 0.366
1.399GlnTyr: 1.399 ± 0.707
0.0GlnXaa: 0.0 ± 0.0
Arg
6.063ArgAla: 6.063 ± 1.11
0.933ArgCys: 0.933 ± 0.659
2.799ArgAsp: 2.799 ± 0.494
3.265ArgGlu: 3.265 ± 0.91
1.866ArgPhe: 1.866 ± 0.765
2.799ArgGly: 2.799 ± 1.207
1.866ArgHis: 1.866 ± 0.857
1.866ArgIle: 1.866 ± 0.795
2.799ArgLys: 2.799 ± 0.802
5.597ArgLeu: 5.597 ± 1.011
0.466ArgMet: 0.466 ± 0.366
1.399ArgAsn: 1.399 ± 0.571
5.597ArgPro: 5.597 ± 1.225
1.866ArgGln: 1.866 ± 0.503
3.731ArgArg: 3.731 ± 1.075
3.265ArgSer: 3.265 ± 0.836
3.731ArgThr: 3.731 ± 1.305
4.664ArgVal: 4.664 ± 1.821
0.933ArgTrp: 0.933 ± 0.652
0.933ArgTyr: 0.933 ± 0.625
0.0ArgXaa: 0.0 ± 0.0
Ser
6.53SerAla: 6.53 ± 2.26
0.933SerCys: 0.933 ± 0.821
2.799SerAsp: 2.799 ± 1.395
5.131SerGlu: 5.131 ± 1.763
2.332SerPhe: 2.332 ± 0.552
4.198SerGly: 4.198 ± 1.346
2.332SerHis: 2.332 ± 1.052
3.731SerIle: 3.731 ± 1.307
1.866SerLys: 1.866 ± 0.857
8.862SerLeu: 8.862 ± 2.18
0.933SerMet: 0.933 ± 0.647
1.866SerAsn: 1.866 ± 0.451
4.664SerPro: 4.664 ± 1.226
2.332SerGln: 2.332 ± 1.035
2.799SerArg: 2.799 ± 1.239
2.332SerSer: 2.332 ± 0.902
4.664SerThr: 4.664 ± 1.086
2.799SerVal: 2.799 ± 1.279
0.466SerTrp: 0.466 ± 0.366
1.866SerTyr: 1.866 ± 0.783
0.0SerXaa: 0.0 ± 0.0
Thr
2.799ThrAla: 2.799 ± 1.136
0.933ThrCys: 0.933 ± 0.659
3.265ThrAsp: 3.265 ± 1.039
2.799ThrGlu: 2.799 ± 0.972
3.265ThrPhe: 3.265 ± 1.252
5.597ThrGly: 5.597 ± 1.885
0.466ThrHis: 0.466 ± 0.366
3.265ThrIle: 3.265 ± 1.245
1.866ThrLys: 1.866 ± 1.118
5.131ThrLeu: 5.131 ± 1.825
3.731ThrMet: 3.731 ± 1.406
3.265ThrAsn: 3.265 ± 0.912
4.198ThrPro: 4.198 ± 1.429
1.399ThrGln: 1.399 ± 0.707
2.799ThrArg: 2.799 ± 0.895
2.332ThrSer: 2.332 ± 0.934
5.597ThrThr: 5.597 ± 1.451
5.131ThrVal: 5.131 ± 2.005
0.933ThrTrp: 0.933 ± 0.629
0.933ThrTyr: 0.933 ± 0.732
0.0ThrXaa: 0.0 ± 0.0
Val
1.866ValAla: 1.866 ± 0.868
1.399ValCys: 1.399 ± 1.897
3.265ValAsp: 3.265 ± 1.199
5.597ValGlu: 5.597 ± 1.583
4.198ValPhe: 4.198 ± 1.924
4.198ValGly: 4.198 ± 1.078
1.866ValHis: 1.866 ± 0.865
3.265ValIle: 3.265 ± 0.832
2.799ValLys: 2.799 ± 1.026
3.731ValLeu: 3.731 ± 1.42
0.933ValMet: 0.933 ± 0.59
3.265ValAsn: 3.265 ± 1.675
4.198ValPro: 4.198 ± 2.129
1.866ValGln: 1.866 ± 0.846
4.664ValArg: 4.664 ± 1.394
4.664ValSer: 4.664 ± 1.459
5.597ValThr: 5.597 ± 2.63
4.198ValVal: 4.198 ± 1.768
0.466ValTrp: 0.466 ± 0.405
0.466ValTyr: 0.466 ± 0.394
0.0ValXaa: 0.0 ± 0.0
Trp
1.399TrpAla: 1.399 ± 0.641
0.0TrpCys: 0.0 ± 0.0
0.933TrpAsp: 0.933 ± 0.652
1.399TrpGlu: 1.399 ± 1.774
0.933TrpPhe: 0.933 ± 0.392
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.866TrpIle: 1.866 ± 0.451
0.466TrpLys: 0.466 ± 0.405
1.399TrpLeu: 1.399 ± 0.253
0.0TrpMet: 0.0 ± 0.0
1.399TrpAsn: 1.399 ± 0.707
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.933TrpArg: 0.933 ± 0.392
0.933TrpSer: 0.933 ± 0.732
0.933TrpThr: 0.933 ± 0.652
0.466TrpVal: 0.466 ± 0.366
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.399TyrAla: 1.399 ± 0.571
1.399TyrCys: 1.399 ± 0.906
1.399TyrAsp: 1.399 ± 0.641
1.866TyrGlu: 1.866 ± 0.701
3.265TyrPhe: 3.265 ± 1.494
1.866TyrGly: 1.866 ± 0.861
1.399TyrHis: 1.399 ± 0.707
0.466TyrIle: 0.466 ± 0.405
4.198TyrLys: 4.198 ± 1.816
4.198TyrLeu: 4.198 ± 0.739
1.399TyrMet: 1.399 ± 0.641
2.799TyrAsn: 2.799 ± 0.505
1.399TyrPro: 1.399 ± 0.729
1.866TyrGln: 1.866 ± 0.686
1.399TyrArg: 1.399 ± 0.931
0.933TyrSer: 0.933 ± 0.439
0.466TyrThr: 0.466 ± 0.405
0.933TyrVal: 0.933 ± 0.809
1.866TyrTrp: 1.866 ± 1.305
1.866TyrTyr: 1.866 ± 1.115
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2145 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski