Amino acid dipeptide frequency for Plantago asiatica mosaic potexvirus (P1AMV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
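
As a rough illustration of the calculation described above, the following sketch counts overlapping residue pairs in a set of protein sequences, expresses them in per mille, and compares each value with the uniform expectation of 1/400. The function names, the toy sequences, and the choice not to count pairs across protein boundaries are assumptions made for this example; the database's exact procedure may differ.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted within each protein only and are
    normalised by the total number of pairs in the whole set.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Hypothetical toy input in one-letter code, not taken from the proteome.
toy = ["MALLA", "ALP"]
freqs = dipeptide_permille(toy)
for dp in sorted(freqs):
    print(dp, round(freqs[dp], 3), classify(freqs[dp]))
```

With real data, a table value such as 7.797‰ corresponds to a fraction of 0.007797 and would be classified as overrepresented under this simple threshold.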

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.797 ± 6.997
AlaCys: 0.487 ± 0.917
AlaAsp: 3.899 ± 1.478
AlaGlu: 1.462 ± 0.838
AlaPhe: 5.361 ± 0.822
AlaGly: 6.335 ± 3.477
AlaHis: 3.899 ± 1.745
AlaIle: 2.924 ± 1.676
AlaLys: 5.361 ± 2.315
AlaLeu: 16.082 ± 4.626
AlaMet: 1.949 ± 1.194
AlaAsn: 4.873 ± 1.761
AlaPro: 4.873 ± 2.114
AlaGln: 6.823 ± 1.93
AlaArg: 3.899 ± 4.101
AlaSer: 4.873 ± 2.386
AlaThr: 5.848 ± 1.382
AlaVal: 2.924 ± 0.816
AlaTrp: 0.487 ± 0.279
AlaTyr: 1.949 ± 1.117
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.975 ± 0.559
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.487 ± 0.279
CysPhe: 0.975 ± 0.769
CysGly: 1.462 ± 1.189
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.487 ± 0.279
CysPro: 5.361 ± 2.874
CysGln: 0.487 ± 0.279
CysArg: 0.975 ± 0.769
CysSer: 0.487 ± 0.279
CysThr: 1.462 ± 1.167
CysVal: 0.487 ± 0.279
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.361 ± 1.942
AspCys: 0.487 ± 0.279
AspAsp: 2.437 ± 1.397
AspGlu: 1.949 ± 0.92
AspPhe: 3.411 ± 1.296
AspGly: 2.437 ± 1.888
AspHis: 1.462 ± 0.838
AspIle: 2.437 ± 1.397
AspLys: 1.462 ± 0.689
AspLeu: 3.899 ± 2.235
AspMet: 0.975 ± 0.73
AspAsn: 0.975 ± 1.258
AspPro: 3.411 ± 1.034
AspGln: 1.949 ± 0.775
AspArg: 1.462 ± 0.838
AspSer: 2.437 ± 1.314
AspThr: 2.924 ± 1.08
AspVal: 2.437 ± 0.939
AspTrp: 0.975 ± 0.559
AspTyr: 0.975 ± 1.258
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.31 ± 2.548
GluCys: 0.487 ± 0.279
GluAsp: 2.437 ± 1.397
GluGlu: 3.411 ± 1.439
GluPhe: 2.924 ± 1.148
GluGly: 2.924 ± 1.148
GluHis: 1.462 ± 1.098
GluIle: 1.949 ± 1.117
GluLys: 1.949 ± 1.117
GluLeu: 4.873 ± 2.05
GluMet: 0.975 ± 0.559
GluAsn: 1.462 ± 0.838
GluPro: 1.949 ± 0.747
GluGln: 1.949 ± 0.792
GluArg: 2.437 ± 1.397
GluSer: 2.924 ± 1.378
GluThr: 2.437 ± 0.881
GluVal: 3.411 ± 0.838
GluTrp: 0.975 ± 0.559
GluTyr: 0.487 ± 0.832
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.411 ± 1.101
PheCys: 0.975 ± 0.769
PheAsp: 4.386 ± 2.127
PheGlu: 3.899 ± 1.331
PhePhe: 1.949 ± 0.747
PheGly: 0.487 ± 0.279
PheHis: 1.462 ± 0.838
PheIle: 1.462 ± 1.523
PheLys: 0.975 ± 0.559
PheLeu: 6.823 ± 0.81
PheMet: 1.462 ± 0.838
PheAsn: 0.487 ± 0.279
PhePro: 2.437 ± 1.193
PheGln: 2.924 ± 1.148
PheArg: 1.949 ± 1.117
PheSer: 3.899 ± 1.123
PheThr: 3.899 ± 0.956
PheVal: 1.949 ± 0.747
PheTrp: 0.487 ± 0.279
PheTyr: 1.462 ± 1.27
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.386 ± 1.915
GlyCys: 1.462 ± 0.771
GlyAsp: 3.411 ± 1.518
GlyGlu: 1.462 ± 0.838
GlyPhe: 2.924 ± 1.409
GlyGly: 3.411 ± 1.101
GlyHis: 3.899 ± 1.523
GlyIle: 1.949 ± 1.185
GlyLys: 2.924 ± 1.016
GlyLeu: 5.848 ± 4.375
GlyMet: 0.0 ± 0.0
GlyAsn: 0.487 ± 0.279
GlyPro: 3.899 ± 0.716
GlyGln: 2.437 ± 0.881
GlyArg: 0.975 ± 1.258
GlySer: 3.411 ± 2.602
GlyThr: 2.924 ± 2.349
GlyVal: 1.949 ± 1.538
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.462 ± 0.838
GlyXaa: 0.0 ± 0.0
His
HisAla: 4.873 ± 1.951
HisCys: 0.0 ± 0.0
HisAsp: 0.487 ± 0.279
HisGlu: 0.975 ± 0.559
HisPhe: 2.924 ± 1.378
HisGly: 1.949 ± 1.196
HisHis: 1.462 ± 1.189
HisIle: 0.487 ± 0.279
HisLys: 0.975 ± 0.559
HisLeu: 2.924 ± 1.36
HisMet: 0.487 ± 0.708
HisAsn: 0.975 ± 0.559
HisPro: 2.437 ± 2.61
HisGln: 1.949 ± 1.117
HisArg: 3.899 ± 2.364
HisSer: 2.437 ± 1.244
HisThr: 3.411 ± 1.289
HisVal: 1.949 ± 1.964
HisTrp: 0.487 ± 0.279
HisTyr: 1.462 ± 0.771
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.411 ± 0.838
IleCys: 0.0 ± 0.0
IleAsp: 0.0 ± 0.0
IleGlu: 1.949 ± 1.117
IlePhe: 2.924 ± 1.676
IleGly: 0.487 ± 1.38
IleHis: 1.462 ± 0.838
IleIle: 1.949 ± 1.563
IleLys: 3.411 ± 0.838
IleLeu: 4.873 ± 2.568
IleMet: 1.462 ± 0.838
IleAsn: 2.924 ± 1.378
IlePro: 1.949 ± 0.792
IleGln: 1.949 ± 1.842
IleArg: 0.975 ± 0.711
IleSer: 6.335 ± 2.245
IleThr: 5.361 ± 1.539
IleVal: 0.487 ± 0.998
IleTrp: 0.487 ± 0.917
IleTyr: 1.462 ± 0.838
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.899 ± 2.235
LysCys: 0.0 ± 0.0
LysAsp: 1.949 ± 1.117
LysGlu: 3.899 ± 2.235
LysPhe: 2.924 ± 1.669
LysGly: 1.949 ± 0.775
LysHis: 0.0 ± 0.0
LysIle: 2.437 ± 0.801
LysLys: 2.924 ± 1.676
LysLeu: 7.31 ± 3.011
LysMet: 0.487 ± 0.279
LysAsn: 1.462 ± 0.838
LysPro: 4.386 ± 1.268
LysGln: 0.0 ± 0.0
LysArg: 0.487 ± 0.279
LysSer: 3.411 ± 1.034
LysThr: 3.411 ± 1.034
LysVal: 2.437 ± 0.801
LysTrp: 0.0 ± 0.0
LysTyr: 0.975 ± 0.559
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.747 ± 5.964
LeuCys: 1.462 ± 1.523
LeuAsp: 4.873 ± 1.341
LeuGlu: 7.797 ± 1.563
LeuPhe: 2.924 ± 0.771
LeuGly: 6.335 ± 1.789
LeuHis: 3.411 ± 1.034
LeuIle: 4.873 ± 2.148
LeuLys: 6.823 ± 2.383
LeuLeu: 9.259 ± 5.302
LeuMet: 0.487 ± 0.279
LeuAsn: 4.386 ± 2.114
LeuPro: 11.696 ± 1.298
LeuGln: 4.386 ± 1.698
LeuArg: 3.899 ± 0.984
LeuSer: 9.259 ± 3.522
LeuThr: 9.259 ± 3.037
LeuVal: 5.361 ± 1.348
LeuTrp: 1.462 ± 0.838
LeuTyr: 2.924 ± 1.148
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.949 ± 1.538
MetCys: 0.487 ± 0.279
MetAsp: 0.487 ± 0.832
MetGlu: 0.487 ± 0.279
MetPhe: 0.487 ± 0.279
MetGly: 0.975 ± 0.559
MetHis: 0.487 ± 0.998
MetIle: 1.462 ± 0.838
MetLys: 0.975 ± 0.559
MetLeu: 1.462 ± 0.838
MetMet: 0.487 ± 0.279
MetAsn: 0.0 ± 0.0
MetPro: 0.487 ± 0.279
MetGln: 0.487 ± 0.279
MetArg: 0.975 ± 0.559
MetSer: 0.975 ± 1.258
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.487 ± 0.279
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.411 ± 1.411
AsnCys: 1.462 ± 0.838
AsnAsp: 1.462 ± 0.838
AsnGlu: 1.462 ± 0.705
AsnPhe: 0.487 ± 0.832
AsnGly: 0.0 ± 0.0
AsnHis: 0.487 ± 0.279
AsnIle: 1.949 ± 0.747
AsnLys: 0.975 ± 0.711
AsnLeu: 5.361 ± 4.008
AsnMet: 0.0 ± 0.0
AsnAsn: 0.975 ± 0.711
AsnPro: 5.361 ± 1.91
AsnGln: 0.0 ± 0.0
AsnArg: 1.949 ± 1.117
AsnSer: 2.924 ± 1.676
AsnThr: 3.899 ± 1.594
AsnVal: 1.949 ± 1.117
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.924 ± 1.676
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.848 ± 1.953
ProCys: 0.975 ± 0.711
ProAsp: 4.386 ± 1.464
ProGlu: 6.823 ± 2.765
ProPhe: 2.437 ± 2.432
ProGly: 3.899 ± 0.716
ProHis: 4.386 ± 5.135
ProIle: 2.924 ± 0.816
ProLys: 5.361 ± 3.073
ProLeu: 4.873 ± 3.556
ProMet: 0.0 ± 0.0
ProAsn: 2.437 ± 2.542
ProPro: 6.823 ± 1.257
ProGln: 4.386 ± 0.648
ProArg: 2.924 ± 1.542
ProSer: 9.747 ± 4.54
ProThr: 9.259 ± 4.165
ProVal: 5.361 ± 1.313
ProTrp: 0.975 ± 0.559
ProTyr: 0.975 ± 0.559
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.411 ± 1.383
GlnCys: 0.487 ± 0.279
GlnAsp: 2.924 ± 1.676
GlnGlu: 1.949 ± 0.747
GlnPhe: 2.924 ± 1.539
GlnGly: 1.949 ± 0.792
GlnHis: 2.437 ± 0.862
GlnIle: 2.924 ± 1.072
GlnLys: 0.487 ± 0.279
GlnLeu: 4.873 ± 0.696
GlnMet: 0.487 ± 0.279
GlnAsn: 1.462 ± 0.689
GlnPro: 3.899 ± 1.631
GlnGln: 1.462 ± 0.838
GlnArg: 1.949 ± 0.775
GlnSer: 2.437 ± 1.397
GlnThr: 5.361 ± 2.206
GlnVal: 1.949 ± 0.747
GlnTrp: 0.975 ± 0.559
GlnTyr: 0.975 ± 0.847
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.361 ± 1.942
ArgCys: 0.975 ± 0.559
ArgAsp: 2.924 ± 1.36
ArgGlu: 2.437 ± 1.397
ArgPhe: 2.924 ± 1.072
ArgGly: 3.899 ± 0.956
ArgHis: 1.462 ± 1.167
ArgIle: 1.462 ± 0.838
ArgLys: 0.975 ± 1.312
ArgLeu: 4.386 ± 1.281
ArgMet: 0.0 ± 0.883
ArgAsn: 1.462 ± 0.705
ArgPro: 1.462 ± 1.167
ArgGln: 1.462 ± 0.838
ArgArg: 2.437 ± 2.432
ArgSer: 3.899 ± 3.422
ArgThr: 4.386 ± 1.281
ArgVal: 0.487 ± 0.279
ArgTrp: 0.975 ± 0.769
ArgTyr: 1.949 ± 1.185
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.386 ± 2.047
SerCys: 1.949 ± 3.132
SerAsp: 2.437 ± 0.881
SerGlu: 1.949 ± 1.117
SerPhe: 3.411 ± 0.861
SerGly: 3.899 ± 2.818
SerHis: 4.873 ± 1.146
SerIle: 2.437 ± 2.489
SerLys: 3.411 ± 1.38
SerLeu: 7.797 ± 2.255
SerMet: 0.0 ± 0.0
SerAsn: 3.899 ± 2.182
SerPro: 9.747 ± 2.027
SerGln: 3.899 ± 0.991
SerArg: 6.335 ± 0.859
SerSer: 6.335 ± 4.333
SerThr: 6.335 ± 1.086
SerVal: 1.462 ± 1.098
SerTrp: 0.487 ± 0.832
SerTyr: 2.437 ± 1.193
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 10.234 ± 4.468
ThrCys: 0.975 ± 0.711
ThrAsp: 2.924 ± 1.113
ThrGlu: 2.924 ± 0.816
ThrPhe: 1.949 ± 1.117
ThrGly: 3.411 ± 1.563
ThrHis: 2.924 ± 1.676
ThrIle: 3.899 ± 2.484
ThrLys: 2.437 ± 1.244
ThrLeu: 10.234 ± 1.82
ThrMet: 2.437 ± 1.397
ThrAsn: 4.386 ± 1.172
ThrPro: 10.234 ± 2.453
ThrGln: 4.386 ± 1.178
ThrArg: 3.411 ± 2.427
ThrSer: 4.873 ± 2.628
ThrThr: 6.823 ± 2.672
ThrVal: 4.386 ± 2.067
ThrTrp: 0.487 ± 0.279
ThrTyr: 1.462 ± 0.838
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.949 ± 1.423
ValCys: 0.975 ± 0.711
ValAsp: 0.975 ± 1.258
ValGlu: 1.949 ± 1.117
ValPhe: 0.487 ± 0.279
ValGly: 1.462 ± 1.365
ValHis: 0.487 ± 0.832
ValIle: 3.899 ± 1.472
ValLys: 2.437 ± 0.881
ValLeu: 4.873 ± 1.208
ValMet: 0.487 ± 0.279
ValAsn: 2.437 ± 0.881
ValPro: 2.924 ± 1.409
ValGln: 2.924 ± 1.072
ValArg: 4.386 ± 1.172
ValSer: 3.411 ± 1.341
ValThr: 5.361 ± 2.085
ValVal: 3.899 ± 2.095
ValTrp: 0.0 ± 0.0
ValTyr: 0.975 ± 1.258
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.975 ± 0.769
TrpCys: 0.0 ± 0.0
TrpAsp: 0.975 ± 0.711
TrpGlu: 0.975 ± 0.559
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.487 ± 0.279
TrpLeu: 1.462 ± 0.838
TrpMet: 0.0 ± 0.0
TrpAsn: 0.975 ± 0.769
TrpPro: 0.0 ± 0.0
TrpGln: 0.975 ± 0.559
TrpArg: 0.0 ± 0.0
TrpSer: 0.487 ± 0.279
TrpThr: 0.0 ± 0.0
TrpVal: 2.437 ± 1.397
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.411 ± 0.838
TyrCys: 0.0 ± 0.0
TyrAsp: 0.487 ± 0.279
TyrGlu: 0.487 ± 0.279
TyrPhe: 2.437 ± 1.57
TyrGly: 1.949 ± 1.185
TyrHis: 0.487 ± 0.279
TyrIle: 2.437 ± 0.939
TyrLys: 0.0 ± 0.0
TyrLeu: 3.411 ± 1.289
TyrMet: 0.487 ± 0.279
TyrAsn: 0.487 ± 0.279
TyrPro: 1.462 ± 0.771
TyrGln: 0.487 ± 0.279
TyrArg: 0.487 ± 0.279
TyrSer: 2.924 ± 1.072
TyrThr: 2.437 ± 1.244
TyrVal: 0.975 ± 1.258
TyrTrp: 0.487 ± 0.279
TyrTyr: 0.975 ± 0.847
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2053 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski