Amino acid dipeptide frequency for Penicillium digitatum polymycoviruses 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
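As an illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. It is a minimal sketch, not the database's own code: the function name `dipeptide_frequencies`, the use of one-letter codes, the choice not to count pairs that span two proteins, and normalization by the total number of pairs are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all sequences.

    Hypothetical helper: the counting and normalization rules here are
    assumptions, not taken from the database documentation.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of width 2 along each protein separately,
        # so no pair spans the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences (made up): pairs are MA, AA, AL and AA, AV.
example = dipeptide_frequencies(["MAAL", "AAV"])
print(example["AA"])  # 2 of 5 pairs
```

The returned frequencies always sum to 1000 ‰, which is a quick sanity check on any real input.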

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
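The expected value and the over/under distinction can be sketched as follows. This assumes the expectation is the uniform one described above (1 in 400, or 1 in 441 with Xaa); the function names are hypothetical, and the page does not state the exact threshold used for the red and blue marking.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected frequency (per mille) if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


print(expected_uniform_per_mille())    # 20 amino acids: 2.5 per mille = 0.25 %
print(expected_uniform_per_mille(21))  # with Xaa as a 21st residue
print(classify(9.024))                 # AlaAla value from the table
print(classify(0.43))                  # AlaTrp value from the table
```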

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
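For example, the unit conversion for a single table value might look like this (the helper name is hypothetical; 9.024 is the AlaAla entry):

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction."""
    return value_per_mille * 1e-3


ala_ala = per_mille_to_fraction(9.024)
print(f"{ala_ala:.6f}")          # fraction of all dipeptides
print(f"{ala_ala * 100:.4f} %")  # the same value as a percentage
```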

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.024 ± 2.508
AlaCys: 2.149 ± 1.023
AlaAsp: 6.446 ± 1.668
AlaGlu: 3.868 ± 0.714
AlaPhe: 4.727 ± 1.007
AlaGly: 9.024 ± 0.547
AlaHis: 3.868 ± 2.309
AlaIle: 3.008 ± 1.269
AlaLys: 4.727 ± 1.939
AlaLeu: 10.743 ± 1.317
AlaMet: 3.008 ± 0.856
AlaAsn: 2.149 ± 0.516
AlaPro: 6.446 ± 1.504
AlaGln: 1.289 ± 0.555
AlaArg: 9.884 ± 3.807
AlaSer: 6.446 ± 2.89
AlaThr: 6.016 ± 1.027
AlaVal: 11.173 ± 1.737
AlaTrp: 0.43 ± 0.37
AlaTyr: 4.297 ± 2.046
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.289 ± 0.846
CysCys: 0.0 ± 0.0
CysAsp: 1.289 ± 0.638
CysGlu: 0.859 ± 0.402
CysPhe: 0.859 ± 0.371
CysGly: 0.43 ± 0.362
CysHis: 0.43 ± 0.37
CysIle: 0.43 ± 0.327
CysLys: 0.0 ± 0.0
CysLeu: 1.289 ± 1.11
CysMet: 0.43 ± 0.327
CysAsn: 0.859 ± 0.596
CysPro: 0.0 ± 0.0
CysGln: 0.859 ± 0.339
CysArg: 0.859 ± 0.74
CysSer: 0.859 ± 0.654
CysThr: 0.43 ± 0.362
CysVal: 2.149 ± 0.462
CysTrp: 0.0 ± 0.0
CysTyr: 0.43 ± 0.37
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.306 ± 1.594
AspCys: 0.43 ± 0.327
AspAsp: 3.868 ± 1.76
AspGlu: 3.008 ± 1.4
AspPhe: 1.719 ± 0.437
AspGly: 6.446 ± 1.378
AspHis: 1.719 ± 0.332
AspIle: 3.868 ± 2.248
AspLys: 3.438 ± 1.488
AspLeu: 5.157 ± 1.791
AspMet: 0.43 ± 0.281
AspAsn: 1.719 ± 0.845
AspPro: 4.297 ± 0.692
AspGln: 2.149 ± 0.884
AspArg: 2.578 ± 0.737
AspSer: 2.578 ± 0.721
AspThr: 3.868 ± 0.796
AspVal: 5.587 ± 1.43
AspTrp: 0.859 ± 0.339
AspTyr: 1.719 ± 0.332
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.727 ± 1.202
GluCys: 0.43 ± 0.37
GluAsp: 2.578 ± 0.737
GluGlu: 5.157 ± 0.575
GluPhe: 2.578 ± 0.737
GluGly: 3.008 ± 1.261
GluHis: 2.149 ± 1.254
GluIle: 0.43 ± 0.327
GluLys: 3.008 ± 1.221
GluLeu: 7.735 ± 1.006
GluMet: 0.43 ± 0.327
GluAsn: 0.43 ± 0.327
GluPro: 3.008 ± 0.589
GluGln: 0.43 ± 0.37
GluArg: 4.727 ± 1.231
GluSer: 3.008 ± 0.584
GluThr: 2.149 ± 0.986
GluVal: 3.868 ± 1.734
GluTrp: 0.0 ± 0.0
GluTyr: 5.157 ± 1.442
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.578 ± 0.753
PheCys: 1.289 ± 0.692
PheAsp: 1.719 ± 0.93
PheGlu: 2.149 ± 0.973
PhePhe: 1.289 ± 0.555
PheGly: 4.297 ± 0.95
PheHis: 0.43 ± 0.37
PheIle: 2.578 ± 0.753
PheLys: 0.0 ± 0.0
PheLeu: 3.868 ± 1.505
PheMet: 1.289 ± 1.11
PheAsn: 0.43 ± 0.327
PhePro: 1.289 ± 0.199
PheGln: 0.0 ± 0.0
PheArg: 1.719 ± 0.726
PheSer: 2.149 ± 0.987
PheThr: 0.43 ± 0.362
PheVal: 5.157 ± 0.896
PheTrp: 0.43 ± 0.362
PheTyr: 0.859 ± 0.371
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.595 ± 0.744
GlyCys: 0.859 ± 0.654
GlyAsp: 5.587 ± 0.873
GlyGlu: 2.578 ± 0.617
GlyPhe: 3.438 ± 0.567
GlyGly: 7.306 ± 3.035
GlyHis: 1.719 ± 0.437
GlyIle: 2.149 ± 1.405
GlyLys: 3.438 ± 0.742
GlyLeu: 7.735 ± 1.25
GlyMet: 2.578 ± 0.973
GlyAsn: 3.008 ± 0.182
GlyPro: 5.157 ± 1.989
GlyGln: 3.008 ± 1.243
GlyArg: 7.306 ± 2.077
GlySer: 6.446 ± 2.12
GlyThr: 4.727 ± 1.21
GlyVal: 4.297 ± 2.212
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.438 ± 1.264
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.719 ± 0.976
HisCys: 0.859 ± 0.74
HisAsp: 1.719 ± 0.804
HisGlu: 0.859 ± 0.402
HisPhe: 0.43 ± 0.362
HisGly: 2.578 ± 0.314
HisHis: 0.859 ± 0.402
HisIle: 0.859 ± 0.654
HisLys: 0.43 ± 0.517
HisLeu: 1.719 ± 0.93
HisMet: 0.859 ± 0.654
HisAsn: 1.289 ± 0.63
HisPro: 1.289 ± 0.981
HisGln: 1.719 ± 0.939
HisArg: 2.149 ± 1.335
HisSer: 1.289 ± 0.647
HisThr: 1.719 ± 0.804
HisVal: 3.438 ± 0.662
HisTrp: 0.859 ± 0.339
HisTyr: 0.43 ± 0.37
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.438 ± 1.363
IleCys: 0.859 ± 0.593
IleAsp: 3.008 ± 0.446
IleGlu: 2.149 ± 0.785
IlePhe: 1.719 ± 0.476
IleGly: 1.719 ± 0.976
IleHis: 0.0 ± 0.0
IleIle: 1.719 ± 0.939
IleLys: 0.859 ± 0.402
IleLeu: 3.008 ± 1.251
IleMet: 0.859 ± 0.402
IleAsn: 0.43 ± 0.37
IlePro: 3.008 ± 0.856
IleGln: 0.859 ± 0.593
IleArg: 2.149 ± 0.822
IleSer: 2.149 ± 0.423
IleThr: 1.719 ± 0.976
IleVal: 1.719 ± 1.044
IleTrp: 0.0 ± 0.0
IleTyr: 0.43 ± 0.517
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.297 ± 2.893
LysCys: 0.43 ± 0.362
LysAsp: 0.859 ± 0.654
LysGlu: 0.43 ± 0.362
LysPhe: 1.719 ± 0.651
LysGly: 3.438 ± 1.303
LysHis: 1.719 ± 0.395
LysIle: 0.859 ± 0.593
LysLys: 0.0 ± 0.0
LysLeu: 3.868 ± 1.74
LysMet: 0.0 ± 0.0
LysAsn: 0.43 ± 0.327
LysPro: 3.008 ± 2.028
LysGln: 1.719 ± 0.742
LysArg: 3.008 ± 0.804
LysSer: 2.149 ± 0.501
LysThr: 2.149 ± 0.653
LysVal: 2.149 ± 0.697
LysTrp: 0.43 ± 0.327
LysTyr: 0.859 ± 0.522
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.743 ± 3.761
LeuCys: 1.289 ± 0.555
LeuAsp: 6.016 ± 2.416
LeuGlu: 6.016 ± 1.541
LeuPhe: 3.438 ± 0.952
LeuGly: 7.735 ± 2.005
LeuHis: 2.578 ± 1.473
LeuIle: 1.719 ± 0.967
LeuLys: 2.578 ± 0.948
LeuLeu: 11.603 ± 1.417
LeuMet: 2.578 ± 0.948
LeuAsn: 1.719 ± 0.845
LeuPro: 5.157 ± 1.978
LeuGln: 1.289 ± 1.11
LeuArg: 8.595 ± 1.131
LeuSer: 6.876 ± 1.494
LeuThr: 4.297 ± 1.824
LeuVal: 11.173 ± 3.227
LeuTrp: 0.43 ± 0.327
LeuTyr: 0.43 ± 0.37
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.868 ± 1.072
MetCys: 0.0 ± 0.0
MetAsp: 0.43 ± 0.37
MetGlu: 1.289 ± 0.53
MetPhe: 0.43 ± 0.327
MetGly: 0.859 ± 0.654
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.859 ± 0.402
MetLeu: 1.719 ± 0.804
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.859 ± 0.402
MetGln: 0.0 ± 0.0
MetArg: 3.438 ± 0.48
MetSer: 3.008 ± 0.446
MetThr: 1.289 ± 0.555
MetVal: 2.578 ± 0.799
MetTrp: 0.0 ± 0.0
MetTyr: 0.43 ± 0.327
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.008 ± 1.072
AsnCys: 0.43 ± 0.37
AsnAsp: 1.289 ± 0.199
AsnGlu: 0.859 ± 0.654
AsnPhe: 0.859 ± 0.339
AsnGly: 0.859 ± 0.74
AsnHis: 0.0 ± 0.0
AsnIle: 0.43 ± 0.37
AsnLys: 1.289 ± 0.481
AsnLeu: 3.868 ± 1.339
AsnMet: 0.859 ± 0.402
AsnAsn: 0.43 ± 0.327
AsnPro: 0.859 ± 0.371
AsnGln: 0.859 ± 0.522
AsnArg: 1.289 ± 0.481
AsnSer: 1.719 ± 0.395
AsnThr: 2.149 ± 0.423
AsnVal: 2.578 ± 1.275
AsnTrp: 0.43 ± 0.37
AsnTyr: 0.43 ± 0.327
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.727 ± 0.88
ProCys: 0.43 ± 0.37
ProAsp: 5.157 ± 0.575
ProGlu: 5.157 ± 0.575
ProPhe: 0.43 ± 0.37
ProGly: 4.297 ± 1.509
ProHis: 0.859 ± 0.371
ProIle: 2.578 ± 0.809
ProLys: 3.008 ± 1.972
ProLeu: 3.438 ± 0.963
ProMet: 1.289 ± 0.692
ProAsn: 1.719 ± 0.679
ProPro: 3.008 ± 1.379
ProGln: 0.859 ± 0.402
ProArg: 3.438 ± 0.769
ProSer: 4.727 ± 1.353
ProThr: 2.578 ± 0.314
ProVal: 3.868 ± 1.3
ProTrp: 0.43 ± 0.37
ProTyr: 2.149 ± 0.785
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.008 ± 1.4
GlnCys: 0.0 ± 0.0
GlnAsp: 0.859 ± 0.339
GlnGlu: 0.43 ± 0.37
GlnPhe: 1.719 ± 1.449
GlnGly: 3.008 ± 1.129
GlnHis: 0.0 ± 0.0
GlnIle: 0.859 ± 0.402
GlnLys: 0.43 ± 0.327
GlnLeu: 3.438 ± 1.082
GlnMet: 0.859 ± 0.654
GlnAsn: 0.43 ± 0.517
GlnPro: 0.43 ± 0.327
GlnGln: 0.859 ± 0.402
GlnArg: 0.859 ± 0.654
GlnSer: 2.149 ± 1.077
GlnThr: 2.578 ± 1.324
GlnVal: 1.719 ± 0.804
GlnTrp: 0.859 ± 0.402
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.314 ± 2.69
ArgCys: 1.289 ± 0.555
ArgAsp: 3.868 ± 0.736
ArgGlu: 1.719 ± 1.48
ArgPhe: 4.297 ± 1.371
ArgGly: 6.876 ± 0.855
ArgHis: 3.868 ± 0.83
ArgIle: 2.149 ± 1.09
ArgLys: 1.289 ± 0.555
ArgLeu: 7.735 ± 0.645
ArgMet: 1.289 ± 0.565
ArgAsn: 3.868 ± 0.214
ArgPro: 3.438 ± 1.099
ArgGln: 2.149 ± 1.015
ArgArg: 9.454 ± 0.497
ArgSer: 5.587 ± 1.277
ArgThr: 2.578 ± 0.617
ArgVal: 7.735 ± 1.671
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.008 ± 1.072
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.016 ± 2.656
SerCys: 0.43 ± 0.362
SerAsp: 4.727 ± 1.112
SerGlu: 5.587 ± 1.451
SerPhe: 0.859 ± 0.593
SerGly: 6.016 ± 1.028
SerHis: 2.149 ± 0.516
SerIle: 2.578 ± 0.897
SerLys: 2.578 ± 0.314
SerLeu: 4.727 ± 1.231
SerMet: 1.719 ± 1.309
SerAsn: 0.43 ± 0.327
SerPro: 5.587 ± 1.472
SerGln: 2.149 ± 1.072
SerArg: 4.727 ± 1.708
SerSer: 4.727 ± 1.362
SerThr: 3.438 ± 0.388
SerVal: 4.727 ± 1.408
SerTrp: 1.289 ± 0.555
SerTyr: 3.438 ± 0.671
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.735 ± 2.417
ThrCys: 1.289 ± 0.63
ThrAsp: 0.859 ± 0.654
ThrGlu: 2.149 ± 1.24
ThrPhe: 0.859 ± 0.371
ThrGly: 4.727 ± 1.348
ThrHis: 1.289 ± 0.981
ThrIle: 1.289 ± 0.647
ThrLys: 1.289 ± 0.199
ThrLeu: 3.868 ± 1.107
ThrMet: 0.0 ± 0.339
ThrAsn: 0.43 ± 0.37
ThrPro: 2.578 ± 1.275
ThrGln: 1.289 ± 0.199
ThrArg: 5.587 ± 0.98
ThrSer: 4.297 ± 2.312
ThrThr: 2.149 ± 0.86
ThrVal: 6.016 ± 0.893
ThrTrp: 0.43 ± 0.362
ThrTyr: 0.859 ± 0.339
ThrXaa: 0.0 ± 0.0
Val
ValAla: 12.462 ± 2.186
ValCys: 0.43 ± 0.37
ValAsp: 8.595 ± 0.538
ValGlu: 7.306 ± 1.018
ValPhe: 2.149 ± 0.516
ValGly: 7.735 ± 1.085
ValHis: 3.008 ± 0.182
ValIle: 3.438 ± 1.096
ValLys: 3.008 ± 1.379
ValLeu: 6.016 ± 1.147
ValMet: 1.289 ± 0.199
ValAsn: 2.578 ± 0.809
ValPro: 3.438 ± 1.099
ValGln: 2.578 ± 0.799
ValArg: 8.165 ± 0.773
ValSer: 6.016 ± 0.893
ValThr: 3.008 ± 1.406
ValVal: 7.306 ± 2.16
ValTrp: 1.289 ± 1.087
ValTyr: 1.719 ± 0.804
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.289 ± 0.638
TrpCys: 0.43 ± 0.37
TrpAsp: 0.859 ± 0.339
TrpGlu: 1.719 ± 0.742
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.43 ± 0.327
TrpLeu: 1.719 ± 0.804
TrpMet: 0.43 ± 0.37
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.43 ± 0.362
TrpThr: 0.0 ± 0.0
TrpVal: 0.859 ± 0.339
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.578 ± 0.473
TyrCys: 0.43 ± 0.517
TyrAsp: 3.008 ± 1.207
TyrGlu: 1.719 ± 0.986
TyrPhe: 0.43 ± 0.327
TyrGly: 3.008 ± 0.994
TyrHis: 0.43 ± 0.37
TyrIle: 0.859 ± 0.596
TyrLys: 0.859 ± 0.654
TyrLeu: 3.008 ± 0.57
TyrMet: 0.43 ± 0.362
TyrAsn: 2.149 ± 0.516
TyrPro: 1.289 ± 0.647
TyrGln: 0.43 ± 0.327
TyrArg: 2.578 ± 1.275
TyrSer: 1.289 ± 0.837
TyrThr: 2.149 ± 0.423
TyrVal: 3.438 ± 0.388
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.43 ± 0.327
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2328 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level
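A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the error is taken as the standard deviation across replicates. The page does not spell out the exact procedure, and the function names and toy sequences are made up for the example.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the error is the spread across replicates; the original
    page only says 'bootstrapping (x100) at the protein level'.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MAALAAV", "MKAAR", "MGGS", "MAAAK"]  # made-up sequences
err = bootstrap_error(toy, "AA")
print(f"AA: {pair_per_mille(toy, 'AA'):.3f} ± {err:.3f}")
```

With only a handful of proteins, as here, the resampled proteomes differ a lot from one another, which is one reason small proteomes tend to show large errors relative to the values.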

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski