Amino acid dipepetide frequency for Deinbollia mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.451AlaAla: 2.451 ± 0.951
1.225AlaCys: 1.225 ± 0.674
1.838AlaAsp: 1.838 ± 1.055
1.225AlaGlu: 1.225 ± 1.065
1.225AlaPhe: 1.225 ± 0.711
1.838AlaGly: 1.838 ± 0.69
1.225AlaHis: 1.225 ± 0.806
3.064AlaIle: 3.064 ± 1.58
4.902AlaLys: 4.902 ± 1.321
3.676AlaLeu: 3.676 ± 1.485
0.0AlaMet: 0.0 ± 0.0
1.838AlaAsn: 1.838 ± 1.274
0.613AlaPro: 0.613 ± 0.554
4.902AlaGln: 4.902 ± 2.138
3.676AlaArg: 3.676 ± 1.675
9.191AlaSer: 9.191 ± 1.748
4.289AlaThr: 4.289 ± 1.779
1.225AlaVal: 1.225 ± 1.16
1.225AlaTrp: 1.225 ± 0.655
0.613AlaTyr: 0.613 ± 0.533
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.225CysCys: 1.225 ± 1.49
0.613CysAsp: 0.613 ± 0.569
1.225CysGlu: 1.225 ± 0.674
0.613CysPhe: 0.613 ± 0.554
1.225CysGly: 1.225 ± 0.687
0.0CysHis: 0.0 ± 0.0
0.613CysIle: 0.613 ± 0.507
0.613CysLys: 0.613 ± 0.58
1.838CysLeu: 1.838 ± 1.171
1.225CysMet: 1.225 ± 0.806
1.838CysAsn: 1.838 ± 0.681
1.838CysPro: 1.838 ± 1.502
0.613CysGln: 0.613 ± 0.745
0.613CysArg: 0.613 ± 0.533
1.838CysSer: 1.838 ± 1.858
1.838CysThr: 1.838 ± 0.773
2.451CysVal: 2.451 ± 1.348
0.0CysTrp: 0.0 ± 0.0
0.613CysTyr: 0.613 ± 0.569
0.0CysXaa: 0.0 ± 0.0
Asp
1.838AspAla: 1.838 ± 0.619
0.613AspCys: 0.613 ± 0.507
1.838AspAsp: 1.838 ± 1.117
1.838AspGlu: 1.838 ± 1.195
1.225AspPhe: 1.225 ± 0.68
1.225AspGly: 1.225 ± 1.065
3.676AspHis: 3.676 ± 1.996
3.064AspIle: 3.064 ± 1.178
2.451AspLys: 2.451 ± 1.544
7.353AspLeu: 7.353 ± 2.311
0.613AspMet: 0.613 ± 0.569
4.902AspAsn: 4.902 ± 1.708
1.838AspPro: 1.838 ± 0.814
0.0AspGln: 0.0 ± 0.0
2.451AspArg: 2.451 ± 1.047
5.515AspSer: 5.515 ± 1.405
1.225AspThr: 1.225 ± 0.628
4.902AspVal: 4.902 ± 1.559
1.225AspTrp: 1.225 ± 0.687
1.838AspTyr: 1.838 ± 0.868
0.0AspXaa: 0.0 ± 0.0
Glu
3.676GluAla: 3.676 ± 1.119
0.613GluCys: 0.613 ± 0.569
3.676GluAsp: 3.676 ± 1.374
2.451GluGlu: 2.451 ± 1.002
1.838GluPhe: 1.838 ± 1.2
5.515GluGly: 5.515 ± 0.736
0.613GluHis: 0.613 ± 0.507
2.451GluIle: 2.451 ± 1.266
0.0GluLys: 0.0 ± 0.0
3.676GluLeu: 3.676 ± 1.569
0.0GluMet: 0.0 ± 0.0
2.451GluAsn: 2.451 ± 1.319
4.289GluPro: 4.289 ± 1.43
2.451GluGln: 2.451 ± 1.361
1.838GluArg: 1.838 ± 1.048
3.064GluSer: 3.064 ± 1.233
1.838GluThr: 1.838 ± 0.814
1.225GluVal: 1.225 ± 0.791
0.613GluTrp: 0.613 ± 0.619
1.838GluTyr: 1.838 ± 1.048
0.0GluXaa: 0.0 ± 0.0
Phe
1.225PheAla: 1.225 ± 0.7
0.613PheCys: 0.613 ± 0.58
2.451PheAsp: 2.451 ± 0.998
0.0PheGlu: 0.0 ± 0.0
2.451PhePhe: 2.451 ± 1.047
1.225PheGly: 1.225 ± 0.674
1.838PheHis: 1.838 ± 0.917
2.451PheIle: 2.451 ± 2.13
3.676PheLys: 3.676 ± 1.423
4.902PheLeu: 4.902 ± 1.527
0.613PheMet: 0.613 ± 0.533
4.902PheAsn: 4.902 ± 1.594
2.451PhePro: 2.451 ± 1.05
1.225PheGln: 1.225 ± 0.687
4.289PheArg: 4.289 ± 1.469
3.064PheSer: 3.064 ± 1.275
3.676PheThr: 3.676 ± 1.095
0.613PheVal: 0.613 ± 0.569
0.613PheTrp: 0.613 ± 0.507
1.225PheTyr: 1.225 ± 0.837
0.0PheXaa: 0.0 ± 0.0
Gly
3.676GlyAla: 3.676 ± 1.675
1.225GlyCys: 1.225 ± 0.837
4.902GlyAsp: 4.902 ± 0.839
3.064GlyGlu: 3.064 ± 1.143
1.838GlyPhe: 1.838 ± 0.903
2.451GlyGly: 2.451 ± 0.938
1.225GlyHis: 1.225 ± 0.845
4.289GlyIle: 4.289 ± 1.163
3.676GlyLys: 3.676 ± 2.087
1.838GlyLeu: 1.838 ± 1.211
1.225GlyMet: 1.225 ± 0.632
1.225GlyAsn: 1.225 ± 1.138
4.902GlyPro: 4.902 ± 1.632
1.838GlyGln: 1.838 ± 1.122
3.064GlyArg: 3.064 ± 1.0
1.225GlySer: 1.225 ± 0.628
3.676GlyThr: 3.676 ± 1.793
2.451GlyVal: 2.451 ± 1.262
0.0GlyTrp: 0.0 ± 0.0
0.613GlyTyr: 0.613 ± 0.619
0.0GlyXaa: 0.0 ± 0.0
His
1.838HisAla: 1.838 ± 1.117
1.225HisCys: 1.225 ± 1.239
1.225HisAsp: 1.225 ± 0.978
1.225HisGlu: 1.225 ± 0.682
1.225HisPhe: 1.225 ± 0.845
1.225HisGly: 1.225 ± 0.832
0.613HisHis: 0.613 ± 0.619
3.064HisIle: 3.064 ± 0.855
1.838HisLys: 1.838 ± 0.997
1.838HisLeu: 1.838 ± 1.091
0.0HisMet: 0.0 ± 0.0
3.676HisAsn: 3.676 ± 1.289
1.225HisPro: 1.225 ± 0.666
1.225HisGln: 1.225 ± 0.837
3.064HisArg: 3.064 ± 1.186
2.451HisSer: 2.451 ± 1.322
2.451HisThr: 2.451 ± 1.653
1.838HisVal: 1.838 ± 1.083
0.0HisTrp: 0.0 ± 0.0
1.838HisTyr: 1.838 ± 1.01
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
0.613IleCys: 0.613 ± 0.745
3.676IleAsp: 3.676 ± 1.599
4.289IleGlu: 4.289 ± 1.513
2.451IlePhe: 2.451 ± 0.793
0.613IleGly: 0.613 ± 0.619
0.613IleHis: 0.613 ± 0.554
6.127IleIle: 6.127 ± 1.64
8.578IleLys: 8.578 ± 2.345
4.289IleLeu: 4.289 ± 1.158
0.0IleMet: 0.0 ± 0.0
4.289IleAsn: 4.289 ± 1.451
1.225IlePro: 1.225 ± 0.687
4.289IleGln: 4.289 ± 1.395
6.127IleArg: 6.127 ± 2.811
6.74IleSer: 6.74 ± 1.661
5.515IleThr: 5.515 ± 1.564
2.451IleVal: 2.451 ± 1.31
0.0IleTrp: 0.0 ± 0.0
2.451IleTyr: 2.451 ± 1.164
0.0IleXaa: 0.0 ± 0.0
Lys
1.838LysAla: 1.838 ± 0.874
1.838LysCys: 1.838 ± 0.857
3.676LysAsp: 3.676 ± 1.608
4.289LysGlu: 4.289 ± 1.486
2.451LysPhe: 2.451 ± 1.092
1.225LysGly: 1.225 ± 0.628
2.451LysHis: 2.451 ± 0.951
3.064LysIle: 3.064 ± 0.885
3.676LysLys: 3.676 ± 1.585
3.676LysLeu: 3.676 ± 0.985
0.613LysMet: 0.613 ± 0.58
4.289LysAsn: 4.289 ± 1.919
5.515LysPro: 5.515 ± 1.482
2.451LysGln: 2.451 ± 0.812
3.064LysArg: 3.064 ± 1.583
4.289LysSer: 4.289 ± 1.236
2.451LysThr: 2.451 ± 0.938
4.289LysVal: 4.289 ± 1.511
0.613LysTrp: 0.613 ± 0.569
2.451LysTyr: 2.451 ± 1.173
0.0LysXaa: 0.0 ± 0.0
Leu
1.838LeuAla: 1.838 ± 0.92
2.451LeuCys: 2.451 ± 1.332
5.515LeuAsp: 5.515 ± 1.044
4.902LeuGlu: 4.902 ± 1.524
1.838LeuPhe: 1.838 ± 1.063
6.127LeuGly: 6.127 ± 1.517
1.838LeuHis: 1.838 ± 1.048
4.289LeuIle: 4.289 ± 1.968
3.064LeuLys: 3.064 ± 1.148
6.127LeuLeu: 6.127 ± 2.374
1.225LeuMet: 1.225 ± 0.728
5.515LeuAsn: 5.515 ± 1.414
1.838LeuPro: 1.838 ± 0.922
4.289LeuGln: 4.289 ± 1.524
3.676LeuArg: 3.676 ± 1.16
4.289LeuSer: 4.289 ± 1.608
3.676LeuThr: 3.676 ± 0.942
3.064LeuVal: 3.064 ± 1.126
0.613LeuTrp: 0.613 ± 0.533
3.676LeuTyr: 3.676 ± 1.35
0.0LeuXaa: 0.0 ± 0.0
Met
0.613MetAla: 0.613 ± 0.58
0.0MetCys: 0.0 ± 0.0
1.838MetAsp: 1.838 ± 1.083
0.0MetGlu: 0.0 ± 0.0
2.451MetPhe: 2.451 ± 1.831
1.225MetGly: 1.225 ± 0.666
0.613MetHis: 0.613 ± 0.619
0.0MetIle: 0.0 ± 0.0
1.225MetLys: 1.225 ± 0.68
1.838MetLeu: 1.838 ± 0.92
0.0MetMet: 0.0 ± 0.0
0.613MetAsn: 0.613 ± 0.569
1.225MetPro: 1.225 ± 0.728
0.613MetGln: 0.613 ± 0.619
1.225MetArg: 1.225 ± 1.014
1.838MetSer: 1.838 ± 0.941
0.613MetThr: 0.613 ± 0.569
0.0MetVal: 0.0 ± 0.0
0.613MetTrp: 0.613 ± 0.745
1.838MetTyr: 1.838 ± 1.121
0.0MetXaa: 0.0 ± 0.0
Asn
3.676AsnAla: 3.676 ± 1.241
3.064AsnCys: 3.064 ± 1.335
1.838AsnAsp: 1.838 ± 1.044
2.451AsnGlu: 2.451 ± 0.812
3.064AsnPhe: 3.064 ± 1.68
1.838AsnGly: 1.838 ± 1.262
4.289AsnHis: 4.289 ± 2.075
4.289AsnIle: 4.289 ± 1.28
3.064AsnLys: 3.064 ± 1.275
2.451AsnLeu: 2.451 ± 1.188
1.838AsnMet: 1.838 ± 1.092
2.451AsnAsn: 2.451 ± 0.764
4.289AsnPro: 4.289 ± 1.345
3.064AsnGln: 3.064 ± 0.592
3.064AsnArg: 3.064 ± 1.206
4.902AsnSer: 4.902 ± 1.718
4.289AsnThr: 4.289 ± 1.611
5.515AsnVal: 5.515 ± 1.506
0.613AsnTrp: 0.613 ± 0.533
3.676AsnTyr: 3.676 ± 1.661
0.0AsnXaa: 0.0 ± 0.0
Pro
2.451ProAla: 2.451 ± 0.735
1.838ProCys: 1.838 ± 0.933
0.613ProAsp: 0.613 ± 0.745
0.613ProGlu: 0.613 ± 0.569
3.064ProPhe: 3.064 ± 1.139
3.676ProGly: 3.676 ± 1.215
2.451ProHis: 2.451 ± 1.54
5.515ProIle: 5.515 ± 1.26
1.838ProLys: 1.838 ± 0.876
4.289ProLeu: 4.289 ± 1.987
2.451ProMet: 2.451 ± 1.14
4.289ProAsn: 4.289 ± 1.373
1.225ProPro: 1.225 ± 0.817
4.289ProGln: 4.289 ± 1.929
3.064ProArg: 3.064 ± 0.74
3.064ProSer: 3.064 ± 1.499
4.902ProThr: 4.902 ± 2.446
2.451ProVal: 2.451 ± 0.708
1.838ProTrp: 1.838 ± 0.69
3.064ProTyr: 3.064 ± 1.153
0.0ProXaa: 0.0 ± 0.0
Gln
5.515GlnAla: 5.515 ± 2.188
0.0GlnCys: 0.0 ± 0.0
3.676GlnAsp: 3.676 ± 0.896
3.064GlnGlu: 3.064 ± 0.646
1.838GlnPhe: 1.838 ± 0.681
1.838GlnGly: 1.838 ± 0.773
1.225GlnHis: 1.225 ± 0.945
3.676GlnIle: 3.676 ± 1.406
1.838GlnLys: 1.838 ± 1.517
1.838GlnLeu: 1.838 ± 0.781
1.225GlnMet: 1.225 ± 0.945
3.064GlnAsn: 3.064 ± 1.195
4.902GlnPro: 4.902 ± 2.55
3.064GlnGln: 3.064 ± 1.507
4.902GlnArg: 4.902 ± 1.763
4.902GlnSer: 4.902 ± 1.555
3.676GlnThr: 3.676 ± 1.2
3.676GlnVal: 3.676 ± 1.91
0.0GlnTrp: 0.0 ± 0.0
0.613GlnTyr: 0.613 ± 0.569
0.0GlnXaa: 0.0 ± 0.0
Arg
3.676ArgAla: 3.676 ± 1.119
2.451ArgCys: 2.451 ± 0.863
4.289ArgAsp: 4.289 ± 2.371
1.838ArgGlu: 1.838 ± 1.092
4.289ArgPhe: 4.289 ± 1.372
4.289ArgGly: 4.289 ± 1.622
2.451ArgHis: 2.451 ± 0.804
2.451ArgIle: 2.451 ± 1.208
3.676ArgLys: 3.676 ± 1.755
6.127ArgLeu: 6.127 ± 3.126
1.838ArgMet: 1.838 ± 0.876
1.225ArgAsn: 1.225 ± 0.7
5.515ArgPro: 5.515 ± 1.535
2.451ArgGln: 2.451 ± 0.783
9.191ArgArg: 9.191 ± 3.682
9.804ArgSer: 9.804 ± 2.0
1.838ArgThr: 1.838 ± 0.872
3.064ArgVal: 3.064 ± 1.583
0.0ArgTrp: 0.0 ± 0.0
2.451ArgTyr: 2.451 ± 1.08
0.0ArgXaa: 0.0 ± 0.0
Ser
4.289SerAla: 4.289 ± 1.63
0.0SerCys: 0.0 ± 0.0
1.838SerAsp: 1.838 ± 0.64
2.451SerGlu: 2.451 ± 1.042
2.451SerPhe: 2.451 ± 1.774
3.064SerGly: 3.064 ± 1.177
3.064SerHis: 3.064 ± 1.942
6.127SerIle: 6.127 ± 1.407
4.289SerLys: 4.289 ± 1.558
4.902SerLeu: 4.902 ± 1.488
1.225SerMet: 1.225 ± 0.815
5.515SerAsn: 5.515 ± 1.044
7.353SerPro: 7.353 ± 1.42
7.966SerGln: 7.966 ± 2.756
6.74SerArg: 6.74 ± 1.656
9.191SerSer: 9.191 ± 2.53
6.74SerThr: 6.74 ± 2.149
4.289SerVal: 4.289 ± 1.922
0.613SerTrp: 0.613 ± 0.533
3.676SerTyr: 3.676 ± 1.091
0.0SerXaa: 0.0 ± 0.0
Thr
4.902ThrAla: 4.902 ± 2.292
0.613ThrCys: 0.613 ± 0.507
2.451ThrAsp: 2.451 ± 1.021
3.064ThrGlu: 3.064 ± 1.051
4.289ThrPhe: 4.289 ± 1.357
4.902ThrGly: 4.902 ± 1.166
1.838ThrHis: 1.838 ± 1.3
4.289ThrIle: 4.289 ± 1.067
2.451ThrLys: 2.451 ± 0.764
4.289ThrLeu: 4.289 ± 1.044
0.0ThrMet: 0.0 ± 0.0
4.289ThrAsn: 4.289 ± 1.627
1.838ThrPro: 1.838 ± 0.64
2.451ThrGln: 2.451 ± 0.701
3.064ThrArg: 3.064 ± 1.274
4.289ThrSer: 4.289 ± 1.463
3.676ThrThr: 3.676 ± 1.628
1.838ThrVal: 1.838 ± 1.117
2.451ThrTrp: 2.451 ± 1.418
2.451ThrTyr: 2.451 ± 0.821
0.0ThrXaa: 0.0 ± 0.0
Val
0.613ValAla: 0.613 ± 0.58
0.613ValCys: 0.613 ± 0.533
1.838ValAsp: 1.838 ± 0.72
3.064ValGlu: 3.064 ± 2.006
1.838ValPhe: 1.838 ± 1.166
2.451ValGly: 2.451 ± 1.078
1.838ValHis: 1.838 ± 1.268
3.064ValIle: 3.064 ± 1.383
4.902ValLys: 4.902 ± 2.006
1.225ValLeu: 1.225 ± 0.736
1.225ValMet: 1.225 ± 0.674
3.676ValAsn: 3.676 ± 1.511
3.676ValPro: 3.676 ± 1.029
4.289ValGln: 4.289 ± 2.274
3.676ValArg: 3.676 ± 1.819
3.676ValSer: 3.676 ± 0.939
1.225ValThr: 1.225 ± 1.16
3.064ValVal: 3.064 ± 1.696
3.064ValTrp: 3.064 ± 1.028
3.064ValTyr: 3.064 ± 1.179
0.0ValXaa: 0.0 ± 0.0
Trp
3.676TrpAla: 3.676 ± 1.328
0.0TrpCys: 0.0 ± 0.0
0.613TrpAsp: 0.613 ± 0.745
0.613TrpGlu: 0.613 ± 0.507
0.613TrpPhe: 0.613 ± 0.659
0.613TrpGly: 0.613 ± 0.533
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.613TrpMet: 0.613 ± 0.58
0.613TrpAsn: 0.613 ± 0.659
0.613TrpPro: 0.613 ± 0.569
1.225TrpGln: 1.225 ± 0.682
0.613TrpArg: 0.613 ± 0.619
1.225TrpSer: 1.225 ± 0.817
1.838TrpThr: 1.838 ± 0.713
0.613TrpVal: 0.613 ± 0.569
0.0TrpTrp: 0.0 ± 0.0
0.613TrpTyr: 0.613 ± 0.533
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.451TyrAla: 2.451 ± 1.348
0.613TyrCys: 0.613 ± 0.745
1.225TyrAsp: 1.225 ± 0.674
2.451TyrGlu: 2.451 ± 1.287
2.451TyrPhe: 2.451 ± 0.657
1.838TyrGly: 1.838 ± 0.619
1.225TyrHis: 1.225 ± 0.687
2.451TyrIle: 2.451 ± 0.522
2.451TyrLys: 2.451 ± 1.0
3.676TyrLeu: 3.676 ± 1.503
1.838TyrMet: 1.838 ± 0.927
3.064TyrAsn: 3.064 ± 1.195
0.613TyrPro: 0.613 ± 0.533
1.838TyrGln: 1.838 ± 1.044
5.515TyrArg: 5.515 ± 2.1
1.838TyrSer: 1.838 ± 1.119
0.0TyrThr: 0.0 ± 0.0
3.064TyrVal: 3.064 ± 1.666
0.0TyrTrp: 0.0 ± 0.0
1.225TyrTyr: 1.225 ± 0.687
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1633 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski