Amino acid dipeptide frequency for Brassica yellows virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide should be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
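The comparison against the uniform expectation can be sketched as follows. This is a minimal illustration, not the database's own code: the function name `classify` and the plain greater-than/less-than rule are assumptions, since the page does not state the exact threshold used for the red and blue marking.

```python
# Uniform expectation for 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5

def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation.

    Hypothetical helper: the simple comparison is an assumption,
    not the documented colouring rule.
    """
    if value_per_mille > expected:
        return "over"      # would be marked red
    if value_per_mille < expected:
        return "under"     # would be marked blue
    return "expected"

# Values taken from the table: SerSer = 13.519, CysCys = 0.0
print(classify(13.519))  # over
print(classify(0.0))     # under
```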

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
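As a rough sketch of how such per mille values could be obtained, the snippet below counts overlapping residue pairs in a list of one-letter protein sequences. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and counting pairs only within each protein is an assumption; the page does not say how protein boundaries are handled.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences.

    Assumption: pairs are counted within each protein only, never
    across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real Brassica yellows virus sequences.
freqs = dipeptide_per_mille(["MKKA", "AKK"])
fraction_kk = freqs["KK"] * 1e-3  # per mille -> fraction
```

Here the two toy sequences contain five dipeptides in total, two of which are `KK`, so `freqs["KK"]` is 400‰, that is a fraction of 0.4.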

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.131AlaAla: 4.131 ± 1.705
1.127AlaCys: 1.127 ± 0.478
3.004AlaAsp: 3.004 ± 0.659
5.633AlaGlu: 5.633 ± 0.529
1.878AlaPhe: 1.878 ± 0.658
5.257AlaGly: 5.257 ± 1.485
0.376AlaHis: 0.376 ± 0.378
3.004AlaIle: 3.004 ± 0.821
2.629AlaLys: 2.629 ± 0.745
6.759AlaLeu: 6.759 ± 2.017
3.004AlaMet: 3.004 ± 0.601
2.629AlaAsn: 2.629 ± 0.78
3.755AlaPro: 3.755 ± 1.392
2.253AlaGln: 2.253 ± 0.824
3.755AlaArg: 3.755 ± 0.954
8.261AlaSer: 8.261 ± 1.767
1.878AlaThr: 1.878 ± 0.759
4.882AlaVal: 4.882 ± 1.187
1.127AlaTrp: 1.127 ± 0.495
1.878AlaTyr: 1.878 ± 0.974
0.0AlaXaa: 0.0 ± 0.0
Cys
0.376CysAla: 0.376 ± 0.378
0.0CysCys: 0.0 ± 0.0
0.376CysAsp: 0.376 ± 0.273
0.376CysGlu: 0.376 ± 0.273
0.751CysPhe: 0.751 ± 0.374
1.127CysGly: 1.127 ± 0.478
0.376CysHis: 0.376 ± 0.417
0.0CysIle: 0.0 ± 0.0
1.878CysLys: 1.878 ± 0.659
3.38CysLeu: 3.38 ± 1.1
0.0CysMet: 0.0 ± 0.0
0.376CysAsn: 0.376 ± 0.417
1.502CysPro: 1.502 ± 0.703
1.502CysGln: 1.502 ± 0.548
0.376CysArg: 0.376 ± 0.378
1.502CysSer: 1.502 ± 0.714
0.0CysThr: 0.0 ± 0.0
1.878CysVal: 1.878 ± 0.759
0.376CysTrp: 0.376 ± 0.273
0.376CysTyr: 0.376 ± 0.378
0.0CysXaa: 0.0 ± 0.0
Asp
4.131AspAla: 4.131 ± 0.98
1.878AspCys: 1.878 ± 0.545
5.257AspAsp: 5.257 ± 1.417
3.004AspGlu: 3.004 ± 1.192
3.755AspPhe: 3.755 ± 0.933
5.257AspGly: 5.257 ± 1.2
1.878AspHis: 1.878 ± 0.943
1.502AspIle: 1.502 ± 0.445
1.878AspLys: 1.878 ± 0.583
3.38AspLeu: 3.38 ± 0.928
2.629AspMet: 2.629 ± 0.664
1.502AspAsn: 1.502 ± 1.04
3.004AspPro: 3.004 ± 1.452
1.502AspGln: 1.502 ± 0.568
1.127AspArg: 1.127 ± 0.816
0.751AspSer: 0.751 ± 0.834
1.127AspThr: 1.127 ± 0.816
2.629AspVal: 2.629 ± 0.651
1.127AspTrp: 1.127 ± 0.819
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
1.502GluAla: 1.502 ± 0.568
0.376GluCys: 0.376 ± 0.378
5.257GluAsp: 5.257 ± 0.44
4.882GluGlu: 4.882 ± 1.791
2.629GluPhe: 2.629 ± 0.806
3.38GluGly: 3.38 ± 0.986
0.751GluHis: 0.751 ± 0.374
4.506GluIle: 4.506 ± 0.923
4.882GluLys: 4.882 ± 1.185
4.131GluLeu: 4.131 ± 1.24
2.629GluMet: 2.629 ± 0.682
3.004GluAsn: 3.004 ± 0.843
2.253GluPro: 2.253 ± 0.503
3.004GluGln: 3.004 ± 1.447
3.38GluArg: 3.38 ± 1.412
3.755GluSer: 3.755 ± 1.213
3.38GluThr: 3.38 ± 0.622
4.131GluVal: 4.131 ± 1.373
0.751GluTrp: 0.751 ± 0.639
1.502GluTyr: 1.502 ± 1.147
0.0GluXaa: 0.0 ± 0.0
Phe
1.502PheAla: 1.502 ± 0.621
1.878PheCys: 1.878 ± 0.759
1.127PheAsp: 1.127 ± 0.534
2.629PheGlu: 2.629 ± 0.763
1.127PhePhe: 1.127 ± 0.329
3.004PheGly: 3.004 ± 1.284
1.878PheHis: 1.878 ± 0.899
1.502PheIle: 1.502 ± 0.862
3.755PheLys: 3.755 ± 1.094
6.759PheLeu: 6.759 ± 1.385
1.127PheMet: 1.127 ± 0.478
1.127PheAsn: 1.127 ± 0.329
1.127PhePro: 1.127 ± 0.889
1.502PheGln: 1.502 ± 0.445
1.878PheArg: 1.878 ± 0.974
7.886PheSer: 7.886 ± 1.109
2.253PheThr: 2.253 ± 0.469
3.004PheVal: 3.004 ± 1.724
0.751PheTrp: 0.751 ± 0.698
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.004GlyAla: 3.004 ± 1.012
1.127GlyCys: 1.127 ± 0.379
2.253GlyAsp: 2.253 ± 0.67
4.131GlyGlu: 4.131 ± 0.844
3.755GlyPhe: 3.755 ± 1.188
2.629GlyGly: 2.629 ± 0.695
1.502GlyHis: 1.502 ± 0.684
2.629GlyIle: 2.629 ± 1.523
4.506GlyLys: 4.506 ± 1.05
6.384GlyLeu: 6.384 ± 1.622
0.751GlyMet: 0.751 ± 0.712
2.253GlyAsn: 2.253 ± 1.038
2.629GlyPro: 2.629 ± 0.467
1.502GlyGln: 1.502 ± 1.093
4.882GlyArg: 4.882 ± 1.963
10.139GlySer: 10.139 ± 2.746
4.131GlyThr: 4.131 ± 0.98
2.253GlyVal: 2.253 ± 0.578
1.502GlyTrp: 1.502 ± 0.714
3.38GlyTyr: 3.38 ± 0.714
0.0GlyXaa: 0.0 ± 0.0
His
2.253HisAla: 2.253 ± 0.87
1.127HisCys: 1.127 ± 0.603
1.502HisAsp: 1.502 ± 0.862
0.376HisGlu: 0.376 ± 0.545
1.127HisPhe: 1.127 ± 0.889
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.376HisIle: 0.376 ± 0.273
1.127HisLys: 1.127 ± 0.768
0.751HisLeu: 0.751 ± 0.511
0.0HisMet: 0.0 ± 0.0
1.127HisAsn: 1.127 ± 0.954
2.629HisPro: 2.629 ± 1.165
0.0HisGln: 0.0 ± 0.0
1.878HisArg: 1.878 ± 0.453
3.004HisSer: 3.004 ± 1.09
1.502HisThr: 1.502 ± 0.861
2.253HisVal: 2.253 ± 0.82
0.376HisTrp: 0.376 ± 0.337
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.506IleAla: 4.506 ± 0.84
0.0IleCys: 0.0 ± 0.0
1.878IleAsp: 1.878 ± 0.633
1.878IleGlu: 1.878 ± 0.742
3.38IlePhe: 3.38 ± 1.1
0.751IleGly: 0.751 ± 0.546
0.751IleHis: 0.751 ± 0.511
1.502IleIle: 1.502 ± 0.611
1.502IleLys: 1.502 ± 0.695
1.878IleLeu: 1.878 ± 1.848
0.376IleMet: 0.376 ± 0.273
2.629IleAsn: 2.629 ± 1.871
6.008IlePro: 6.008 ± 1.144
1.127IleGln: 1.127 ± 0.603
1.878IleArg: 1.878 ± 1.012
7.135IleSer: 7.135 ± 0.978
3.755IleThr: 3.755 ± 1.299
1.502IleVal: 1.502 ± 0.387
0.751IleTrp: 0.751 ± 0.311
1.502IleTyr: 1.502 ± 0.445
0.0IleXaa: 0.0 ± 0.0
Lys
5.257LysAla: 5.257 ± 1.257
2.253LysCys: 2.253 ± 0.723
4.506LysAsp: 4.506 ± 0.777
3.004LysGlu: 3.004 ± 0.598
1.878LysPhe: 1.878 ± 0.545
2.253LysGly: 2.253 ± 0.603
0.0LysHis: 0.0 ± 0.0
2.629LysIle: 2.629 ± 1.092
2.629LysLys: 2.629 ± 1.042
3.38LysLeu: 3.38 ± 0.939
2.253LysMet: 2.253 ± 0.527
0.751LysAsn: 0.751 ± 0.464
2.253LysPro: 2.253 ± 1.14
3.38LysGln: 3.38 ± 0.683
5.633LysArg: 5.633 ± 0.854
6.759LysSer: 6.759 ± 1.274
6.759LysThr: 6.759 ± 1.237
1.502LysVal: 1.502 ± 0.421
0.0LysTrp: 0.0 ± 0.0
1.127LysTyr: 1.127 ± 1.252
0.0LysXaa: 0.0 ± 0.0
Leu
9.388LeuAla: 9.388 ± 1.762
1.878LeuCys: 1.878 ± 0.633
2.629LeuAsp: 2.629 ± 0.942
5.633LeuGlu: 5.633 ± 0.81
4.131LeuPhe: 4.131 ± 1.222
4.882LeuGly: 4.882 ± 0.883
1.502LeuHis: 1.502 ± 0.861
3.38LeuIle: 3.38 ± 1.308
3.755LeuLys: 3.755 ± 1.18
10.139LeuLeu: 10.139 ± 2.597
3.38LeuMet: 3.38 ± 0.7
3.004LeuAsn: 3.004 ± 0.764
5.257LeuPro: 5.257 ± 1.285
3.755LeuGln: 3.755 ± 0.634
6.008LeuArg: 6.008 ± 1.569
5.257LeuSer: 5.257 ± 0.808
6.008LeuThr: 6.008 ± 1.195
3.755LeuVal: 3.755 ± 0.75
3.004LeuTrp: 3.004 ± 0.922
5.257LeuTyr: 5.257 ± 1.514
0.0LeuXaa: 0.0 ± 0.0
Met
1.502MetAla: 1.502 ± 0.714
0.0MetCys: 0.0 ± 0.0
0.751MetAsp: 0.751 ± 0.834
2.629MetGlu: 2.629 ± 1.498
0.751MetPhe: 0.751 ± 0.583
0.376MetGly: 0.376 ± 0.273
0.0MetHis: 0.0 ± 0.0
1.502MetIle: 1.502 ± 0.756
1.502MetLys: 1.502 ± 0.902
3.004MetLeu: 3.004 ± 1.23
0.376MetMet: 0.376 ± 0.337
1.502MetAsn: 1.502 ± 0.467
0.0MetPro: 0.0 ± 0.0
0.376MetGln: 0.376 ± 0.378
0.0MetArg: 0.0 ± 0.0
2.253MetSer: 2.253 ± 0.679
1.127MetThr: 1.127 ± 0.509
3.38MetVal: 3.38 ± 0.815
0.0MetTrp: 0.0 ± 0.0
0.376MetTyr: 0.376 ± 0.417
0.0MetXaa: 0.0 ± 0.0
Asn
2.629AsnAla: 2.629 ± 0.489
0.751AsnCys: 0.751 ± 0.834
1.127AsnAsp: 1.127 ± 0.329
1.127AsnGlu: 1.127 ± 0.534
3.004AsnPhe: 3.004 ± 0.889
4.882AsnGly: 4.882 ± 2.099
1.127AsnHis: 1.127 ± 0.587
1.878AsnIle: 1.878 ± 0.579
2.253AsnLys: 2.253 ± 0.83
5.257AsnLeu: 5.257 ± 0.872
0.376AsnMet: 0.376 ± 0.338
2.629AsnAsn: 2.629 ± 0.636
1.502AsnPro: 1.502 ± 0.815
2.253AsnGln: 2.253 ± 0.603
0.751AsnArg: 0.751 ± 0.52
4.506AsnSer: 4.506 ± 1.786
1.127AsnThr: 1.127 ± 0.519
3.38AsnVal: 3.38 ± 1.07
2.253AsnTrp: 2.253 ± 0.578
1.502AsnTyr: 1.502 ± 0.387
0.0AsnXaa: 0.0 ± 0.0
Pro
3.38ProAla: 3.38 ± 0.529
0.751ProCys: 0.751 ± 0.546
1.502ProAsp: 1.502 ± 1.17
4.131ProGlu: 4.131 ± 1.766
0.0ProPhe: 0.0 ± 0.0
4.882ProGly: 4.882 ± 0.676
2.253ProHis: 2.253 ± 1.274
3.004ProIle: 3.004 ± 0.843
5.257ProLys: 5.257 ± 1.427
2.629ProLeu: 2.629 ± 1.024
0.376ProMet: 0.376 ± 0.417
2.253ProAsn: 2.253 ± 0.739
5.257ProPro: 5.257 ± 1.489
4.131ProGln: 4.131 ± 0.987
4.131ProArg: 4.131 ± 1.639
4.131ProSer: 4.131 ± 0.851
3.004ProThr: 3.004 ± 0.62
4.131ProVal: 4.131 ± 0.662
0.376ProTrp: 0.376 ± 0.417
1.878ProTyr: 1.878 ± 0.759
0.0ProXaa: 0.0 ± 0.0
Gln
4.131GlnAla: 4.131 ± 0.633
0.0GlnCys: 0.0 ± 0.0
1.127GlnAsp: 1.127 ± 0.768
1.878GlnGlu: 1.878 ± 0.631
1.878GlnPhe: 1.878 ± 1.626
1.878GlnGly: 1.878 ± 0.818
0.376GlnHis: 0.376 ± 0.559
0.376GlnIle: 0.376 ± 0.337
3.755GlnLys: 3.755 ± 1.419
1.878GlnLeu: 1.878 ± 1.012
0.0GlnMet: 0.0 ± 0.0
3.755GlnAsn: 3.755 ± 0.593
1.878GlnPro: 1.878 ± 0.993
0.376GlnGln: 0.376 ± 0.417
3.755GlnArg: 3.755 ± 1.721
3.755GlnSer: 3.755 ± 1.187
2.629GlnThr: 2.629 ± 0.661
1.878GlnVal: 1.878 ± 0.642
0.751GlnTrp: 0.751 ± 1.118
0.751GlnTyr: 0.751 ± 0.397
0.0GlnXaa: 0.0 ± 0.0
Arg
2.629ArgAla: 2.629 ± 0.763
0.376ArgCys: 0.376 ± 0.273
3.38ArgAsp: 3.38 ± 0.913
4.131ArgGlu: 4.131 ± 1.007
1.878ArgPhe: 1.878 ± 1.047
4.131ArgGly: 4.131 ± 0.802
1.502ArgHis: 1.502 ± 1.021
3.755ArgIle: 3.755 ± 1.549
3.004ArgLys: 3.004 ± 1.529
7.886ArgLeu: 7.886 ± 1.836
0.376ArgMet: 0.376 ± 0.624
2.629ArgAsn: 2.629 ± 0.806
3.755ArgPro: 3.755 ± 1.247
2.253ArgGln: 2.253 ± 0.812
8.261ArgArg: 8.261 ± 5.02
4.131ArgSer: 4.131 ± 0.798
2.253ArgThr: 2.253 ± 2.236
2.253ArgVal: 2.253 ± 0.879
1.502ArgTrp: 1.502 ± 0.756
1.502ArgTyr: 1.502 ± 0.387
0.0ArgXaa: 0.0 ± 0.0
Ser
5.633SerAla: 5.633 ± 1.732
1.502SerCys: 1.502 ± 0.621
5.633SerAsp: 5.633 ± 1.006
4.506SerGlu: 4.506 ± 1.082
7.886SerPhe: 7.886 ± 1.041
9.012SerGly: 9.012 ± 2.091
1.127SerHis: 1.127 ± 0.495
4.882SerIle: 4.882 ± 1.405
5.257SerLys: 5.257 ± 1.014
11.265SerLeu: 11.265 ± 0.912
0.376SerMet: 0.376 ± 0.545
3.38SerAsn: 3.38 ± 0.529
6.008SerPro: 6.008 ± 1.684
4.131SerGln: 4.131 ± 2.168
6.759SerArg: 6.759 ± 1.485
13.519SerSer: 13.519 ± 3.567
5.633SerThr: 5.633 ± 1.369
3.38SerVal: 3.38 ± 0.972
1.127SerTrp: 1.127 ± 0.603
1.502SerTyr: 1.502 ± 0.387
0.0SerXaa: 0.0 ± 0.0
Thr
4.506ThrAla: 4.506 ± 0.786
1.502ThrCys: 1.502 ± 0.621
3.004ThrAsp: 3.004 ± 1.097
1.878ThrGlu: 1.878 ± 0.375
1.878ThrPhe: 1.878 ± 1.611
3.755ThrGly: 3.755 ± 1.167
2.629ThrHis: 2.629 ± 0.821
4.506ThrIle: 4.506 ± 0.782
1.127ThrLys: 1.127 ± 0.954
3.755ThrLeu: 3.755 ± 0.905
1.502ThrMet: 1.502 ± 0.727
3.755ThrAsn: 3.755 ± 1.008
4.131ThrPro: 4.131 ± 0.861
0.751ThrGln: 0.751 ± 0.639
3.755ThrArg: 3.755 ± 0.806
5.257ThrSer: 5.257 ± 0.85
5.633ThrThr: 5.633 ± 1.152
2.629ThrVal: 2.629 ± 1.132
1.502ThrTrp: 1.502 ± 0.836
1.502ThrTyr: 1.502 ± 0.621
0.0ThrXaa: 0.0 ± 0.0
Val
3.004ValAla: 3.004 ± 0.882
0.0ValCys: 0.0 ± 0.0
2.253ValAsp: 2.253 ± 1.018
4.506ValGlu: 4.506 ± 0.915
2.629ValPhe: 2.629 ± 0.467
4.131ValGly: 4.131 ± 1.353
1.502ValHis: 1.502 ± 0.691
2.629ValIle: 2.629 ± 0.872
3.004ValLys: 3.004 ± 0.632
6.008ValLeu: 6.008 ± 1.887
1.127ValMet: 1.127 ± 0.653
1.878ValAsn: 1.878 ± 0.642
4.131ValPro: 4.131 ± 1.289
1.502ValGln: 1.502 ± 0.77
2.629ValArg: 2.629 ± 1.308
4.131ValSer: 4.131 ± 0.856
3.38ValThr: 3.38 ± 0.587
2.629ValVal: 2.629 ± 2.028
0.376ValTrp: 0.376 ± 0.417
1.878ValTyr: 1.878 ± 0.515
0.0ValXaa: 0.0 ± 0.0
Trp
1.502TrpAla: 1.502 ± 0.387
0.0TrpCys: 0.0 ± 0.0
0.376TrpAsp: 0.376 ± 0.417
1.878TrpGlu: 1.878 ± 0.583
0.376TrpPhe: 0.376 ± 0.378
1.878TrpGly: 1.878 ± 0.759
0.376TrpHis: 0.376 ± 0.545
0.751TrpIle: 0.751 ± 0.311
0.751TrpLys: 0.751 ± 0.397
2.253TrpLeu: 2.253 ± 0.85
0.376TrpMet: 0.376 ± 0.273
0.751TrpAsn: 0.751 ± 0.397
0.376TrpPro: 0.376 ± 0.273
0.0TrpGln: 0.0 ± 0.0
0.751TrpArg: 0.751 ± 0.374
2.629TrpSer: 2.629 ± 1.481
1.502TrpThr: 1.502 ± 1.126
0.751TrpVal: 0.751 ± 0.311
0.751TrpTrp: 0.751 ± 0.311
0.376TrpTyr: 0.376 ± 0.417
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.253TyrAla: 2.253 ± 0.456
0.0TyrCys: 0.0 ± 0.0
0.376TyrAsp: 0.376 ± 0.273
1.878TyrGlu: 1.878 ± 0.717
1.127TyrPhe: 1.127 ± 0.478
1.878TyrGly: 1.878 ± 1.225
1.502TyrHis: 1.502 ± 0.568
0.376TyrIle: 0.376 ± 0.273
3.755TyrLys: 3.755 ± 0.784
1.127TyrLeu: 1.127 ± 0.478
0.0TyrMet: 0.0 ± 0.0
3.38TyrAsn: 3.38 ± 1.009
0.0TyrPro: 0.0 ± 0.0
1.502TyrGln: 1.502 ± 0.838
0.376TyrArg: 0.376 ± 0.273
3.755TyrSer: 3.755 ± 0.656
1.878TyrThr: 1.878 ± 0.58
1.127TyrVal: 1.127 ± 0.329
0.0TyrTrp: 0.0 ± 0.0
0.376TyrTyr: 0.376 ± 0.273
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2664 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
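A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates serves as the error. The function names, the fixed seed, and the use of the standard deviation as the reported error are all assumptions; the page only states that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (in per mille) over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the error is the standard deviation across replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)

# Toy sequences, not the real proteome.
err = bootstrap_error(["MKKA", "AKKS", "SSLL"], "KK")
```

With only a handful of proteins, as in this proteome, each replicate can differ substantially from the others, which is consistent with the relatively large errors in the table.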

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski