Amino acid dipepetide frequency for Botryosphaeria dothidea virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.634AlaAla: 18.634 ± 3.69
1.165AlaCys: 1.165 ± 1.047
6.988AlaAsp: 6.988 ± 0.625
6.211AlaGlu: 6.211 ± 1.082
5.435AlaPhe: 5.435 ± 1.362
8.54AlaGly: 8.54 ± 1.239
2.717AlaHis: 2.717 ± 0.832
2.717AlaIle: 2.717 ± 1.129
4.658AlaLys: 4.658 ± 1.439
11.646AlaLeu: 11.646 ± 1.975
4.27AlaMet: 4.27 ± 0.757
2.717AlaAsn: 2.717 ± 1.011
5.435AlaPro: 5.435 ± 1.249
5.047AlaGln: 5.047 ± 1.542
9.317AlaArg: 9.317 ± 1.518
7.376AlaSer: 7.376 ± 2.323
6.988AlaThr: 6.988 ± 1.136
10.093AlaVal: 10.093 ± 1.426
1.165AlaTrp: 1.165 ± 0.702
2.717AlaTyr: 2.717 ± 1.037
0.388AlaXaa: 0.388 ± 0.314
Cys
2.717CysAla: 2.717 ± 1.022
0.0CysCys: 0.0 ± 0.0
0.388CysAsp: 0.388 ± 0.349
0.776CysGlu: 0.776 ± 0.334
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.388CysIle: 0.388 ± 0.349
0.388CysLys: 0.388 ± 0.349
0.776CysLeu: 0.776 ± 0.548
0.776CysMet: 0.776 ± 0.387
0.388CysAsn: 0.388 ± 0.349
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.388CysArg: 0.388 ± 0.349
1.165CysSer: 1.165 ± 0.468
0.776CysThr: 0.776 ± 0.387
0.388CysVal: 0.388 ± 0.349
0.0CysTrp: 0.0 ± 0.0
0.388CysTyr: 0.388 ± 0.35
0.0CysXaa: 0.0 ± 0.0
Asp
9.317AspAla: 9.317 ± 1.785
0.776AspCys: 0.776 ± 0.548
4.658AspAsp: 4.658 ± 1.162
2.717AspGlu: 2.717 ± 0.446
1.941AspPhe: 1.941 ± 0.712
6.211AspGly: 6.211 ± 2.553
1.553AspHis: 1.553 ± 0.578
1.553AspIle: 1.553 ± 1.042
1.165AspLys: 1.165 ± 0.767
5.823AspLeu: 5.823 ± 1.497
1.165AspMet: 1.165 ± 0.606
1.165AspAsn: 1.165 ± 0.299
6.599AspPro: 6.599 ± 1.078
0.0AspGln: 0.0 ± 0.0
5.435AspArg: 5.435 ± 1.757
3.106AspSer: 3.106 ± 0.992
3.882AspThr: 3.882 ± 0.878
5.435AspVal: 5.435 ± 2.414
0.0AspTrp: 0.0 ± 0.0
1.165AspTyr: 1.165 ± 1.047
0.0AspXaa: 0.0 ± 0.0
Glu
5.823GluAla: 5.823 ± 1.572
0.0GluCys: 0.0 ± 0.0
3.106GluAsp: 3.106 ± 1.42
3.106GluGlu: 3.106 ± 0.691
4.27GluPhe: 4.27 ± 1.123
3.106GluGly: 3.106 ± 0.935
1.553GluHis: 1.553 ± 0.385
3.882GluIle: 3.882 ± 2.221
1.165GluLys: 1.165 ± 0.612
3.882GluLeu: 3.882 ± 1.088
1.165GluMet: 1.165 ± 0.639
1.553GluAsn: 1.553 ± 1.003
2.329GluPro: 2.329 ± 0.618
1.553GluGln: 1.553 ± 1.122
4.658GluArg: 4.658 ± 1.18
2.717GluSer: 2.717 ± 1.077
1.553GluThr: 1.553 ± 0.453
2.329GluVal: 2.329 ± 0.923
1.553GluTrp: 1.553 ± 0.523
2.717GluTyr: 2.717 ± 0.975
0.0GluXaa: 0.0 ± 0.0
Phe
4.27PheAla: 4.27 ± 1.37
0.776PheCys: 0.776 ± 0.334
3.882PheAsp: 3.882 ± 0.65
1.553PheGlu: 1.553 ± 0.385
1.165PhePhe: 1.165 ± 0.798
2.329PheGly: 2.329 ± 1.198
0.776PheHis: 0.776 ± 0.414
0.776PheIle: 0.776 ± 0.548
1.165PheLys: 1.165 ± 0.549
3.106PheLeu: 3.106 ± 1.27
0.776PheMet: 0.776 ± 0.7
1.165PheAsn: 1.165 ± 0.612
0.776PhePro: 0.776 ± 0.564
0.776PheGln: 0.776 ± 0.334
1.165PheArg: 1.165 ± 0.667
1.553PheSer: 1.553 ± 0.827
0.776PheThr: 0.776 ± 0.521
4.27PheVal: 4.27 ± 1.055
0.388PheTrp: 0.388 ± 0.535
0.776PheTyr: 0.776 ± 0.521
0.0PheXaa: 0.0 ± 0.0
Gly
5.047GlyAla: 5.047 ± 1.115
0.388GlyCys: 0.388 ± 0.314
5.435GlyAsp: 5.435 ± 1.469
6.211GlyGlu: 6.211 ± 1.974
1.553GlyPhe: 1.553 ± 0.722
8.152GlyGly: 8.152 ± 2.771
2.717GlyHis: 2.717 ± 0.482
1.165GlyIle: 1.165 ± 0.761
0.776GlyLys: 0.776 ± 0.387
7.764GlyLeu: 7.764 ± 2.524
1.553GlyMet: 1.553 ± 0.523
1.553GlyAsn: 1.553 ± 0.453
6.988GlyPro: 6.988 ± 2.117
1.941GlyGln: 1.941 ± 1.006
5.823GlyArg: 5.823 ± 0.786
5.047GlySer: 5.047 ± 2.089
6.211GlyThr: 6.211 ± 1.358
8.54GlyVal: 8.54 ± 2.847
0.388GlyTrp: 0.388 ± 0.494
1.941GlyTyr: 1.941 ± 0.645
0.0GlyXaa: 0.0 ± 0.0
His
3.882HisAla: 3.882 ± 1.378
0.0HisCys: 0.0 ± 0.0
1.941HisAsp: 1.941 ± 1.033
0.388HisGlu: 0.388 ± 0.314
0.388HisPhe: 0.388 ± 0.349
2.329HisGly: 2.329 ± 1.011
0.388HisHis: 0.388 ± 0.349
1.165HisIle: 1.165 ± 0.606
0.388HisLys: 0.388 ± 0.314
1.165HisLeu: 1.165 ± 0.549
0.388HisMet: 0.388 ± 0.314
0.776HisAsn: 0.776 ± 0.548
1.941HisPro: 1.941 ± 0.776
0.0HisGln: 0.0 ± 0.0
1.941HisArg: 1.941 ± 0.516
0.0HisSer: 0.0 ± 0.0
1.553HisThr: 1.553 ± 0.811
3.106HisVal: 3.106 ± 1.082
0.0HisTrp: 0.0 ± 0.0
2.717HisTyr: 2.717 ± 1.11
0.0HisXaa: 0.0 ± 0.0
Ile
3.882IleAla: 3.882 ± 0.628
0.0IleCys: 0.0 ± 0.0
3.494IleAsp: 3.494 ± 0.906
2.717IleGlu: 2.717 ± 1.241
1.165IlePhe: 1.165 ± 0.523
1.553IleGly: 1.553 ± 0.667
0.388IleHis: 0.388 ± 0.535
1.165IleIle: 1.165 ± 0.531
1.165IleLys: 1.165 ± 0.531
3.494IleLeu: 3.494 ± 1.297
0.388IleMet: 0.388 ± 0.314
1.941IleAsn: 1.941 ± 0.974
1.941IlePro: 1.941 ± 1.102
1.165IleGln: 1.165 ± 1.047
1.553IleArg: 1.553 ± 1.096
2.329IleSer: 2.329 ± 1.077
1.941IleThr: 1.941 ± 0.958
2.717IleVal: 2.717 ± 0.841
0.0IleTrp: 0.0 ± 0.0
1.553IleTyr: 1.553 ± 0.931
0.0IleXaa: 0.0 ± 0.0
Lys
3.494LysAla: 3.494 ± 0.766
0.388LysCys: 0.388 ± 0.35
1.165LysAsp: 1.165 ± 0.761
1.553LysGlu: 1.553 ± 0.633
1.165LysPhe: 1.165 ± 0.556
1.165LysGly: 1.165 ± 0.702
0.388LysHis: 0.388 ± 0.349
1.165LysIle: 1.165 ± 0.606
1.941LysLys: 1.941 ± 1.324
3.882LysLeu: 3.882 ± 1.837
0.388LysMet: 0.388 ± 0.314
0.776LysAsn: 0.776 ± 0.387
1.941LysPro: 1.941 ± 0.544
1.165LysGln: 1.165 ± 0.468
1.941LysArg: 1.941 ± 0.884
0.776LysSer: 0.776 ± 0.387
1.553LysThr: 1.553 ± 0.633
1.553LysVal: 1.553 ± 0.724
0.388LysTrp: 0.388 ± 0.314
0.388LysTyr: 0.388 ± 0.35
0.0LysXaa: 0.0 ± 0.0
Leu
11.258LeuAla: 11.258 ± 1.807
1.165LeuCys: 1.165 ± 0.682
5.435LeuAsp: 5.435 ± 0.79
6.211LeuGlu: 6.211 ± 0.478
1.553LeuPhe: 1.553 ± 0.931
6.988LeuGly: 6.988 ± 2.372
1.941LeuHis: 1.941 ± 0.72
2.717LeuIle: 2.717 ± 1.44
1.941LeuLys: 1.941 ± 1.016
11.258LeuLeu: 11.258 ± 2.339
2.717LeuMet: 2.717 ± 0.941
2.329LeuAsn: 2.329 ± 1.171
4.658LeuPro: 4.658 ± 2.475
2.717LeuGln: 2.717 ± 1.118
5.047LeuArg: 5.047 ± 0.927
11.258LeuSer: 11.258 ± 2.407
5.823LeuThr: 5.823 ± 1.774
5.823LeuVal: 5.823 ± 0.665
0.776LeuTrp: 0.776 ± 0.521
1.553LeuTyr: 1.553 ± 0.574
0.0LeuXaa: 0.0 ± 0.0
Met
3.106MetAla: 3.106 ± 0.493
0.388MetCys: 0.388 ± 0.314
2.329MetAsp: 2.329 ± 0.787
1.165MetGlu: 1.165 ± 0.606
0.0MetPhe: 0.0 ± 0.0
3.494MetGly: 3.494 ± 1.114
0.0MetHis: 0.0 ± 0.0
0.776MetIle: 0.776 ± 0.7
0.776MetLys: 0.776 ± 0.387
1.165MetLeu: 1.165 ± 0.606
0.388MetMet: 0.388 ± 0.314
0.0MetAsn: 0.0 ± 0.0
1.553MetPro: 1.553 ± 0.385
0.388MetGln: 0.388 ± 0.349
2.329MetArg: 2.329 ± 0.544
1.941MetSer: 1.941 ± 0.516
1.553MetThr: 1.553 ± 0.667
1.553MetVal: 1.553 ± 0.827
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.329AsnAla: 2.329 ± 0.905
0.388AsnCys: 0.388 ± 0.314
0.388AsnAsp: 0.388 ± 0.314
2.329AsnGlu: 2.329 ± 0.618
0.388AsnPhe: 0.388 ± 0.314
2.329AsnGly: 2.329 ± 0.523
0.0AsnHis: 0.0 ± 0.0
0.388AsnIle: 0.388 ± 0.35
0.388AsnLys: 0.388 ± 0.349
1.941AsnLeu: 1.941 ± 0.529
0.776AsnMet: 0.776 ± 0.334
0.776AsnAsn: 0.776 ± 0.582
1.165AsnPro: 1.165 ± 0.504
1.941AsnGln: 1.941 ± 1.331
1.165AsnArg: 1.165 ± 0.523
0.388AsnSer: 0.388 ± 0.35
2.717AsnThr: 2.717 ± 1.069
1.553AsnVal: 1.553 ± 0.667
0.0AsnTrp: 0.0 ± 0.0
0.776AsnTyr: 0.776 ± 0.334
0.0AsnXaa: 0.0 ± 0.0
Pro
5.823ProAla: 5.823 ± 1.148
0.388ProCys: 0.388 ± 0.494
3.882ProAsp: 3.882 ± 0.717
2.329ProGlu: 2.329 ± 0.827
0.776ProPhe: 0.776 ± 0.387
4.27ProGly: 4.27 ± 0.863
3.106ProHis: 3.106 ± 1.229
3.106ProIle: 3.106 ± 1.119
1.553ProLys: 1.553 ± 0.773
5.435ProLeu: 5.435 ± 1.134
0.776ProMet: 0.776 ± 0.564
0.776ProAsn: 0.776 ± 1.07
4.27ProPro: 4.27 ± 0.621
1.165ProGln: 1.165 ± 0.546
5.435ProArg: 5.435 ± 0.688
3.494ProSer: 3.494 ± 1.393
6.211ProThr: 6.211 ± 1.378
6.599ProVal: 6.599 ± 1.372
1.165ProTrp: 1.165 ± 0.606
1.165ProTyr: 1.165 ± 0.468
0.0ProXaa: 0.0 ± 0.0
Gln
2.329GlnAla: 2.329 ± 0.447
0.0GlnCys: 0.0 ± 0.0
0.776GlnAsp: 0.776 ± 0.7
1.941GlnGlu: 1.941 ± 0.712
1.165GlnPhe: 1.165 ± 0.549
1.941GlnGly: 1.941 ± 0.714
1.553GlnHis: 1.553 ± 0.523
0.0GlnIle: 0.0 ± 0.0
0.388GlnLys: 0.388 ± 0.314
5.435GlnLeu: 5.435 ± 2.191
0.388GlnMet: 0.388 ± 0.314
0.0GlnAsn: 0.0 ± 0.0
1.165GlnPro: 1.165 ± 0.692
0.388GlnGln: 0.388 ± 0.314
2.717GlnArg: 2.717 ± 1.157
3.106GlnSer: 3.106 ± 1.382
1.941GlnThr: 1.941 ± 0.423
2.329GlnVal: 2.329 ± 0.792
0.388GlnTrp: 0.388 ± 0.314
0.776GlnTyr: 0.776 ± 0.706
0.0GlnXaa: 0.0 ± 0.0
Arg
12.811ArgAla: 12.811 ± 1.754
0.388ArgCys: 0.388 ± 0.314
2.329ArgAsp: 2.329 ± 0.872
2.717ArgGlu: 2.717 ± 0.86
2.329ArgPhe: 2.329 ± 0.698
5.435ArgGly: 5.435 ± 0.92
2.329ArgHis: 2.329 ± 0.883
2.717ArgIle: 2.717 ± 0.951
1.553ArgLys: 1.553 ± 0.705
8.54ArgLeu: 8.54 ± 1.221
0.776ArgMet: 0.776 ± 0.548
2.329ArgAsn: 2.329 ± 1.171
3.882ArgPro: 3.882 ± 0.94
3.106ArgGln: 3.106 ± 0.806
5.047ArgArg: 5.047 ± 1.26
5.047ArgSer: 5.047 ± 1.805
5.047ArgThr: 5.047 ± 1.596
7.764ArgVal: 7.764 ± 1.885
0.388ArgTrp: 0.388 ± 0.494
2.329ArgTyr: 2.329 ± 1.16
0.0ArgXaa: 0.0 ± 0.0
Ser
6.988SerAla: 6.988 ± 1.719
0.388SerCys: 0.388 ± 0.349
5.435SerAsp: 5.435 ± 1.268
2.717SerGlu: 2.717 ± 1.069
1.165SerPhe: 1.165 ± 0.767
5.047SerGly: 5.047 ± 1.126
0.776SerHis: 0.776 ± 0.387
2.329SerIle: 2.329 ± 0.599
1.553SerLys: 1.553 ± 0.39
5.047SerLeu: 5.047 ± 1.666
1.941SerMet: 1.941 ± 0.645
0.776SerAsn: 0.776 ± 0.558
5.047SerPro: 5.047 ± 1.322
2.329SerGln: 2.329 ± 0.447
6.988SerArg: 6.988 ± 1.704
4.27SerSer: 4.27 ± 0.988
3.106SerThr: 3.106 ± 0.94
5.047SerVal: 5.047 ± 1.633
0.776SerTrp: 0.776 ± 0.387
3.882SerTyr: 3.882 ± 0.621
0.0SerXaa: 0.0 ± 0.0
Thr
6.599ThrAla: 6.599 ± 1.372
1.165ThrCys: 1.165 ± 0.549
4.27ThrAsp: 4.27 ± 1.72
0.776ThrGlu: 0.776 ± 0.334
3.106ThrPhe: 3.106 ± 1.034
5.823ThrGly: 5.823 ± 2.024
1.941ThrHis: 1.941 ± 0.927
4.27ThrIle: 4.27 ± 1.116
2.717ThrLys: 2.717 ± 0.893
4.658ThrLeu: 4.658 ± 0.999
1.553ThrMet: 1.553 ± 0.593
1.553ThrAsn: 1.553 ± 0.39
6.211ThrPro: 6.211 ± 0.889
2.717ThrGln: 2.717 ± 1.311
3.106ThrArg: 3.106 ± 0.843
3.882ThrSer: 3.882 ± 1.178
5.435ThrThr: 5.435 ± 2.482
1.553ThrVal: 1.553 ± 0.453
0.388ThrTrp: 0.388 ± 0.314
1.941ThrTyr: 1.941 ± 0.888
0.0ThrXaa: 0.0 ± 0.0
Val
9.705ValAla: 9.705 ± 0.973
1.553ValCys: 1.553 ± 0.727
5.823ValAsp: 5.823 ± 1.177
5.047ValGlu: 5.047 ± 1.849
4.27ValPhe: 4.27 ± 0.621
6.599ValGly: 6.599 ± 2.14
1.553ValHis: 1.553 ± 0.724
2.717ValIle: 2.717 ± 0.811
3.106ValLys: 3.106 ± 0.683
5.047ValLeu: 5.047 ± 1.493
2.329ValMet: 2.329 ± 1.309
1.165ValAsn: 1.165 ± 0.667
5.047ValPro: 5.047 ± 1.221
1.165ValGln: 1.165 ± 0.667
8.54ValArg: 8.54 ± 1.413
5.823ValSer: 5.823 ± 2.096
4.27ValThr: 4.27 ± 1.71
5.435ValVal: 5.435 ± 2.726
0.776ValTrp: 0.776 ± 0.387
0.776ValTyr: 0.776 ± 0.7
0.0ValXaa: 0.0 ± 0.0
Trp
1.165TrpAla: 1.165 ± 0.556
0.388TrpCys: 0.388 ± 0.349
0.776TrpAsp: 0.776 ± 0.334
0.776TrpGlu: 0.776 ± 0.7
0.0TrpPhe: 0.0 ± 0.0
1.165TrpGly: 1.165 ± 0.983
0.0TrpHis: 0.0 ± 0.0
0.776TrpIle: 0.776 ± 0.629
0.0TrpLys: 0.0 ± 0.0
0.388TrpLeu: 0.388 ± 0.314
0.0TrpMet: 0.0 ± 0.0
0.388TrpAsn: 0.388 ± 0.494
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.388TrpArg: 0.388 ± 0.35
0.776TrpSer: 0.776 ± 0.582
0.388TrpThr: 0.388 ± 0.494
1.165TrpVal: 1.165 ± 0.546
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.047TyrAla: 5.047 ± 1.155
0.0TyrCys: 0.0 ± 0.0
1.165TyrAsp: 1.165 ± 0.612
0.776TyrGlu: 0.776 ± 0.698
0.388TyrPhe: 0.388 ± 0.349
2.717TyrGly: 2.717 ± 1.12
0.776TyrHis: 0.776 ± 0.574
0.776TyrIle: 0.776 ± 0.414
0.0TyrLys: 0.0 ± 0.0
2.717TyrLeu: 2.717 ± 0.682
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
0.776TyrPro: 0.776 ± 0.548
1.165TyrGln: 1.165 ± 0.705
3.882TyrArg: 3.882 ± 1.023
1.553TyrSer: 1.553 ± 0.523
2.329TyrThr: 2.329 ± 0.792
3.494TyrVal: 3.494 ± 0.974
0.0TyrTrp: 0.0 ± 0.0
0.776TyrTyr: 0.776 ± 0.521
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.388XaaLys: 0.388 ± 0.314
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2577 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski