Amino acid dipepetide frequency for Vibrio virus fs1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.546AlaAla: 5.546 ± 2.219
0.555AlaCys: 0.555 ± 0.535
2.219AlaAsp: 2.219 ± 1.033
3.328AlaGlu: 3.328 ± 1.006
4.992AlaPhe: 4.992 ± 1.084
2.773AlaGly: 2.773 ± 1.751
2.219AlaHis: 2.219 ± 1.176
6.101AlaIle: 6.101 ± 1.828
3.882AlaLys: 3.882 ± 1.175
9.983AlaLeu: 9.983 ± 2.94
1.109AlaMet: 1.109 ± 0.822
2.773AlaAsn: 2.773 ± 1.244
2.219AlaPro: 2.219 ± 0.967
4.992AlaGln: 4.992 ± 1.207
0.555AlaArg: 0.555 ± 0.535
2.219AlaSer: 2.219 ± 1.203
1.109AlaThr: 1.109 ± 0.683
7.765AlaVal: 7.765 ± 2.786
1.109AlaTrp: 1.109 ± 0.752
3.328AlaTyr: 3.328 ± 0.856
0.0AlaXaa: 0.0 ± 0.0
Cys
1.109CysAla: 1.109 ± 0.597
0.0CysCys: 0.0 ± 0.0
1.664CysAsp: 1.664 ± 0.853
0.555CysGlu: 0.555 ± 0.473
1.664CysPhe: 1.664 ± 0.723
1.109CysGly: 1.109 ± 0.678
0.555CysHis: 0.555 ± 0.473
2.773CysIle: 2.773 ± 0.988
1.109CysLys: 1.109 ± 0.564
0.555CysLeu: 0.555 ± 0.562
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.555CysPro: 0.555 ± 0.535
0.555CysGln: 0.555 ± 0.473
0.0CysArg: 0.0 ± 0.0
2.773CysSer: 2.773 ± 1.37
2.219CysThr: 2.219 ± 0.797
1.109CysVal: 1.109 ± 0.535
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.992AspAla: 4.992 ± 1.332
0.555AspCys: 0.555 ± 0.473
4.437AspAsp: 4.437 ± 1.427
1.664AspGlu: 1.664 ± 0.847
2.219AspPhe: 2.219 ± 0.941
4.437AspGly: 4.437 ± 1.437
1.664AspHis: 1.664 ± 0.533
5.546AspIle: 5.546 ± 1.709
1.109AspLys: 1.109 ± 0.918
6.101AspLeu: 6.101 ± 1.436
2.219AspMet: 2.219 ± 0.979
1.109AspAsn: 1.109 ± 0.597
5.546AspPro: 5.546 ± 2.611
1.664AspGln: 1.664 ± 0.533
0.555AspArg: 0.555 ± 0.5
2.773AspSer: 2.773 ± 1.257
3.882AspThr: 3.882 ± 1.097
3.328AspVal: 3.328 ± 2.104
1.109AspTrp: 1.109 ± 0.637
2.219AspTyr: 2.219 ± 1.314
0.0AspXaa: 0.0 ± 0.0
Glu
4.437GluAla: 4.437 ± 1.134
2.219GluCys: 2.219 ± 0.855
1.664GluAsp: 1.664 ± 0.894
1.109GluGlu: 1.109 ± 0.727
1.664GluPhe: 1.664 ± 0.7
0.555GluGly: 0.555 ± 0.535
2.219GluHis: 2.219 ± 0.785
0.555GluIle: 0.555 ± 0.429
4.437GluLys: 4.437 ± 1.134
3.882GluLeu: 3.882 ± 1.724
0.555GluMet: 0.555 ± 0.429
1.664GluAsn: 1.664 ± 0.799
4.437GluPro: 4.437 ± 1.748
3.328GluGln: 3.328 ± 1.306
0.0GluArg: 0.0 ± 0.0
2.773GluSer: 2.773 ± 1.369
3.328GluThr: 3.328 ± 1.488
1.664GluVal: 1.664 ± 1.07
0.555GluTrp: 0.555 ± 0.611
1.664GluTyr: 1.664 ± 0.69
0.0GluXaa: 0.0 ± 0.0
Phe
4.992PheAla: 4.992 ± 1.79
0.555PheCys: 0.555 ± 0.473
3.882PheAsp: 3.882 ± 0.956
1.664PheGlu: 1.664 ± 0.83
0.555PhePhe: 0.555 ± 0.473
4.992PheGly: 4.992 ± 1.364
1.109PheHis: 1.109 ± 0.656
2.219PheIle: 2.219 ± 0.796
1.664PheLys: 1.664 ± 0.738
3.882PheLeu: 3.882 ± 1.727
2.773PheMet: 2.773 ± 0.786
1.664PheAsn: 1.664 ± 0.782
1.664PhePro: 1.664 ± 0.69
1.109PheGln: 1.109 ± 0.637
2.219PheArg: 2.219 ± 1.24
5.546PheSer: 5.546 ± 1.467
3.328PheThr: 3.328 ± 1.027
2.219PheVal: 2.219 ± 0.929
1.109PheTrp: 1.109 ± 0.535
2.773PheTyr: 2.773 ± 1.178
0.0PheXaa: 0.0 ± 0.0
Gly
4.437GlyAla: 4.437 ± 1.331
1.109GlyCys: 1.109 ± 0.564
3.328GlyAsp: 3.328 ± 1.357
1.664GlyGlu: 1.664 ± 0.669
4.992GlyPhe: 4.992 ± 1.551
3.328GlyGly: 3.328 ± 1.606
1.109GlyHis: 1.109 ± 0.858
6.101GlyIle: 6.101 ± 1.931
2.773GlyLys: 2.773 ± 0.969
4.437GlyLeu: 4.437 ± 1.645
2.773GlyMet: 2.773 ± 1.08
1.664GlyAsn: 1.664 ± 0.696
0.555GlyPro: 0.555 ± 0.535
2.773GlyGln: 2.773 ± 1.3
1.664GlyArg: 1.664 ± 0.889
5.546GlySer: 5.546 ± 1.32
2.773GlyThr: 2.773 ± 1.072
3.328GlyVal: 3.328 ± 2.229
0.555GlyTrp: 0.555 ± 0.473
3.328GlyTyr: 3.328 ± 1.411
0.0GlyXaa: 0.0 ± 0.0
His
0.555HisAla: 0.555 ± 0.429
0.0HisCys: 0.0 ± 0.0
1.109HisAsp: 1.109 ± 0.85
0.555HisGlu: 0.555 ± 0.5
0.0HisPhe: 0.0 ± 0.0
1.109HisGly: 1.109 ± 0.58
1.109HisHis: 1.109 ± 1.0
1.109HisIle: 1.109 ± 0.78
2.773HisLys: 2.773 ± 1.251
2.219HisLeu: 2.219 ± 1.129
1.109HisMet: 1.109 ± 0.58
0.555HisAsn: 0.555 ± 0.473
0.555HisPro: 0.555 ± 0.5
0.0HisGln: 0.0 ± 0.0
1.664HisArg: 1.664 ± 0.696
1.664HisSer: 1.664 ± 0.69
0.555HisThr: 0.555 ± 0.429
1.109HisVal: 1.109 ± 0.858
0.555HisTrp: 0.555 ± 0.5
1.664HisTyr: 1.664 ± 1.287
0.0HisXaa: 0.0 ± 0.0
Ile
3.882IleAla: 3.882 ± 1.6
1.664IleCys: 1.664 ± 0.602
6.656IleAsp: 6.656 ± 2.553
5.546IleGlu: 5.546 ± 1.36
1.109IlePhe: 1.109 ± 1.029
2.773IleGly: 2.773 ± 1.157
0.555IleHis: 0.555 ± 0.429
4.992IleIle: 4.992 ± 1.831
2.773IleLys: 2.773 ± 0.997
5.546IleLeu: 5.546 ± 1.496
1.664IleMet: 1.664 ± 1.01
4.992IleAsn: 4.992 ± 1.776
6.656IlePro: 6.656 ± 1.761
3.882IleGln: 3.882 ± 1.148
1.664IleArg: 1.664 ± 0.904
7.21IleSer: 7.21 ± 2.288
6.101IleThr: 6.101 ± 1.432
4.992IleVal: 4.992 ± 1.075
1.664IleTrp: 1.664 ± 0.854
3.328IleTyr: 3.328 ± 1.059
0.0IleXaa: 0.0 ± 0.0
Lys
3.882LysAla: 3.882 ± 1.129
1.664LysCys: 1.664 ± 1.131
2.773LysAsp: 2.773 ± 1.021
1.109LysGlu: 1.109 ± 0.58
1.664LysPhe: 1.664 ± 0.741
1.664LysGly: 1.664 ± 1.025
1.109LysHis: 1.109 ± 0.678
3.882LysIle: 3.882 ± 0.903
6.656LysLys: 6.656 ± 1.568
5.546LysLeu: 5.546 ± 1.695
3.328LysMet: 3.328 ± 1.393
2.773LysAsn: 2.773 ± 0.671
2.773LysPro: 2.773 ± 1.416
3.328LysGln: 3.328 ± 1.352
4.437LysArg: 4.437 ± 1.587
4.437LysSer: 4.437 ± 1.191
5.546LysThr: 5.546 ± 1.346
5.546LysVal: 5.546 ± 1.877
0.0LysTrp: 0.0 ± 0.0
1.664LysTyr: 1.664 ± 0.642
0.0LysXaa: 0.0 ± 0.0
Leu
5.546LeuAla: 5.546 ± 2.119
1.109LeuCys: 1.109 ± 0.946
3.328LeuAsp: 3.328 ± 0.856
3.882LeuGlu: 3.882 ± 1.0
3.328LeuPhe: 3.328 ± 1.869
8.319LeuGly: 8.319 ± 1.313
1.664LeuHis: 1.664 ± 1.23
12.202LeuIle: 12.202 ± 3.028
5.546LeuLys: 5.546 ± 1.355
6.656LeuLeu: 6.656 ± 2.486
2.773LeuMet: 2.773 ± 1.332
6.101LeuAsn: 6.101 ± 1.484
6.101LeuPro: 6.101 ± 2.239
1.109LeuGln: 1.109 ± 0.656
3.328LeuArg: 3.328 ± 1.257
4.992LeuSer: 4.992 ± 1.803
5.546LeuThr: 5.546 ± 1.797
3.328LeuVal: 3.328 ± 1.213
1.109LeuTrp: 1.109 ± 0.752
4.437LeuTyr: 4.437 ± 1.53
0.0LeuXaa: 0.0 ± 0.0
Met
4.437MetAla: 4.437 ± 2.015
0.0MetCys: 0.0 ± 0.0
1.109MetAsp: 1.109 ± 1.123
1.109MetGlu: 1.109 ± 0.678
1.664MetPhe: 1.664 ± 0.802
1.109MetGly: 1.109 ± 0.58
0.555MetHis: 0.555 ± 0.429
2.773MetIle: 2.773 ± 1.286
0.555MetLys: 0.555 ± 0.611
2.219MetLeu: 2.219 ± 1.044
0.555MetMet: 0.555 ± 0.456
1.664MetAsn: 1.664 ± 0.894
2.773MetPro: 2.773 ± 1.523
0.0MetGln: 0.0 ± 0.0
1.109MetArg: 1.109 ± 0.564
2.773MetSer: 2.773 ± 1.114
1.664MetThr: 1.664 ± 0.616
2.219MetVal: 2.219 ± 0.532
0.0MetTrp: 0.0 ± 0.0
0.555MetTyr: 0.555 ± 0.707
0.0MetXaa: 0.0 ± 0.0
Asn
1.664AsnAla: 1.664 ± 1.016
0.555AsnCys: 0.555 ± 0.473
2.219AsnAsp: 2.219 ± 1.417
2.219AsnGlu: 2.219 ± 1.257
1.109AsnPhe: 1.109 ± 0.866
1.109AsnGly: 1.109 ± 0.678
0.555AsnHis: 0.555 ± 0.473
3.882AsnIle: 3.882 ± 1.761
6.101AsnLys: 6.101 ± 1.792
3.882AsnLeu: 3.882 ± 1.001
0.555AsnMet: 0.555 ± 0.664
2.219AsnAsn: 2.219 ± 0.952
4.437AsnPro: 4.437 ± 2.066
2.219AsnGln: 2.219 ± 1.405
2.219AsnArg: 2.219 ± 0.94
3.328AsnSer: 3.328 ± 1.051
3.328AsnThr: 3.328 ± 1.034
1.664AsnVal: 1.664 ± 1.043
1.109AsnTrp: 1.109 ± 0.858
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.219ProAla: 2.219 ± 0.791
1.109ProCys: 1.109 ± 0.564
5.546ProAsp: 5.546 ± 2.412
3.882ProGlu: 3.882 ± 2.858
5.546ProPhe: 5.546 ± 1.357
0.0ProGly: 0.0 ± 0.0
1.664ProHis: 1.664 ± 1.114
0.555ProIle: 0.555 ± 0.429
4.992ProLys: 4.992 ± 1.457
6.101ProLeu: 6.101 ± 2.161
1.664ProMet: 1.664 ± 0.643
2.773ProAsn: 2.773 ± 1.511
2.219ProPro: 2.219 ± 0.999
2.773ProGln: 2.773 ± 1.16
2.773ProArg: 2.773 ± 0.833
5.546ProSer: 5.546 ± 2.261
4.992ProThr: 4.992 ± 1.324
3.328ProVal: 3.328 ± 1.307
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.219GlnAla: 2.219 ± 0.772
1.109GlnCys: 1.109 ± 0.752
2.773GlnAsp: 2.773 ± 0.819
1.109GlnGlu: 1.109 ± 0.597
1.664GlnPhe: 1.664 ± 1.287
2.773GlnGly: 2.773 ± 1.747
0.0GlnHis: 0.0 ± 0.0
3.328GlnIle: 3.328 ± 1.327
1.109GlnLys: 1.109 ± 0.551
4.437GlnLeu: 4.437 ± 1.635
0.555GlnMet: 0.555 ± 0.5
1.664GlnAsn: 1.664 ± 0.794
2.773GlnPro: 2.773 ± 1.123
1.664GlnGln: 1.664 ± 0.847
2.219GlnArg: 2.219 ± 0.825
3.882GlnSer: 3.882 ± 1.712
2.219GlnThr: 2.219 ± 1.013
3.882GlnVal: 3.882 ± 1.186
0.0GlnTrp: 0.0 ± 0.0
0.555GlnTyr: 0.555 ± 0.512
0.0GlnXaa: 0.0 ± 0.0
Arg
2.773ArgAla: 2.773 ± 1.302
0.555ArgCys: 0.555 ± 0.557
1.109ArgAsp: 1.109 ± 0.749
2.219ArgGlu: 2.219 ± 1.162
3.882ArgPhe: 3.882 ± 1.358
1.664ArgGly: 1.664 ± 1.064
0.555ArgHis: 0.555 ± 0.5
6.101ArgIle: 6.101 ± 1.507
2.773ArgLys: 2.773 ± 1.35
4.992ArgLeu: 4.992 ± 1.205
0.555ArgMet: 0.555 ± 0.627
1.664ArgAsn: 1.664 ± 1.156
3.882ArgPro: 3.882 ± 0.888
0.555ArgGln: 0.555 ± 0.535
2.773ArgArg: 2.773 ± 1.987
1.664ArgSer: 1.664 ± 0.723
1.664ArgThr: 1.664 ± 0.766
1.109ArgVal: 1.109 ± 0.535
0.555ArgTrp: 0.555 ± 0.429
0.555ArgTyr: 0.555 ± 0.473
0.0ArgXaa: 0.0 ± 0.0
Ser
6.101SerAla: 6.101 ± 1.806
0.555SerCys: 0.555 ± 0.473
4.437SerAsp: 4.437 ± 1.787
1.109SerGlu: 1.109 ± 0.78
3.882SerPhe: 3.882 ± 1.032
7.765SerGly: 7.765 ± 1.536
1.109SerHis: 1.109 ± 0.722
2.773SerIle: 2.773 ± 0.835
7.21SerLys: 7.21 ± 1.749
7.21SerLeu: 7.21 ± 2.893
3.882SerMet: 3.882 ± 1.084
3.328SerAsn: 3.328 ± 1.246
1.109SerPro: 1.109 ± 0.564
2.219SerGln: 2.219 ± 1.304
4.437SerArg: 4.437 ± 1.608
4.992SerSer: 4.992 ± 1.517
2.219SerThr: 2.219 ± 1.154
3.882SerVal: 3.882 ± 1.592
0.0SerTrp: 0.0 ± 0.0
3.328SerTyr: 3.328 ± 0.837
0.0SerXaa: 0.0 ± 0.0
Thr
3.328ThrAla: 3.328 ± 1.139
2.773ThrCys: 2.773 ± 1.015
3.328ThrAsp: 3.328 ± 1.41
2.773ThrGlu: 2.773 ± 0.859
2.773ThrPhe: 2.773 ± 0.912
6.101ThrGly: 6.101 ± 1.452
0.555ThrHis: 0.555 ± 0.685
3.328ThrIle: 3.328 ± 0.89
3.882ThrLys: 3.882 ± 1.335
4.437ThrLeu: 4.437 ± 1.085
1.664ThrMet: 1.664 ± 1.181
2.219ThrAsn: 2.219 ± 1.028
3.882ThrPro: 3.882 ± 1.169
2.219ThrGln: 2.219 ± 0.857
3.882ThrArg: 3.882 ± 0.801
2.773ThrSer: 2.773 ± 1.413
3.882ThrThr: 3.882 ± 2.172
3.328ThrVal: 3.328 ± 1.122
0.555ThrTrp: 0.555 ± 0.429
3.328ThrTyr: 3.328 ± 1.161
0.0ThrXaa: 0.0 ± 0.0
Val
3.328ValAla: 3.328 ± 1.129
1.109ValCys: 1.109 ± 0.918
3.882ValAsp: 3.882 ± 1.287
4.437ValGlu: 4.437 ± 1.555
3.882ValPhe: 3.882 ± 1.044
2.773ValGly: 2.773 ± 0.718
0.0ValHis: 0.0 ± 0.0
7.21ValIle: 7.21 ± 2.076
2.773ValLys: 2.773 ± 1.089
2.773ValLeu: 2.773 ± 0.874
0.555ValMet: 0.555 ± 0.473
3.882ValAsn: 3.882 ± 1.614
3.328ValPro: 3.328 ± 1.092
2.219ValGln: 2.219 ± 0.935
2.773ValArg: 2.773 ± 1.0
4.992ValSer: 4.992 ± 1.43
4.437ValThr: 4.437 ± 1.87
2.219ValVal: 2.219 ± 0.655
0.0ValTrp: 0.0 ± 0.0
2.773ValTyr: 2.773 ± 1.273
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.664TrpGly: 1.664 ± 0.847
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.555TrpLys: 0.555 ± 0.429
2.219TrpLeu: 2.219 ± 1.638
0.0TrpMet: 0.0 ± 0.0
0.555TrpAsn: 0.555 ± 0.473
1.109TrpPro: 1.109 ± 0.635
0.555TrpGln: 0.555 ± 0.611
1.109TrpArg: 1.109 ± 0.551
0.555TrpSer: 0.555 ± 0.535
0.0TrpThr: 0.0 ± 0.0
1.664TrpVal: 1.664 ± 1.215
0.555TrpTrp: 0.555 ± 0.5
0.555TrpTyr: 0.555 ± 0.535
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.882TyrAla: 3.882 ± 1.024
1.109TyrCys: 1.109 ± 0.811
1.664TyrAsp: 1.664 ± 0.844
2.773TyrGlu: 2.773 ± 1.089
3.328TyrPhe: 3.328 ± 1.023
2.219TyrGly: 2.219 ± 0.94
1.664TyrHis: 1.664 ± 0.889
2.219TyrIle: 2.219 ± 0.942
1.109TyrLys: 1.109 ± 0.858
3.328TyrLeu: 3.328 ± 1.127
0.0TyrMet: 0.0 ± 0.0
1.109TyrAsn: 1.109 ± 0.678
1.109TyrPro: 1.109 ± 0.597
2.773TyrGln: 2.773 ± 1.263
2.219TyrArg: 2.219 ± 0.759
1.109TyrSer: 1.109 ± 0.749
2.219TyrThr: 2.219 ± 1.138
1.664TyrVal: 1.664 ± 0.844
0.555TyrTrp: 0.555 ± 0.535
1.664TyrTyr: 1.664 ± 0.847
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (1804 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski