Amino acid dipepetide frequency for Sanxia Water Strider Virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.711AlaAla: 2.711 ± 1.425
1.085AlaCys: 1.085 ± 0.185
2.711AlaAsp: 2.711 ± 1.048
2.35AlaGlu: 2.35 ± 0.335
1.446AlaPhe: 1.446 ± 0.922
1.265AlaGly: 1.265 ± 0.571
0.181AlaHis: 0.181 ± 0.091
3.073AlaIle: 3.073 ± 0.604
1.265AlaLys: 1.265 ± 0.161
3.977AlaLeu: 3.977 ± 0.843
1.085AlaMet: 1.085 ± 0.795
1.446AlaAsn: 1.446 ± 0.575
0.542AlaPro: 0.542 ± 0.685
1.808AlaGln: 1.808 ± 0.317
1.265AlaArg: 1.265 ± 0.636
3.254AlaSer: 3.254 ± 1.113
3.796AlaThr: 3.796 ± 1.504
3.254AlaVal: 3.254 ± 2.878
0.723AlaTrp: 0.723 ± 0.364
1.085AlaTyr: 1.085 ± 0.581
0.0AlaXaa: 0.0 ± 0.0
Cys
0.723CysAla: 0.723 ± 0.356
0.181CysCys: 0.181 ± 0.251
0.723CysAsp: 0.723 ± 0.583
1.627CysGlu: 1.627 ± 0.389
0.723CysPhe: 0.723 ± 0.135
0.723CysGly: 0.723 ± 0.356
0.542CysHis: 0.542 ± 0.13
2.35CysIle: 2.35 ± 0.5
1.988CysLys: 1.988 ± 1.455
1.808CysLeu: 1.808 ± 0.485
0.362CysMet: 0.362 ± 0.182
1.808CysAsn: 1.808 ± 0.596
1.265CysPro: 1.265 ± 1.137
0.904CysGln: 0.904 ± 0.191
1.808CysArg: 1.808 ± 1.022
1.265CysSer: 1.265 ± 0.249
1.808CysThr: 1.808 ± 1.529
0.723CysVal: 0.723 ± 0.288
0.181CysTrp: 0.181 ± 0.251
0.723CysTyr: 0.723 ± 1.006
0.0CysXaa: 0.0 ± 0.0
Asp
1.808AspAla: 1.808 ± 1.238
0.904AspCys: 0.904 ± 0.298
3.615AspAsp: 3.615 ± 0.34
3.796AspGlu: 3.796 ± 0.66
3.073AspPhe: 3.073 ± 0.787
1.446AspGly: 1.446 ± 0.535
1.085AspHis: 1.085 ± 0.185
3.073AspIle: 3.073 ± 0.924
3.615AspLys: 3.615 ± 0.677
5.785AspLeu: 5.785 ± 0.449
1.085AspMet: 1.085 ± 0.402
3.073AspAsn: 3.073 ± 0.924
2.35AspPro: 2.35 ± 1.303
1.808AspGln: 1.808 ± 0.052
3.435AspArg: 3.435 ± 0.366
6.688AspSer: 6.688 ± 2.11
3.073AspThr: 3.073 ± 0.353
2.35AspVal: 2.35 ± 0.335
0.181AspTrp: 0.181 ± 0.091
2.531AspTyr: 2.531 ± 0.689
0.0AspXaa: 0.0 ± 0.0
Glu
2.169GluAla: 2.169 ± 0.72
1.265GluCys: 1.265 ± 0.249
3.796GluAsp: 3.796 ± 1.463
4.881GluGlu: 4.881 ± 2.149
4.158GluPhe: 4.158 ± 0.987
3.615GluGly: 3.615 ± 0.386
1.808GluHis: 1.808 ± 0.689
7.773GluIle: 7.773 ± 2.059
4.7GluLys: 4.7 ± 1.301
7.954GluLeu: 7.954 ± 2.152
3.073GluMet: 3.073 ± 1.252
1.988GluAsn: 1.988 ± 0.066
1.446GluPro: 1.446 ± 0.423
1.627GluGln: 1.627 ± 0.524
1.988GluArg: 1.988 ± 0.856
3.977GluSer: 3.977 ± 1.403
3.796GluThr: 3.796 ± 0.96
4.158GluVal: 4.158 ± 0.469
0.0GluTrp: 0.0 ± 0.0
1.085GluTyr: 1.085 ± 0.267
0.0GluXaa: 0.0 ± 0.0
Phe
1.446PheAla: 1.446 ± 0.504
1.446PheCys: 1.446 ± 0.423
3.435PheAsp: 3.435 ± 0.366
2.35PheGlu: 2.35 ± 0.881
2.531PhePhe: 2.531 ± 0.989
1.808PheGly: 1.808 ± 0.373
1.446PheHis: 1.446 ± 0.185
3.073PheIle: 3.073 ± 0.353
3.615PheLys: 3.615 ± 0.34
4.158PheLeu: 4.158 ± 0.121
0.542PheMet: 0.542 ± 0.273
3.435PheAsn: 3.435 ± 1.132
1.265PhePro: 1.265 ± 0.571
1.446PheGln: 1.446 ± 0.423
2.35PheArg: 2.35 ± 0.5
5.061PheSer: 5.061 ± 1.215
1.988PheThr: 1.988 ± 0.603
1.265PheVal: 1.265 ± 0.161
0.362PheTrp: 0.362 ± 0.503
2.711PheTyr: 2.711 ± 0.926
0.0PheXaa: 0.0 ± 0.0
Gly
1.627GlyAla: 1.627 ± 0.989
1.446GlyCys: 1.446 ± 0.713
2.531GlyAsp: 2.531 ± 0.322
1.808GlyGlu: 1.808 ± 0.891
2.892GlyPhe: 2.892 ± 1.693
2.892GlyGly: 2.892 ± 0.624
1.085GlyHis: 1.085 ± 0.259
2.892GlyIle: 2.892 ± 0.542
2.169GlyLys: 2.169 ± 0.37
4.338GlyLeu: 4.338 ± 0.812
1.446GlyMet: 1.446 ± 0.523
2.531GlyAsn: 2.531 ± 0.929
0.723GlyPro: 0.723 ± 0.356
2.169GlyGln: 2.169 ± 0.15
1.988GlyArg: 1.988 ± 0.422
5.061GlySer: 5.061 ± 1.398
2.169GlyThr: 2.169 ± 0.465
1.446GlyVal: 1.446 ± 0.185
0.542GlyTrp: 0.542 ± 0.13
1.265GlyTyr: 1.265 ± 0.474
0.0GlyXaa: 0.0 ± 0.0
His
0.723HisAla: 0.723 ± 0.288
0.542HisCys: 0.542 ± 0.397
0.723HisAsp: 0.723 ± 0.135
0.723HisGlu: 0.723 ± 0.135
1.085HisPhe: 1.085 ± 0.715
1.085HisGly: 1.085 ± 0.534
0.904HisHis: 0.904 ± 0.455
1.627HisIle: 1.627 ± 0.818
1.265HisLys: 1.265 ± 0.249
2.35HisLeu: 2.35 ± 0.446
0.542HisMet: 0.542 ± 0.273
0.904HisAsn: 0.904 ± 0.298
0.542HisPro: 0.542 ± 0.13
0.723HisGln: 0.723 ± 0.316
1.808HisArg: 1.808 ± 0.612
1.265HisSer: 1.265 ± 0.474
0.904HisThr: 0.904 ± 0.313
0.904HisVal: 0.904 ± 0.805
0.362HisTrp: 0.362 ± 0.182
0.542HisTyr: 0.542 ± 0.13
0.0HisXaa: 0.0 ± 0.0
Ile
3.435IleAla: 3.435 ± 0.886
1.446IleCys: 1.446 ± 1.195
4.7IleAsp: 4.7 ± 0.478
5.423IleGlu: 5.423 ± 0.917
2.892IlePhe: 2.892 ± 0.834
2.531IleGly: 2.531 ± 0.498
2.35IleHis: 2.35 ± 0.446
7.231IleIle: 7.231 ± 1.747
8.496IleLys: 8.496 ± 1.428
7.954IleLeu: 7.954 ± 0.784
1.988IleMet: 1.988 ± 1.0
7.231IleAsn: 7.231 ± 0.808
2.711IlePro: 2.711 ± 0.785
3.254IleGln: 3.254 ± 0.206
3.615IleArg: 3.615 ± 0.492
10.123IleSer: 10.123 ± 1.123
3.796IleThr: 3.796 ± 0.661
2.531IleVal: 2.531 ± 0.689
0.362IleTrp: 0.362 ± 0.178
3.254IleTyr: 3.254 ± 1.037
0.0IleXaa: 0.0 ± 0.0
Lys
2.531LysAla: 2.531 ± 0.329
1.446LysCys: 1.446 ± 0.713
4.519LysAsp: 4.519 ± 1.645
6.869LysGlu: 6.869 ± 1.288
3.977LysPhe: 3.977 ± 0.358
2.711LysGly: 2.711 ± 0.42
1.627LysHis: 1.627 ± 0.318
6.508LysIle: 6.508 ± 1.115
4.519LysLys: 4.519 ± 0.853
8.496LysLeu: 8.496 ± 2.422
1.988LysMet: 1.988 ± 0.398
4.519LysAsn: 4.519 ± 0.295
1.988LysPro: 1.988 ± 0.551
3.615LysGln: 3.615 ± 1.224
1.627LysArg: 1.627 ± 0.776
7.411LysSer: 7.411 ± 1.365
5.242LysThr: 5.242 ± 0.601
3.977LysVal: 3.977 ± 1.714
1.446LysTrp: 1.446 ± 0.221
3.073LysTyr: 3.073 ± 0.584
0.0LysXaa: 0.0 ± 0.0
Leu
3.796LeuAla: 3.796 ± 0.834
1.808LeuCys: 1.808 ± 0.373
6.146LeuAsp: 6.146 ± 1.793
4.881LeuGlu: 4.881 ± 0.533
3.796LeuPhe: 3.796 ± 0.296
3.435LeuGly: 3.435 ± 1.04
1.265LeuHis: 1.265 ± 0.474
9.038LeuIle: 9.038 ± 1.909
11.027LeuLys: 11.027 ± 2.36
9.219LeuLeu: 9.219 ± 0.958
2.892LeuMet: 2.892 ± 0.843
4.7LeuAsn: 4.7 ± 1.184
2.711LeuPro: 2.711 ± 0.42
3.254LeuGln: 3.254 ± 0.265
4.7LeuArg: 4.7 ± 1.184
12.834LeuSer: 12.834 ± 1.273
7.592LeuThr: 7.592 ± 0.762
3.977LeuVal: 3.977 ± 0.358
1.085LeuTrp: 1.085 ± 0.715
4.338LeuTyr: 4.338 ± 0.931
0.0LeuXaa: 0.0 ± 0.0
Met
0.904MetAla: 0.904 ± 0.313
0.542MetCys: 0.542 ± 0.13
1.085MetAsp: 1.085 ± 0.545
0.723MetGlu: 0.723 ± 0.364
1.085MetPhe: 1.085 ± 0.259
0.904MetGly: 0.904 ± 0.455
0.723MetHis: 0.723 ± 0.288
2.35MetIle: 2.35 ± 0.902
1.988MetLys: 1.988 ± 1.0
3.254MetLeu: 3.254 ± 0.609
1.085MetMet: 1.085 ± 0.267
1.085MetAsn: 1.085 ± 0.267
0.904MetPro: 0.904 ± 0.243
0.0MetGln: 0.0 ± 0.0
1.446MetArg: 1.446 ± 0.492
2.711MetSer: 2.711 ± 0.745
1.085MetThr: 1.085 ± 0.267
1.627MetVal: 1.627 ± 1.075
0.0MetTrp: 0.0 ± 0.0
1.085MetTyr: 1.085 ± 0.267
0.0MetXaa: 0.0 ± 0.0
Asn
2.892AsnAla: 2.892 ± 2.558
1.446AsnCys: 1.446 ± 0.271
1.988AsnAsp: 1.988 ± 0.551
4.158AsnGlu: 4.158 ± 0.547
2.169AsnPhe: 2.169 ± 0.406
1.808AsnGly: 1.808 ± 0.357
1.085AsnHis: 1.085 ± 0.611
5.604AsnIle: 5.604 ± 1.647
3.977AsnLys: 3.977 ± 1.134
6.146AsnLeu: 6.146 ± 0.892
0.542AsnMet: 0.542 ± 0.273
3.796AsnAsn: 3.796 ± 0.473
2.35AsnPro: 2.35 ± 0.114
1.627AsnGln: 1.627 ± 0.243
1.265AsnArg: 1.265 ± 0.421
6.146AsnSer: 6.146 ± 0.835
3.073AsnThr: 3.073 ± 0.604
2.892AsnVal: 2.892 ± 0.235
0.542AsnTrp: 0.542 ± 0.273
3.254AsnTyr: 3.254 ± 1.113
0.0AsnXaa: 0.0 ± 0.0
Pro
1.446ProAla: 1.446 ± 1.35
0.723ProCys: 0.723 ± 0.316
1.085ProAsp: 1.085 ± 0.36
1.446ProGlu: 1.446 ± 0.492
1.988ProPhe: 1.988 ± 0.422
1.808ProGly: 1.808 ± 0.317
0.181ProHis: 0.181 ± 0.091
3.073ProIle: 3.073 ± 0.584
3.615ProLys: 3.615 ± 0.386
2.711ProLeu: 2.711 ± 1.477
0.362ProMet: 0.362 ± 0.178
1.446ProAsn: 1.446 ± 0.423
0.723ProPro: 0.723 ± 0.288
0.362ProGln: 0.362 ± 0.32
1.085ProArg: 1.085 ± 0.545
3.073ProSer: 3.073 ± 0.47
1.988ProThr: 1.988 ± 0.936
0.904ProVal: 0.904 ± 0.455
0.542ProTrp: 0.542 ± 0.29
0.542ProTyr: 0.542 ± 0.13
0.0ProXaa: 0.0 ± 0.0
Gln
0.723GlnAla: 0.723 ± 0.364
0.542GlnCys: 0.542 ± 0.426
2.35GlnAsp: 2.35 ± 0.446
1.988GlnGlu: 1.988 ± 0.701
1.988GlnPhe: 1.988 ± 0.422
1.808GlnGly: 1.808 ± 0.357
0.723GlnHis: 0.723 ± 0.135
3.435GlnIle: 3.435 ± 0.48
2.531GlnLys: 2.531 ± 0.843
3.977GlnLeu: 3.977 ± 0.463
1.265GlnMet: 1.265 ± 0.636
2.531GlnAsn: 2.531 ± 0.507
1.085GlnPro: 1.085 ± 0.534
1.265GlnGln: 1.265 ± 0.161
1.446GlnArg: 1.446 ± 0.504
2.892GlnSer: 2.892 ± 0.235
2.169GlnThr: 2.169 ± 0.483
1.808GlnVal: 1.808 ± 0.919
0.181GlnTrp: 0.181 ± 0.091
0.904GlnTyr: 0.904 ± 0.191
0.0GlnXaa: 0.0 ± 0.0
Arg
1.988ArgAla: 1.988 ± 1.607
0.542ArgCys: 0.542 ± 0.397
1.446ArgAsp: 1.446 ± 0.185
3.073ArgGlu: 3.073 ± 1.019
2.169ArgPhe: 2.169 ± 0.15
2.892ArgGly: 2.892 ± 0.443
0.181ArgHis: 0.181 ± 0.091
2.531ArgIle: 2.531 ± 0.329
3.073ArgLys: 3.073 ± 0.61
4.7ArgLeu: 4.7 ± 0.593
0.723ArgMet: 0.723 ± 0.316
1.808ArgAsn: 1.808 ± 0.45
1.446ArgPro: 1.446 ± 0.727
1.627ArgGln: 1.627 ± 0.318
2.711ArgArg: 2.711 ± 0.139
3.796ArgSer: 3.796 ± 0.383
2.169ArgThr: 2.169 ± 0.529
1.627ArgVal: 1.627 ± 0.318
0.542ArgTrp: 0.542 ± 0.273
1.988ArgTyr: 1.988 ± 0.936
0.0ArgXaa: 0.0 ± 0.0
Ser
3.073SerAla: 3.073 ± 0.212
1.265SerCys: 1.265 ± 0.625
6.146SerAsp: 6.146 ± 1.158
11.208SerGlu: 11.208 ± 2.185
4.338SerPhe: 4.338 ± 0.455
3.977SerGly: 3.977 ± 0.132
1.085SerHis: 1.085 ± 0.545
8.315SerIle: 8.315 ± 1.622
9.581SerLys: 9.581 ± 2.397
9.942SerLeu: 9.942 ± 1.967
1.446SerMet: 1.446 ± 0.271
5.965SerAsn: 5.965 ± 0.349
2.892SerPro: 2.892 ± 0.371
3.254SerGln: 3.254 ± 0.206
2.531SerArg: 2.531 ± 0.544
13.738SerSer: 13.738 ± 4.272
6.869SerThr: 6.869 ± 1.392
5.604SerVal: 5.604 ± 1.128
1.446SerTrp: 1.446 ± 0.535
3.796SerTyr: 3.796 ± 0.483
0.0SerXaa: 0.0 ± 0.0
Thr
2.531ThrAla: 2.531 ± 0.689
2.711ThrCys: 2.711 ± 1.937
3.615ThrAsp: 3.615 ± 0.714
2.711ThrGlu: 2.711 ± 0.912
1.988ThrPhe: 1.988 ± 0.894
2.531ThrGly: 2.531 ± 0.322
1.265ThrHis: 1.265 ± 0.78
5.965ThrIle: 5.965 ± 1.092
1.627ThrLys: 1.627 ± 0.389
6.508ThrLeu: 6.508 ± 0.337
1.627ThrMet: 1.627 ± 0.418
3.254ThrAsn: 3.254 ± 1.013
1.446ThrPro: 1.446 ± 0.633
1.988ThrGln: 1.988 ± 0.668
1.808ThrArg: 1.808 ± 0.485
6.327ThrSer: 6.327 ± 0.618
2.892ThrThr: 2.892 ± 1.523
3.977ThrVal: 3.977 ± 0.872
0.723ThrTrp: 0.723 ± 0.364
1.988ThrTyr: 1.988 ± 0.936
0.0ThrXaa: 0.0 ± 0.0
Val
2.892ValAla: 2.892 ± 1.693
1.085ValCys: 1.085 ± 0.267
2.169ValAsp: 2.169 ± 0.529
2.711ValGlu: 2.711 ± 0.139
1.265ValPhe: 1.265 ± 0.249
3.435ValGly: 3.435 ± 0.476
0.723ValHis: 0.723 ± 0.356
3.254ValIle: 3.254 ± 0.683
5.604ValLys: 5.604 ± 0.62
4.7ValLeu: 4.7 ± 0.545
0.723ValMet: 0.723 ± 0.64
3.435ValAsn: 3.435 ± 1.289
1.085ValPro: 1.085 ± 0.185
1.446ValGln: 1.446 ± 0.633
1.808ValArg: 1.808 ± 0.357
5.604ValSer: 5.604 ± 1.473
1.808ValThr: 1.808 ± 1.238
1.988ValVal: 1.988 ± 0.27
0.542ValTrp: 0.542 ± 0.426
1.265ValTyr: 1.265 ± 0.474
0.0ValXaa: 0.0 ± 0.0
Trp
0.181TrpAla: 0.181 ± 0.091
0.362TrpCys: 0.362 ± 0.503
0.181TrpAsp: 0.181 ± 0.091
0.362TrpGlu: 0.362 ± 0.178
0.542TrpPhe: 0.542 ± 0.13
0.542TrpGly: 0.542 ± 0.13
0.181TrpHis: 0.181 ± 0.091
0.542TrpIle: 0.542 ± 0.426
0.904TrpLys: 0.904 ± 0.298
0.723TrpLeu: 0.723 ± 0.288
0.181TrpMet: 0.181 ± 0.091
0.0TrpAsn: 0.0 ± 0.0
0.181TrpPro: 0.181 ± 0.091
1.085TrpGln: 1.085 ± 0.402
0.362TrpArg: 0.362 ± 0.32
1.627TrpSer: 1.627 ± 0.318
0.362TrpThr: 0.362 ± 0.178
0.904TrpVal: 0.904 ± 0.243
0.0TrpTrp: 0.0 ± 0.0
0.542TrpTyr: 0.542 ± 0.273
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.723TyrAla: 0.723 ± 0.316
1.446TyrCys: 1.446 ± 0.535
1.808TyrAsp: 1.808 ± 0.485
2.169TyrGlu: 2.169 ± 0.15
1.627TyrPhe: 1.627 ± 0.133
1.808TyrGly: 1.808 ± 0.373
1.085TyrHis: 1.085 ± 0.267
3.615TyrIle: 3.615 ± 0.386
2.531TyrLys: 2.531 ± 0.689
3.796TyrLeu: 3.796 ± 0.958
1.446TyrMet: 1.446 ± 0.221
1.808TyrAsn: 1.808 ± 1.109
1.446TyrPro: 1.446 ± 0.713
2.35TyrGln: 2.35 ± 0.239
1.988TyrArg: 1.988 ± 0.27
3.615TyrSer: 3.615 ± 1.192
1.085TyrThr: 1.085 ± 0.545
1.627TyrVal: 1.627 ± 0.318
0.0TyrTrp: 0.0 ± 0.0
0.904TyrTyr: 0.904 ± 0.191
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5533 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski