Amino acid dipepetide frequency for Sweet potato chlorotic fleck virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.131AlaAla: 6.131 ± 4.236
1.362AlaCys: 1.362 ± 0.781
2.044AlaAsp: 2.044 ± 0.944
2.725AlaGlu: 2.725 ± 1.012
3.747AlaPhe: 3.747 ± 1.196
2.725AlaGly: 2.725 ± 1.012
0.681AlaHis: 0.681 ± 0.504
4.428AlaIle: 4.428 ± 1.46
6.131AlaLys: 6.131 ± 1.119
6.131AlaLeu: 6.131 ± 1.121
0.681AlaMet: 0.681 ± 0.532
2.725AlaAsn: 2.725 ± 1.011
1.703AlaPro: 1.703 ± 0.593
2.384AlaGln: 2.384 ± 1.459
2.044AlaArg: 2.044 ± 1.024
3.747AlaSer: 3.747 ± 0.732
3.747AlaThr: 3.747 ± 0.531
3.406AlaVal: 3.406 ± 0.594
0.341AlaTrp: 0.341 ± 0.179
2.725AlaTyr: 2.725 ± 0.979
0.0AlaXaa: 0.0 ± 0.0
Cys
1.703CysAla: 1.703 ± 0.814
0.681CysCys: 0.681 ± 0.824
1.362CysAsp: 1.362 ± 1.224
2.384CysGlu: 2.384 ± 1.826
2.384CysPhe: 2.384 ± 1.102
2.725CysGly: 2.725 ± 2.449
0.341CysHis: 0.341 ± 0.826
2.044CysIle: 2.044 ± 0.68
1.362CysLys: 1.362 ± 0.781
3.406CysLeu: 3.406 ± 1.565
0.0CysMet: 0.0 ± 0.0
1.703CysAsn: 1.703 ± 0.558
0.681CysPro: 0.681 ± 0.824
0.681CysGln: 0.681 ± 0.357
0.681CysArg: 0.681 ± 0.504
1.022CysSer: 1.022 ± 0.445
1.362CysThr: 1.362 ± 1.378
1.022CysVal: 1.022 ± 1.499
0.0CysTrp: 0.0 ± 0.0
1.022CysTyr: 1.022 ± 0.536
0.0CysXaa: 0.0 ± 0.0
Asp
4.768AspAla: 4.768 ± 1.339
1.022AspCys: 1.022 ± 1.889
1.703AspAsp: 1.703 ± 0.893
4.087AspGlu: 4.087 ± 1.648
4.428AspPhe: 4.428 ± 1.215
2.725AspGly: 2.725 ± 0.952
0.681AspHis: 0.681 ± 0.357
2.725AspIle: 2.725 ± 1.607
4.768AspLys: 4.768 ± 1.473
4.087AspLeu: 4.087 ± 1.548
0.681AspMet: 0.681 ± 0.357
2.044AspAsn: 2.044 ± 0.716
2.044AspPro: 2.044 ± 0.891
1.362AspGln: 1.362 ± 1.336
1.703AspArg: 1.703 ± 0.558
3.747AspSer: 3.747 ± 1.231
2.725AspThr: 2.725 ± 1.011
3.065AspVal: 3.065 ± 0.957
1.362AspTrp: 1.362 ± 1.224
2.044AspTyr: 2.044 ± 0.723
0.0AspXaa: 0.0 ± 0.0
Glu
4.768GluAla: 4.768 ± 1.923
2.725GluCys: 2.725 ± 0.729
3.406GluAsp: 3.406 ± 1.007
6.131GluGlu: 6.131 ± 2.282
3.065GluPhe: 3.065 ± 1.239
5.109GluGly: 5.109 ± 1.524
2.044GluHis: 2.044 ± 1.072
4.428GluIle: 4.428 ± 1.225
3.406GluLys: 3.406 ± 1.421
5.109GluLeu: 5.109 ± 1.587
2.725GluMet: 2.725 ± 1.429
3.406GluAsn: 3.406 ± 0.806
2.725GluPro: 2.725 ± 0.849
3.065GluGln: 3.065 ± 1.608
4.428GluArg: 4.428 ± 0.674
5.109GluSer: 5.109 ± 1.062
2.384GluThr: 2.384 ± 0.9
2.384GluVal: 2.384 ± 0.629
0.341GluTrp: 0.341 ± 0.179
2.044GluTyr: 2.044 ± 1.116
0.0GluXaa: 0.0 ± 0.0
Phe
4.087PheAla: 4.087 ± 0.661
0.341PheCys: 0.341 ± 0.179
4.087PheAsp: 4.087 ± 0.891
5.79PheGlu: 5.79 ± 2.119
2.044PhePhe: 2.044 ± 0.886
2.725PheGly: 2.725 ± 0.944
0.681PheHis: 0.681 ± 0.357
4.428PheIle: 4.428 ± 1.321
4.428PheLys: 4.428 ± 1.85
5.109PheLeu: 5.109 ± 1.809
1.022PheMet: 1.022 ± 1.376
3.065PheAsn: 3.065 ± 0.799
1.022PhePro: 1.022 ± 0.472
1.022PheGln: 1.022 ± 0.445
5.45PheArg: 5.45 ± 1.543
4.768PheSer: 4.768 ± 1.38
2.384PheThr: 2.384 ± 0.824
3.747PheVal: 3.747 ± 1.187
0.0PheTrp: 0.0 ± 0.0
2.044PheTyr: 2.044 ± 1.072
0.0PheXaa: 0.0 ± 0.0
Gly
2.384GlyAla: 2.384 ± 0.826
1.362GlyCys: 1.362 ± 0.839
5.79GlyAsp: 5.79 ± 1.034
2.384GlyGlu: 2.384 ± 0.629
2.044GlyPhe: 2.044 ± 1.007
4.768GlyGly: 4.768 ± 0.885
0.681GlyHis: 0.681 ± 0.357
3.747GlyIle: 3.747 ± 0.947
5.45GlyLys: 5.45 ± 1.47
6.471GlyLeu: 6.471 ± 1.303
0.681GlyMet: 0.681 ± 0.824
3.747GlyAsn: 3.747 ± 1.065
1.362GlyPro: 1.362 ± 0.715
2.725GlyGln: 2.725 ± 1.302
4.428GlyArg: 4.428 ± 1.037
3.406GlySer: 3.406 ± 1.599
4.087GlyThr: 4.087 ± 1.308
5.79GlyVal: 5.79 ± 1.572
1.362GlyTrp: 1.362 ± 0.505
1.703GlyTyr: 1.703 ± 0.893
0.0GlyXaa: 0.0 ± 0.0
His
0.681HisAla: 0.681 ± 0.504
0.681HisCys: 0.681 ± 0.824
1.362HisAsp: 1.362 ± 0.505
0.681HisGlu: 0.681 ± 0.357
1.022HisPhe: 1.022 ± 0.536
1.022HisGly: 1.022 ± 0.967
0.681HisHis: 0.681 ± 0.357
0.681HisIle: 0.681 ± 0.357
1.022HisLys: 1.022 ± 0.536
1.703HisLeu: 1.703 ± 0.558
0.341HisMet: 0.341 ± 0.726
1.022HisAsn: 1.022 ± 0.536
0.341HisPro: 0.341 ± 0.179
0.341HisGln: 0.341 ± 0.179
0.681HisArg: 0.681 ± 1.109
2.384HisSer: 2.384 ± 0.636
0.681HisThr: 0.681 ± 0.357
1.362HisVal: 1.362 ± 0.505
0.0HisTrp: 0.0 ± 0.0
0.681HisTyr: 0.681 ± 0.488
0.0HisXaa: 0.0 ± 0.0
Ile
4.087IleAla: 4.087 ± 2.531
3.065IleCys: 3.065 ± 1.239
2.384IleAsp: 2.384 ± 1.239
5.45IleGlu: 5.45 ± 1.87
2.384IlePhe: 2.384 ± 1.059
3.065IleGly: 3.065 ± 1.223
1.362IleHis: 1.362 ± 0.715
2.384IleIle: 2.384 ± 0.812
5.45IleLys: 5.45 ± 1.698
4.087IleLeu: 4.087 ± 1.271
0.681IleMet: 0.681 ± 0.488
2.044IleAsn: 2.044 ± 0.609
1.703IlePro: 1.703 ± 1.088
1.362IleGln: 1.362 ± 0.715
4.087IleArg: 4.087 ± 1.126
5.45IleSer: 5.45 ± 0.774
3.747IleThr: 3.747 ± 2.716
3.747IleVal: 3.747 ± 1.693
0.341IleTrp: 0.341 ± 0.179
3.065IleTyr: 3.065 ± 1.398
0.0IleXaa: 0.0 ± 0.0
Lys
4.087LysAla: 4.087 ± 1.516
1.022LysCys: 1.022 ± 0.897
4.087LysAsp: 4.087 ± 1.678
6.471LysGlu: 6.471 ± 1.467
4.768LysPhe: 4.768 ± 0.706
4.087LysGly: 4.087 ± 1.551
1.362LysHis: 1.362 ± 1.008
3.747LysIle: 3.747 ± 1.231
3.065LysLys: 3.065 ± 1.581
6.471LysLeu: 6.471 ± 1.808
1.022LysMet: 1.022 ± 0.689
3.747LysAsn: 3.747 ± 1.534
4.428LysPro: 4.428 ± 1.716
0.0LysGln: 0.0 ± 0.0
3.747LysArg: 3.747 ± 0.732
5.79LysSer: 5.79 ± 1.45
3.406LysThr: 3.406 ± 0.901
5.79LysVal: 5.79 ± 1.07
1.022LysTrp: 1.022 ± 1.083
3.065LysTyr: 3.065 ± 1.581
0.0LysXaa: 0.0 ± 0.0
Leu
4.428LeuAla: 4.428 ± 1.727
1.703LeuCys: 1.703 ± 2.283
5.109LeuAsp: 5.109 ± 1.702
5.45LeuGlu: 5.45 ± 0.88
3.747LeuPhe: 3.747 ± 1.065
4.087LeuGly: 4.087 ± 1.678
0.341LeuHis: 0.341 ± 0.179
6.812LeuIle: 6.812 ± 2.001
7.834LeuLys: 7.834 ± 0.829
6.471LeuLeu: 6.471 ± 1.605
4.087LeuMet: 4.087 ± 0.878
5.79LeuAsn: 5.79 ± 1.686
4.087LeuPro: 4.087 ± 1.541
2.384LeuGln: 2.384 ± 0.693
3.406LeuArg: 3.406 ± 1.34
8.174LeuSer: 8.174 ± 1.695
6.131LeuThr: 6.131 ± 1.475
7.153LeuVal: 7.153 ± 1.418
0.341LeuTrp: 0.341 ± 1.381
3.065LeuTyr: 3.065 ± 1.473
0.0LeuXaa: 0.0 ± 0.0
Met
0.341MetAla: 0.341 ± 0.591
0.341MetCys: 0.341 ± 0.179
1.022MetAsp: 1.022 ± 0.445
2.384MetGlu: 2.384 ± 0.961
1.022MetPhe: 1.022 ± 0.536
2.044MetGly: 2.044 ± 0.886
0.0MetHis: 0.0 ± 0.0
1.703MetIle: 1.703 ± 0.593
1.703MetLys: 1.703 ± 0.593
2.384MetLeu: 2.384 ± 1.201
0.341MetMet: 0.341 ± 0.179
0.341MetAsn: 0.341 ± 0.179
0.681MetPro: 0.681 ± 1.277
0.681MetGln: 0.681 ± 1.032
0.681MetArg: 0.681 ± 0.824
1.362MetSer: 1.362 ± 0.628
0.681MetThr: 0.681 ± 1.277
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.341MetTyr: 0.341 ± 0.179
0.0MetXaa: 0.0 ± 0.0
Asn
1.362AsnAla: 1.362 ± 0.715
2.044AsnCys: 2.044 ± 1.626
1.703AsnAsp: 1.703 ± 0.893
3.747AsnGlu: 3.747 ± 1.188
4.428AsnPhe: 4.428 ± 1.011
3.065AsnGly: 3.065 ± 1.173
0.681AsnHis: 0.681 ± 0.504
1.703AsnIle: 1.703 ± 0.964
2.384AsnLys: 2.384 ± 1.598
7.153AsnLeu: 7.153 ± 1.82
0.681AsnMet: 0.681 ± 0.357
2.384AsnAsn: 2.384 ± 0.858
1.703AsnPro: 1.703 ± 0.593
0.0AsnGln: 0.0 ± 0.0
4.768AsnArg: 4.768 ± 1.47
2.725AsnSer: 2.725 ± 1.012
1.703AsnThr: 1.703 ± 0.691
1.362AsnVal: 1.362 ± 0.843
0.341AsnTrp: 0.341 ± 0.179
3.065AsnTyr: 3.065 ± 0.808
0.0AsnXaa: 0.0 ± 0.0
Pro
1.703ProAla: 1.703 ± 0.653
1.022ProCys: 1.022 ± 0.536
2.044ProAsp: 2.044 ± 1.183
1.703ProGlu: 1.703 ± 0.593
0.681ProPhe: 0.681 ± 0.357
1.703ProGly: 1.703 ± 0.917
1.022ProHis: 1.022 ± 0.653
2.044ProIle: 2.044 ± 0.68
3.065ProLys: 3.065 ± 0.928
3.747ProLeu: 3.747 ± 1.621
0.0ProMet: 0.0 ± 0.0
2.044ProAsn: 2.044 ± 1.795
2.044ProPro: 2.044 ± 2.009
3.065ProGln: 3.065 ± 2.269
2.384ProArg: 2.384 ± 0.985
1.703ProSer: 1.703 ± 2.166
2.044ProThr: 2.044 ± 1.072
2.044ProVal: 2.044 ± 0.944
0.341ProTrp: 0.341 ± 0.179
1.022ProTyr: 1.022 ± 0.653
0.0ProXaa: 0.0 ± 0.0
Gln
1.703GlnAla: 1.703 ± 1.772
0.681GlnCys: 0.681 ± 0.357
1.362GlnAsp: 1.362 ± 0.715
2.044GlnGlu: 2.044 ± 1.116
1.022GlnPhe: 1.022 ± 0.445
1.703GlnGly: 1.703 ± 0.593
1.022GlnHis: 1.022 ± 1.542
1.703GlnIle: 1.703 ± 0.691
0.341GlnLys: 0.341 ± 0.179
2.384GlnLeu: 2.384 ± 1.81
0.0GlnMet: 0.0 ± 0.0
1.022GlnAsn: 1.022 ± 0.445
0.341GlnPro: 0.341 ± 0.179
1.362GlnGln: 1.362 ± 1.67
1.362GlnArg: 1.362 ± 0.715
3.747GlnSer: 3.747 ± 0.842
1.362GlnThr: 1.362 ± 1.008
3.747GlnVal: 3.747 ± 1.033
0.341GlnTrp: 0.341 ± 0.179
1.022GlnTyr: 1.022 ± 0.536
0.0GlnXaa: 0.0 ± 0.0
Arg
5.109ArgAla: 5.109 ± 1.636
1.703ArgCys: 1.703 ± 1.141
4.087ArgAsp: 4.087 ± 1.077
3.065ArgGlu: 3.065 ± 0.799
5.45ArgPhe: 5.45 ± 1.958
4.768ArgGly: 4.768 ± 0.92
1.362ArgHis: 1.362 ± 1.355
2.725ArgIle: 2.725 ± 1.012
2.044ArgLys: 2.044 ± 0.609
4.087ArgLeu: 4.087 ± 0.631
1.362ArgMet: 1.362 ± 0.777
1.703ArgAsn: 1.703 ± 0.593
2.384ArgPro: 2.384 ± 1.696
0.341ArgGln: 0.341 ± 0.179
6.131ArgArg: 6.131 ± 1.801
4.087ArgSer: 4.087 ± 1.161
1.362ArgThr: 1.362 ± 1.123
4.428ArgVal: 4.428 ± 1.457
1.022ArgTrp: 1.022 ± 0.536
2.384ArgTyr: 2.384 ± 0.693
0.0ArgXaa: 0.0 ± 0.0
Ser
3.747SerAla: 3.747 ± 1.726
0.681SerCys: 0.681 ± 1.63
3.065SerAsp: 3.065 ± 1.416
3.065SerGlu: 3.065 ± 0.804
4.768SerPhe: 4.768 ± 1.572
7.834SerGly: 7.834 ± 1.134
1.703SerHis: 1.703 ± 0.558
3.747SerIle: 3.747 ± 1.294
8.174SerLys: 8.174 ± 0.909
4.768SerLeu: 4.768 ± 1.599
1.703SerMet: 1.703 ± 1.091
2.384SerAsn: 2.384 ± 2.968
3.065SerPro: 3.065 ± 0.731
3.065SerGln: 3.065 ± 0.869
4.768SerArg: 4.768 ± 0.885
6.812SerSer: 6.812 ± 1.935
4.428SerThr: 4.428 ± 1.248
6.471SerVal: 6.471 ± 1.71
0.341SerTrp: 0.341 ± 0.179
2.725SerTyr: 2.725 ± 0.703
0.0SerXaa: 0.0 ± 0.0
Thr
4.087ThrAla: 4.087 ± 1.439
0.681ThrCys: 0.681 ± 0.357
1.362ThrAsp: 1.362 ± 0.715
3.747ThrGlu: 3.747 ± 0.531
6.471ThrPhe: 6.471 ± 1.295
4.087ThrGly: 4.087 ± 1.363
1.703ThrHis: 1.703 ± 0.893
2.044ThrIle: 2.044 ± 1.007
3.406ThrLys: 3.406 ± 1.452
5.109ThrLeu: 5.109 ± 1.541
0.681ThrMet: 0.681 ± 0.504
2.384ThrAsn: 2.384 ± 1.237
0.341ThrPro: 0.341 ± 0.826
0.341ThrGln: 0.341 ± 0.179
3.406ThrArg: 3.406 ± 0.888
3.747ThrSer: 3.747 ± 1.277
3.065ThrThr: 3.065 ± 1.171
4.768ThrVal: 4.768 ± 1.725
0.341ThrTrp: 0.341 ± 0.179
1.362ThrTyr: 1.362 ± 2.149
0.0ThrXaa: 0.0 ± 0.0
Val
2.725ValAla: 2.725 ± 0.681
3.747ValCys: 3.747 ± 0.614
3.406ValAsp: 3.406 ± 1.755
4.087ValGlu: 4.087 ± 1.432
2.725ValPhe: 2.725 ± 1.429
3.747ValGly: 3.747 ± 2.017
0.341ValHis: 0.341 ± 0.584
5.45ValIle: 5.45 ± 3.132
3.747ValLys: 3.747 ± 1.339
7.834ValLeu: 7.834 ± 1.91
0.341ValMet: 0.341 ± 0.179
3.065ValAsn: 3.065 ± 0.766
3.065ValPro: 3.065 ± 2.647
2.384ValGln: 2.384 ± 1.251
3.406ValArg: 3.406 ± 1.412
4.768ValSer: 4.768 ± 1.47
6.131ValThr: 6.131 ± 3.122
3.406ValVal: 3.406 ± 2.517
0.341ValTrp: 0.341 ± 0.591
1.703ValTyr: 1.703 ± 1.272
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.341TrpCys: 0.341 ± 0.179
0.681TrpAsp: 0.681 ± 0.824
0.0TrpGlu: 0.0 ± 0.0
0.681TrpPhe: 0.681 ± 0.357
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.681TrpLys: 0.681 ± 0.357
1.022TrpLeu: 1.022 ± 0.653
0.0TrpMet: 0.0 ± 0.0
1.362TrpAsn: 1.362 ± 0.505
0.0TrpPro: 0.0 ± 0.0
0.341TrpGln: 0.341 ± 0.591
0.341TrpArg: 0.341 ± 0.179
1.022TrpSer: 1.022 ± 0.472
0.0TrpThr: 0.0 ± 0.0
1.022TrpVal: 1.022 ± 1.192
0.0TrpTrp: 0.0 ± 0.0
1.022TrpTyr: 1.022 ± 0.472
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.703TyrAla: 1.703 ± 0.728
1.362TyrCys: 1.362 ± 1.446
1.703TyrAsp: 1.703 ± 0.893
3.406TyrGlu: 3.406 ± 1.381
1.703TyrPhe: 1.703 ± 0.558
2.384TyrGly: 2.384 ± 1.049
0.681TyrHis: 0.681 ± 0.357
3.406TyrIle: 3.406 ± 1.335
2.725TyrLys: 2.725 ± 1.105
3.065TyrLeu: 3.065 ± 1.018
0.681TyrMet: 0.681 ± 1.181
1.022TyrAsn: 1.022 ± 0.78
1.703TyrPro: 1.703 ± 0.558
1.362TyrGln: 1.362 ± 0.472
1.703TyrArg: 1.703 ± 0.593
3.747TyrSer: 3.747 ± 0.732
1.703TyrThr: 1.703 ± 1.344
1.703TyrVal: 1.703 ± 0.653
0.341TyrTrp: 0.341 ± 0.179
1.703TyrTyr: 1.703 ± 0.893
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2937 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski