Amino acid dipeptide frequency for Microviridae phi-CA82

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
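As an illustration, dipeptide frequencies can be counted roughly as follows. This is only a minimal sketch, not the code used to build this page: the function name dipeptide_per_mille is hypothetical, sequences are assumed to be given in one-letter code, pairs are assumed to overlap and never to span two proteins, and every non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping pairs (0,1), (1,2), ...; pairs never cross protein boundaries.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences: 4 + 3 = 7 dipeptides in total, "KK" occurs twice.
freqs = dipeptide_per_mille(["MKKLA", "AKKB"])
print(round(freqs["KK"], 3))
```

Whether the values on this page were counted with exactly these conventions is not stated here, so the numbers produced by such a sketch may differ slightly from the table.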

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
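The expected value and the over/under classification can be sketched as below. The helper names expected_per_mille and classify are hypothetical, and the comparison is a plain threshold on the uniform expectation; whether the colouring on this page also takes the error estimate into account is not stated, so treat this as an assumption.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Compare an observed frequency with the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(expected_per_mille(20))            # 2.5 (i.e. 0.25%)
print(round(expected_per_mille(21), 3))  # 2.268 when Xaa is included
print(classify(13.189))                  # AsnLys from the table: over
print(classify(0.6))                     # AlaCys from the table: under
```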

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.6 ± 0.526
AlaAsp: 4.796 ± 1.926
AlaGlu: 5.396 ± 1.463
AlaPhe: 1.199 ± 0.808
AlaGly: 2.998 ± 1.177
AlaHis: 0.0 ± 0.0
AlaIle: 3.597 ± 1.724
AlaLys: 5.396 ± 1.133
AlaLeu: 3.597 ± 1.568
AlaMet: 1.799 ± 0.839
AlaAsn: 5.396 ± 2.238
AlaPro: 2.398 ± 1.179
AlaGln: 4.796 ± 4.327
AlaArg: 3.597 ± 0.983
AlaSer: 5.396 ± 3.421
AlaThr: 5.995 ± 2.726
AlaVal: 1.799 ± 0.957
AlaTrp: 2.398 ± 0.605
AlaTyr: 2.398 ± 1.347
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.6 ± 0.802
CysAsp: 1.199 ± 0.495
CysGlu: 0.0 ± 0.0
CysPhe: 1.199 ± 1.604
CysGly: 2.398 ± 0.843
CysHis: 0.6 ± 0.526
CysIle: 1.799 ± 0.734
CysLys: 1.199 ± 0.713
CysLeu: 1.199 ± 0.495
CysMet: 0.6 ± 0.526
CysAsn: 1.199 ± 0.74
CysPro: 0.0 ± 0.0
CysGln: 0.6 ± 0.526
CysArg: 0.6 ± 0.77
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.197 ± 1.382
AspCys: 0.0 ± 0.0
AspAsp: 2.398 ± 1.439
AspGlu: 1.799 ± 1.228
AspPhe: 0.6 ± 0.526
AspGly: 4.197 ± 1.662
AspHis: 0.6 ± 0.568
AspIle: 6.595 ± 2.165
AspLys: 4.197 ± 0.993
AspLeu: 3.597 ± 2.259
AspMet: 0.6 ± 0.404
AspAsn: 7.794 ± 1.925
AspPro: 4.796 ± 2.355
AspGln: 0.6 ± 0.526
AspArg: 1.799 ± 1.069
AspSer: 2.398 ± 0.647
AspThr: 3.597 ± 0.86
AspVal: 1.199 ± 0.808
AspTrp: 1.799 ± 0.816
AspTyr: 4.796 ± 2.261
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.799 ± 0.922
GluCys: 1.199 ± 0.713
GluAsp: 3.597 ± 1.082
GluGlu: 1.799 ± 1.212
GluPhe: 2.398 ± 1.165
GluGly: 1.799 ± 0.734
GluHis: 1.199 ± 0.495
GluIle: 6.595 ± 2.548
GluLys: 4.796 ± 1.618
GluLeu: 4.796 ± 1.281
GluMet: 1.199 ± 0.495
GluAsn: 5.995 ± 2.541
GluPro: 1.199 ± 0.612
GluGln: 3.597 ± 1.617
GluArg: 4.197 ± 1.901
GluSer: 1.199 ± 0.808
GluThr: 3.597 ± 1.845
GluVal: 2.398 ± 0.726
GluTrp: 0.6 ± 0.526
GluTyr: 4.197 ± 0.794
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.398 ± 1.164
PheCys: 1.199 ± 0.495
PheAsp: 0.6 ± 0.792
PheGlu: 0.6 ± 0.568
PhePhe: 1.199 ± 0.723
PheGly: 2.398 ± 1.327
PheHis: 0.0 ± 0.0
PheIle: 1.799 ± 1.579
PheLys: 2.398 ± 2.106
PheLeu: 1.799 ± 1.212
PheMet: 1.799 ± 1.313
PheAsn: 2.398 ± 1.222
PhePro: 1.199 ± 1.158
PheGln: 1.199 ± 1.06
PheArg: 0.6 ± 0.404
PheSer: 2.398 ± 1.484
PheThr: 2.398 ± 1.295
PheVal: 1.799 ± 0.806
PheTrp: 0.0 ± 0.0
PheTyr: 1.199 ± 0.808
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.197 ± 2.37
GlyCys: 0.0 ± 0.0
GlyAsp: 2.398 ± 1.295
GlyGlu: 4.197 ± 1.685
GlyPhe: 4.796 ± 1.527
GlyGly: 2.398 ± 1.22
GlyHis: 1.199 ± 0.713
GlyIle: 7.794 ± 2.348
GlyLys: 4.197 ± 1.369
GlyLeu: 4.796 ± 1.944
GlyMet: 0.6 ± 0.568
GlyAsn: 3.597 ± 1.47
GlyPro: 0.0 ± 0.0
GlyGln: 0.6 ± 0.568
GlyArg: 2.998 ± 1.405
GlySer: 3.597 ± 1.082
GlyThr: 3.597 ± 1.246
GlyVal: 3.597 ± 1.036
GlyTrp: 1.199 ± 0.808
GlyTyr: 2.398 ± 1.079
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.199 ± 0.675
HisCys: 0.6 ± 0.526
HisAsp: 0.6 ± 0.404
HisGlu: 0.6 ± 0.526
HisPhe: 0.6 ± 0.526
HisGly: 1.799 ± 0.857
HisHis: 0.6 ± 0.568
HisIle: 0.6 ± 0.404
HisLys: 1.199 ± 1.158
HisLeu: 1.199 ± 0.713
HisMet: 2.398 ± 0.753
HisAsn: 0.6 ± 0.526
HisPro: 0.0 ± 0.0
HisGln: 1.199 ± 0.495
HisArg: 1.199 ± 0.744
HisSer: 2.398 ± 1.094
HisThr: 1.799 ± 0.986
HisVal: 0.6 ± 0.596
HisTrp: 0.0 ± 0.0
HisTyr: 2.398 ± 2.106
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.396 ± 1.563
IleCys: 0.6 ± 0.568
IleAsp: 2.398 ± 0.915
IleGlu: 6.595 ± 2.894
IlePhe: 1.199 ± 0.705
IleGly: 3.597 ± 1.934
IleHis: 1.799 ± 1.443
IleIle: 2.398 ± 1.362
IleLys: 7.194 ± 1.626
IleLeu: 4.796 ± 1.621
IleMet: 1.199 ± 0.71
IleAsn: 5.995 ± 0.843
IlePro: 1.199 ± 0.725
IleGln: 1.799 ± 0.806
IleArg: 2.398 ± 0.8
IleSer: 1.799 ± 0.957
IleThr: 4.197 ± 2.529
IleVal: 1.799 ± 2.057
IleTrp: 1.199 ± 0.495
IleTyr: 4.197 ± 2.014
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.194 ± 2.265
LysCys: 0.6 ± 0.66
LysAsp: 3.597 ± 1.745
LysGlu: 4.796 ± 1.058
LysPhe: 0.6 ± 0.8
LysGly: 7.194 ± 1.375
LysHis: 2.398 ± 0.756
LysIle: 2.398 ± 1.904
LysLys: 7.194 ± 2.099
LysLeu: 5.995 ± 2.119
LysMet: 2.998 ± 1.544
LysAsn: 4.197 ± 1.516
LysPro: 2.998 ± 1.057
LysGln: 4.197 ± 0.819
LysArg: 2.398 ± 0.953
LysSer: 6.595 ± 1.649
LysThr: 2.998 ± 1.869
LysVal: 4.796 ± 1.558
LysTrp: 2.398 ± 0.605
LysTyr: 5.995 ± 2.013
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.398 ± 1.282
LeuCys: 0.6 ± 0.404
LeuAsp: 3.597 ± 1.469
LeuGlu: 3.597 ± 1.082
LeuPhe: 1.199 ± 0.612
LeuGly: 4.197 ± 1.737
LeuHis: 1.199 ± 0.495
LeuIle: 2.398 ± 0.99
LeuLys: 2.398 ± 0.953
LeuLeu: 5.995 ± 3.369
LeuMet: 2.998 ± 1.266
LeuAsn: 5.396 ± 1.553
LeuPro: 5.995 ± 2.027
LeuGln: 2.998 ± 2.458
LeuArg: 5.396 ± 1.578
LeuSer: 5.995 ± 1.375
LeuThr: 4.796 ± 0.795
LeuVal: 1.799 ± 2.057
LeuTrp: 1.799 ± 0.816
LeuTyr: 4.197 ± 1.187
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.799 ± 0.518
MetCys: 0.6 ± 0.404
MetAsp: 1.799 ± 1.138
MetGlu: 1.199 ± 1.044
MetPhe: 0.6 ± 0.568
MetGly: 1.199 ± 0.495
MetHis: 1.199 ± 0.495
MetIle: 1.199 ± 1.053
MetLys: 2.998 ± 1.597
MetLeu: 1.199 ± 0.744
MetMet: 1.199 ± 0.487
MetAsn: 1.799 ± 0.874
MetPro: 2.398 ± 0.99
MetGln: 4.796 ± 2.556
MetArg: 1.199 ± 0.718
MetSer: 3.597 ± 2.289
MetThr: 4.197 ± 1.205
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 1.199 ± 0.718
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 7.794 ± 4.897
AsnCys: 0.0 ± 0.0
AsnAsp: 2.398 ± 1.218
AsnGlu: 3.597 ± 0.676
AsnPhe: 2.398 ± 1.158
AsnGly: 2.398 ± 0.967
AsnHis: 3.597 ± 1.389
AsnIle: 5.396 ± 0.881
AsnLys: 13.189 ± 2.791
AsnLeu: 6.595 ± 1.924
AsnMet: 3.597 ± 1.308
AsnAsn: 3.597 ± 1.902
AsnPro: 3.597 ± 1.632
AsnGln: 0.6 ± 0.638
AsnArg: 4.796 ± 2.02
AsnSer: 4.796 ± 1.436
AsnThr: 4.197 ± 1.772
AsnVal: 4.197 ± 1.559
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.998 ± 0.898
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.398 ± 0.605
ProCys: 0.6 ± 0.526
ProAsp: 3.597 ± 1.863
ProGlu: 2.998 ± 1.319
ProPhe: 0.6 ± 0.404
ProGly: 1.799 ± 0.734
ProHis: 1.799 ± 0.939
ProIle: 2.398 ± 1.213
ProLys: 1.799 ± 1.265
ProLeu: 3.597 ± 1.276
ProMet: 0.6 ± 0.526
ProAsn: 2.998 ± 1.264
ProPro: 0.6 ± 0.66
ProGln: 2.998 ± 1.451
ProArg: 0.6 ± 0.8
ProSer: 0.6 ± 0.404
ProThr: 2.998 ± 0.888
ProVal: 3.597 ± 1.344
ProTrp: 1.199 ± 0.612
ProTyr: 0.6 ± 0.404
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.398 ± 1.051
GlnCys: 0.6 ± 0.77
GlnAsp: 1.799 ± 0.89
GlnGlu: 5.396 ± 2.115
GlnPhe: 1.799 ± 0.518
GlnGly: 1.799 ± 0.816
GlnHis: 1.799 ± 0.662
GlnIle: 1.799 ± 1.069
GlnLys: 1.799 ± 1.257
GlnLeu: 2.398 ± 1.148
GlnMet: 2.998 ± 2.458
GlnAsn: 6.595 ± 1.772
GlnPro: 0.6 ± 0.404
GlnGln: 1.199 ± 1.053
GlnArg: 3.597 ± 2.595
GlnSer: 0.6 ± 0.638
GlnThr: 2.998 ± 0.585
GlnVal: 0.0 ± 0.0
GlnTrp: 0.6 ± 0.638
GlnTyr: 3.597 ± 1.34
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.998 ± 1.312
ArgCys: 1.199 ± 0.495
ArgAsp: 1.799 ± 0.518
ArgGlu: 1.799 ± 0.89
ArgPhe: 1.199 ± 1.321
ArgGly: 2.998 ± 1.786
ArgHis: 1.199 ± 1.053
ArgIle: 2.998 ± 1.005
ArgLys: 4.796 ± 1.829
ArgLeu: 2.998 ± 1.285
ArgMet: 2.998 ± 0.787
ArgAsn: 2.398 ± 1.193
ArgPro: 1.799 ± 0.734
ArgGln: 1.799 ± 0.881
ArgArg: 1.799 ± 0.876
ArgSer: 2.998 ± 0.761
ArgThr: 2.998 ± 1.056
ArgVal: 2.398 ± 0.647
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.796 ± 1.237
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.995 ± 2.321
SerCys: 0.0 ± 0.0
SerAsp: 4.197 ± 1.248
SerGlu: 5.995 ± 1.287
SerPhe: 1.799 ± 0.848
SerGly: 4.197 ± 0.797
SerHis: 0.0 ± 0.0
SerIle: 1.799 ± 0.867
SerLys: 2.998 ± 1.097
SerLeu: 4.796 ± 1.3
SerMet: 1.199 ± 0.951
SerAsn: 2.998 ± 1.262
SerPro: 1.799 ± 1.212
SerGln: 2.998 ± 1.291
SerArg: 4.197 ± 1.766
SerSer: 2.398 ± 1.514
SerThr: 6.595 ± 2.076
SerVal: 2.398 ± 1.327
SerTrp: 0.6 ± 0.596
SerTyr: 4.796 ± 1.787
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.595 ± 2.15
ThrCys: 0.6 ± 0.404
ThrAsp: 5.396 ± 1.401
ThrGlu: 4.796 ± 1.074
ThrPhe: 1.199 ± 1.225
ThrGly: 4.197 ± 0.949
ThrHis: 0.0 ± 0.0
ThrIle: 2.998 ± 1.413
ThrLys: 4.197 ± 1.406
ThrLeu: 3.597 ± 1.279
ThrMet: 1.799 ± 1.023
ThrAsn: 6.595 ± 2.051
ThrPro: 2.398 ± 1.077
ThrGln: 0.6 ± 0.638
ThrArg: 3.597 ± 1.111
ThrSer: 6.595 ± 1.654
ThrThr: 1.799 ± 0.816
ThrVal: 4.197 ± 1.458
ThrTrp: 1.199 ± 0.675
ThrTyr: 3.597 ± 0.943
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.199 ± 0.725
ValCys: 1.199 ± 1.604
ValAsp: 1.799 ± 0.968
ValGlu: 0.6 ± 0.404
ValPhe: 0.0 ± 0.0
ValGly: 0.6 ± 0.404
ValHis: 0.6 ± 0.404
ValIle: 1.799 ± 1.153
ValLys: 4.197 ± 2.534
ValLeu: 2.398 ± 0.605
ValMet: 0.6 ± 0.617
ValAsn: 1.799 ± 0.847
ValPro: 4.197 ± 1.798
ValGln: 1.799 ± 0.518
ValArg: 1.799 ± 0.775
ValSer: 5.396 ± 2.465
ValThr: 3.597 ± 1.553
ValVal: 0.6 ± 0.404
ValTrp: 1.199 ± 0.808
ValTyr: 2.998 ± 1.036
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.199 ± 0.495
TrpAsp: 2.398 ± 0.605
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.199 ± 0.808
TrpGly: 1.799 ± 0.816
TrpHis: 0.0 ± 0.0
TrpIle: 1.199 ± 0.495
TrpLys: 1.799 ± 0.816
TrpLeu: 1.199 ± 0.808
TrpMet: 0.6 ± 0.638
TrpAsn: 1.799 ± 0.775
TrpPro: 0.0 ± 0.0
TrpGln: 1.199 ± 0.883
TrpArg: 0.0 ± 0.0
TrpSer: 0.6 ± 0.568
TrpThr: 1.199 ± 0.808
TrpVal: 0.6 ± 0.526
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.6 ± 0.404
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.597 ± 1.409
TyrCys: 1.199 ± 1.053
TyrAsp: 7.194 ± 1.946
TyrGlu: 2.398 ± 0.873
TyrPhe: 3.597 ± 1.485
TyrGly: 4.197 ± 0.819
TyrHis: 1.799 ± 0.614
TyrIle: 4.197 ± 1.976
TyrLys: 3.597 ± 0.739
TyrLeu: 2.398 ± 0.996
TyrMet: 1.799 ± 0.614
TyrAsn: 6.595 ± 1.071
TyrPro: 1.199 ± 0.921
TyrGln: 4.796 ± 1.265
TyrArg: 1.199 ± 0.713
TyrSer: 2.398 ± 0.605
TyrThr: 2.998 ± 1.54
TyrVal: 0.6 ± 0.526
TyrTrp: 1.199 ± 0.611
TyrTyr: 1.199 ± 0.808
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (1669 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
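A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resampled frequencies; the page does not state which error measure is used, and the names pair_counts and bootstrap_error are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (bootstrap at the protein level).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteome = ["MKKLAKK", "AAAA", "GKKG"]  # toy data, not the phi-CA82 proteome
mean_kk, err_kk = bootstrap_error(toy_proteome, "KK")
mean_ww, err_ww = bootstrap_error(toy_proteome, "WW")
print(mean_ww, err_ww)  # a dipeptide that never occurs gives 0.0 and 0.0
```

Under these assumptions, a dipeptide that is absent from every protein always yields 0.0 ± 0.0, which is consistent with the zero entries in the table.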

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski