Amino acid dipeptide frequency for Microviridae sp. ctKAt32

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
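A minimal Python sketch of how such a frequency could be computed. It assumes one-letter sequence strings (the table uses three-letter codes) and overlapping adjacent pairs counted within each protein; the function name `dipeptide_permille` and the toy sequences are hypothetical, and the exact counting convention used by the database is not stated here.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein strings."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs inside one protein. Pairs are not formed
        # across protein boundaries (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the proteome described on this page.
freqs = dipeptide_permille(["MKKA", "MKA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The result is expressed in per mille so that it can be compared directly with the values in the table.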

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins at random with equal probability, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
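The arithmetic above can be sketched in Python as follows. The threshold is the uniform expectation of 1/400 for the 400 standard dipeptides; the names `EXPECTED_PERMILLE`, `classify` and `permille_to_fraction` are hypothetical, and treating any value above or below 2.5‰ as over- or underrepresented is an assumption about how the colouring is decided.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be marked red
    if freq_permille < expected:
        return "under"  # would be marked blue
    return "as expected"

# Three values taken from the table on this page.
for name, value in [("SerAla", 10.549), ("AlaCys", 2.11), ("CysCys", 0.0)]:
    print(name, value, classify(value))
```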

For more information, see the sequence space article on Wikipedia.

Each entry below reads dipeptide: frequency ± error, in ‰. Entries are grouped by row (first amino acid); within each group the columns (second amino acid) follow this order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.439 ± 3.536
AlaCys: 2.11 ± 0.83
AlaAsp: 3.516 ± 1.897
AlaGlu: 3.516 ± 1.318
AlaPhe: 2.813 ± 1.253
AlaGly: 8.439 ± 3.031
AlaHis: 2.813 ± 1.187
AlaIle: 1.406 ± 0.842
AlaLys: 6.329 ± 2.753
AlaLeu: 8.439 ± 2.618
AlaMet: 2.813 ± 1.684
AlaAsn: 2.813 ± 1.619
AlaPro: 4.219 ± 2.096
AlaGln: 2.11 ± 1.162
AlaArg: 6.329 ± 1.349
AlaSer: 7.736 ± 3.169
AlaThr: 5.626 ± 2.719
AlaVal: 4.219 ± 1.65
AlaTrp: 0.703 ± 0.63
AlaTyr: 3.516 ± 0.821
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.703 ± 0.459
CysCys: 0.0 ± 0.0
CysAsp: 1.406 ± 1.437
CysGlu: 0.703 ± 0.904
CysPhe: 0.703 ± 0.63
CysGly: 2.11 ± 1.427
CysHis: 0.0 ± 0.0
CysIle: 1.406 ± 1.014
CysLys: 0.0 ± 0.0
CysLeu: 2.11 ± 1.102
CysMet: 0.703 ± 0.63
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.703 ± 0.63
CysSer: 0.703 ± 0.904
CysThr: 0.0 ± 0.0
CysVal: 2.11 ± 0.809
CysTrp: 0.0 ± 0.0
CysTyr: 1.406 ± 0.962
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.219 ± 1.269
AspCys: 0.703 ± 0.63
AspAsp: 2.813 ± 1.215
AspGlu: 3.516 ± 1.537
AspPhe: 4.923 ± 1.719
AspGly: 2.11 ± 0.809
AspHis: 0.703 ± 0.459
AspIle: 2.813 ± 1.304
AspLys: 3.516 ± 2.568
AspLeu: 3.516 ± 1.168
AspMet: 2.11 ± 1.444
AspAsn: 3.516 ± 1.315
AspPro: 2.11 ± 1.421
AspGln: 2.11 ± 0.537
AspArg: 2.813 ± 0.537
AspSer: 2.11 ± 1.156
AspThr: 3.516 ± 2.294
AspVal: 3.516 ± 1.531
AspTrp: 0.0 ± 0.0
AspTyr: 3.516 ± 1.214
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.219 ± 1.089
GluCys: 0.703 ± 0.719
GluAsp: 1.406 ± 0.562
GluGlu: 4.219 ± 1.59
GluPhe: 2.813 ± 0.79
GluGly: 4.219 ± 1.704
GluHis: 0.703 ± 0.459
GluIle: 4.219 ± 1.31
GluLys: 2.813 ± 1.483
GluLeu: 0.703 ± 0.719
GluMet: 0.703 ± 1.347
GluAsn: 1.406 ± 0.562
GluPro: 3.516 ± 2.821
GluGln: 2.11 ± 0.835
GluArg: 4.923 ± 1.172
GluSer: 1.406 ± 0.962
GluThr: 3.516 ± 0.821
GluVal: 4.219 ± 1.537
GluTrp: 0.703 ± 0.459
GluTyr: 4.219 ± 0.995
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.219 ± 1.481
PheCys: 0.0 ± 0.0
PheAsp: 2.11 ± 1.156
PheGlu: 2.11 ± 1.102
PhePhe: 4.219 ± 1.709
PheGly: 3.516 ± 1.595
PheHis: 0.703 ± 0.63
PheIle: 2.813 ± 0.991
PheLys: 2.11 ± 0.948
PheLeu: 2.11 ± 1.049
PheMet: 3.516 ± 1.303
PheAsn: 4.923 ± 0.732
PhePro: 2.11 ± 1.427
PheGln: 3.516 ± 1.192
PheArg: 2.11 ± 1.376
PheSer: 4.923 ± 1.702
PheThr: 3.516 ± 1.214
PheVal: 3.516 ± 1.642
PheTrp: 0.0 ± 0.0
PheTyr: 1.406 ± 0.562
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.736 ± 2.067
GlyCys: 2.11 ± 1.815
GlyAsp: 6.329 ± 0.524
GlyGlu: 4.219 ± 1.631
GlyPhe: 3.516 ± 0.993
GlyGly: 6.329 ± 1.586
GlyHis: 1.406 ± 0.896
GlyIle: 5.626 ± 1.797
GlyLys: 5.626 ± 1.329
GlyLeu: 6.329 ± 2.207
GlyMet: 0.703 ± 0.719
GlyAsn: 3.516 ± 1.435
GlyPro: 3.516 ± 1.214
GlyGln: 2.11 ± 0.948
GlyArg: 0.703 ± 0.63
GlySer: 6.329 ± 1.62
GlyThr: 6.329 ± 2.384
GlyVal: 4.219 ± 2.335
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.813 ± 1.189
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.406 ± 0.562
HisCys: 1.406 ± 0.562
HisAsp: 0.703 ± 0.459
HisGlu: 0.703 ± 0.459
HisPhe: 1.406 ± 0.918
HisGly: 2.813 ± 1.513
HisHis: 0.0 ± 0.0
HisIle: 0.703 ± 0.459
HisLys: 0.703 ± 0.904
HisLeu: 0.703 ± 0.459
HisMet: 0.703 ± 0.73
HisAsn: 0.0 ± 0.0
HisPro: 0.703 ± 0.832
HisGln: 0.0 ± 0.0
HisArg: 0.703 ± 0.459
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 1.406 ± 0.636
HisTrp: 0.0 ± 0.0
HisTyr: 1.406 ± 1.261
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.11 ± 0.732
IleCys: 0.703 ± 0.459
IleAsp: 2.813 ± 1.189
IleGlu: 0.0 ± 0.0
IlePhe: 2.813 ± 1.048
IleGly: 4.219 ± 1.338
IleHis: 0.0 ± 0.0
IleIle: 0.703 ± 0.904
IleLys: 3.516 ± 1.128
IleLeu: 2.11 ± 0.795
IleMet: 0.703 ± 0.459
IleAsn: 4.923 ± 1.34
IlePro: 2.813 ± 1.996
IleGln: 0.703 ± 0.459
IleArg: 0.0 ± 0.0
IleSer: 2.813 ± 1.517
IleThr: 3.516 ± 1.214
IleVal: 2.813 ± 1.943
IleTrp: 2.813 ± 1.189
IleTyr: 2.11 ± 1.289
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.219 ± 1.975
LysCys: 0.0 ± 0.0
LysAsp: 3.516 ± 1.202
LysGlu: 5.626 ± 2.147
LysPhe: 2.11 ± 0.83
LysGly: 6.329 ± 2.521
LysHis: 0.703 ± 0.904
LysIle: 2.813 ± 0.79
LysLys: 3.516 ± 2.032
LysLeu: 2.813 ± 0.984
LysMet: 3.516 ± 1.689
LysAsn: 2.11 ± 1.402
LysPro: 3.516 ± 1.071
LysGln: 1.406 ± 0.891
LysArg: 6.329 ± 2.892
LysSer: 2.11 ± 0.537
LysThr: 2.813 ± 1.213
LysVal: 3.516 ± 1.643
LysTrp: 0.703 ± 0.63
LysTyr: 0.703 ± 0.459
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.032 ± 1.923
LeuCys: 0.703 ± 0.832
LeuAsp: 2.11 ± 1.049
LeuGlu: 2.11 ± 1.402
LeuPhe: 2.813 ± 1.209
LeuGly: 4.923 ± 1.34
LeuHis: 0.703 ± 0.73
LeuIle: 4.923 ± 2.001
LeuLys: 4.923 ± 1.214
LeuLeu: 0.703 ± 0.459
LeuMet: 0.703 ± 0.904
LeuAsn: 3.516 ± 0.993
LeuPro: 5.626 ± 2.246
LeuGln: 4.219 ± 1.295
LeuArg: 3.516 ± 0.821
LeuSer: 6.329 ± 2.101
LeuThr: 2.11 ± 0.537
LeuVal: 4.219 ± 1.469
LeuTrp: 0.703 ± 0.459
LeuTyr: 3.516 ± 1.304
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.813 ± 1.439
MetCys: 0.0 ± 0.0
MetAsp: 5.626 ± 1.607
MetGlu: 2.813 ± 2.131
MetPhe: 3.516 ± 1.214
MetGly: 2.813 ± 2.093
MetHis: 0.0 ± 0.0
MetIle: 2.11 ± 1.461
MetLys: 1.406 ± 0.842
MetLeu: 1.406 ± 0.842
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 2.11 ± 0.795
MetGln: 1.406 ± 0.918
MetArg: 2.813 ± 1.483
MetSer: 4.219 ± 3.195
MetThr: 2.11 ± 1.081
MetVal: 2.813 ± 1.684
MetTrp: 0.0 ± 0.0
MetTyr: 0.703 ± 0.73
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.11 ± 0.537
AsnCys: 1.406 ± 1.261
AsnAsp: 2.11 ± 1.081
AsnGlu: 3.516 ± 0.979
AsnPhe: 1.406 ± 0.562
AsnGly: 3.516 ± 1.174
AsnHis: 0.703 ± 0.459
AsnIle: 2.11 ± 0.951
AsnLys: 0.703 ± 0.459
AsnLeu: 4.219 ± 2.048
AsnMet: 2.11 ± 0.835
AsnAsn: 1.406 ± 0.562
AsnPro: 2.813 ± 0.537
AsnGln: 1.406 ± 0.891
AsnArg: 4.923 ± 2.005
AsnSer: 3.516 ± 1.308
AsnThr: 2.11 ± 1.289
AsnVal: 3.516 ± 1.899
AsnTrp: 1.406 ± 0.636
AsnTyr: 2.813 ± 1.585
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.923 ± 2.628
ProCys: 1.406 ± 1.261
ProAsp: 2.813 ± 0.918
ProGlu: 3.516 ± 0.973
ProPhe: 2.11 ± 1.8
ProGly: 4.923 ± 1.868
ProHis: 0.703 ± 0.63
ProIle: 2.813 ± 1.835
ProLys: 2.813 ± 2.101
ProLeu: 2.11 ± 0.809
ProMet: 2.813 ± 1.439
ProAsn: 1.406 ± 0.562
ProPro: 2.11 ± 0.795
ProGln: 3.516 ± 1.776
ProArg: 1.406 ± 1.261
ProSer: 4.923 ± 1.774
ProThr: 0.0 ± 0.0
ProVal: 7.736 ± 2.258
ProTrp: 0.703 ± 0.459
ProTyr: 0.703 ± 0.459
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.516 ± 0.993
GlnCys: 0.703 ± 0.63
GlnAsp: 2.813 ± 1.602
GlnGlu: 2.813 ± 1.996
GlnPhe: 0.703 ± 0.459
GlnGly: 2.813 ± 1.271
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 2.813 ± 1.189
GlnLeu: 3.516 ± 1.588
GlnMet: 4.219 ± 3.078
GlnAsn: 2.11 ± 0.835
GlnPro: 1.406 ± 0.896
GlnGln: 1.406 ± 0.636
GlnArg: 4.219 ± 1.269
GlnSer: 2.11 ± 1.421
GlnThr: 1.406 ± 0.636
GlnVal: 0.703 ± 0.459
GlnTrp: 0.703 ± 0.63
GlnTyr: 0.703 ± 0.73
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.329 ± 1.622
ArgCys: 0.703 ± 0.63
ArgAsp: 3.516 ± 0.47
ArgGlu: 4.923 ± 1.82
ArgPhe: 1.406 ± 0.918
ArgGly: 2.11 ± 0.809
ArgHis: 2.11 ± 0.862
ArgIle: 0.0 ± 0.0
ArgLys: 3.516 ± 1.87
ArgLeu: 6.329 ± 1.867
ArgMet: 4.923 ± 1.474
ArgAsn: 4.219 ± 2.227
ArgPro: 2.813 ± 1.123
ArgGln: 2.813 ± 1.151
ArgArg: 2.11 ± 0.809
ArgSer: 3.516 ± 1.128
ArgThr: 2.11 ± 0.951
ArgVal: 3.516 ± 1.174
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.219 ± 1.31
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 10.549 ± 4.802
SerCys: 0.703 ± 0.459
SerAsp: 2.813 ± 1.123
SerGlu: 2.11 ± 1.297
SerPhe: 2.813 ± 0.918
SerGly: 5.626 ± 1.564
SerHis: 1.406 ± 0.918
SerIle: 4.219 ± 1.646
SerLys: 2.813 ± 1.439
SerLeu: 7.032 ± 1.656
SerMet: 1.406 ± 0.999
SerAsn: 4.219 ± 1.658
SerPro: 2.813 ± 0.878
SerGln: 1.406 ± 0.636
SerArg: 4.219 ± 1.362
SerSer: 4.219 ± 1.369
SerThr: 2.813 ± 1.835
SerVal: 6.329 ± 2.061
SerTrp: 0.0 ± 0.0
SerTyr: 1.406 ± 0.918
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.626 ± 2.375
ThrCys: 0.703 ± 0.904
ThrAsp: 0.703 ± 0.73
ThrGlu: 2.11 ± 1.156
ThrPhe: 5.626 ± 2.121
ThrGly: 4.219 ± 2.048
ThrHis: 0.0 ± 0.0
ThrIle: 1.406 ± 0.918
ThrLys: 2.11 ± 1.421
ThrLeu: 0.703 ± 0.63
ThrMet: 1.406 ± 0.842
ThrAsn: 2.813 ± 1.145
ThrPro: 2.11 ± 0.835
ThrGln: 3.516 ± 1.168
ThrArg: 3.516 ± 0.973
ThrSer: 2.813 ± 1.189
ThrThr: 3.516 ± 2.294
ThrVal: 2.813 ± 1.513
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.11 ± 0.809
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.516 ± 1.967
ValCys: 0.0 ± 0.0
ValAsp: 2.813 ± 1.123
ValGlu: 4.219 ± 1.483
ValPhe: 4.219 ± 1.183
ValGly: 4.923 ± 1.327
ValHis: 0.703 ± 0.459
ValIle: 1.406 ± 1.064
ValLys: 4.923 ± 2.426
ValLeu: 7.032 ± 1.427
ValMet: 3.516 ± 1.599
ValAsn: 2.813 ± 1.517
ValPro: 7.032 ± 1.931
ValGln: 0.703 ± 0.73
ValArg: 6.329 ± 1.199
ValSer: 5.626 ± 2.48
ValThr: 1.406 ± 1.261
ValVal: 4.923 ± 1.853
ValTrp: 0.0 ± 0.0
ValTyr: 2.813 ± 1.048
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.703 ± 0.63
TrpCys: 0.0 ± 0.0
TrpAsp: 0.703 ± 0.459
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.703 ± 0.459
TrpGly: 0.0 ± 0.0
TrpHis: 1.406 ± 0.918
TrpIle: 0.0 ± 0.0
TrpLys: 2.11 ± 1.891
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.703 ± 0.459
TrpPro: 1.406 ± 0.918
TrpGln: 0.703 ± 0.459
TrpArg: 0.703 ± 0.73
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.516 ± 0.842
TyrCys: 0.703 ± 0.719
TyrAsp: 3.516 ± 1.066
TyrGlu: 0.703 ± 0.459
TyrPhe: 2.813 ± 1.189
TyrGly: 3.516 ± 1.304
TyrHis: 0.703 ± 0.63
TyrIle: 0.703 ± 0.459
TyrLys: 2.11 ± 0.862
TyrLeu: 3.516 ± 0.889
TyrMet: 1.406 ± 0.636
TyrAsn: 1.406 ± 0.918
TyrPro: 0.703 ± 0.459
TyrGln: 3.516 ± 1.897
TyrArg: 2.813 ± 1.209
TyrSer: 3.516 ± 0.821
TyrThr: 1.406 ± 0.562
TyrVal: 2.813 ± 1.959
TyrTrp: 0.703 ± 0.459
TyrTyr: 1.406 ± 1.261
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1423 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
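One way such a protein-level bootstrap could look is sketched below in Python. Proteins (not residues) are resampled with replacement, the dipeptide frequencies are recomputed for each replicate, and the spread across replicates is reported. Reporting the error as a standard deviation, the fixed seed, and the function names `dipeptide_permille` and `bootstrap_error` are assumptions of this sketch; the page only states that bootstrapping (x100) was done at the protein level.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins):
    """Dipeptide frequencies in per mille, pairs counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def bootstrap_error(proteins, n_boot=100, seed=0):
    """Standard deviation of each dipeptide frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        resampled = [rng.choice(proteins) for _ in proteins]
        replicates.append(dipeptide_permille(resampled))
    pairs = set().union(*replicates)
    # A dipeptide missing from a replicate has frequency 0 in that replicate.
    return {
        pair: statistics.pstdev([rep.get(pair, 0.0) for rep in replicates])
        for pair in pairs
    }

# Toy input, not taken from the proteome described on this page.
errors = bootstrap_error(["MKKA", "MKA", "AAKM"])
for pair, err in sorted(errors.items()):
    print(f"{pair}: ± {err:.3f}")
```

With only a handful of proteins, as here (6), the replicates differ strongly from one another, which is consistent with the relatively large errors in the table.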

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski