Amino acid dipepetide frequency for Thermus virus IN93

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.69AlaAla: 12.69 ± 1.977
0.846AlaCys: 0.846 ± 0.391
3.046AlaAsp: 3.046 ± 0.67
3.723AlaGlu: 3.723 ± 0.945
5.076AlaPhe: 5.076 ± 1.038
8.799AlaGly: 8.799 ± 1.279
2.03AlaHis: 2.03 ± 0.589
3.723AlaIle: 3.723 ± 0.753
4.23AlaLys: 4.23 ± 0.828
16.244AlaLeu: 16.244 ± 1.749
2.369AlaMet: 2.369 ± 0.492
0.846AlaAsn: 0.846 ± 0.279
6.43AlaPro: 6.43 ± 1.327
7.614AlaGln: 7.614 ± 1.416
10.152AlaArg: 10.152 ± 1.211
4.738AlaSer: 4.738 ± 0.846
4.907AlaThr: 4.907 ± 0.781
8.122AlaVal: 8.122 ± 1.315
3.723AlaTrp: 3.723 ± 0.677
4.23AlaTyr: 4.23 ± 0.97
0.0AlaXaa: 0.0 ± 0.0
Cys
0.677CysAla: 0.677 ± 0.356
0.0CysCys: 0.0 ± 0.0
0.338CysAsp: 0.338 ± 0.228
0.846CysGlu: 0.846 ± 0.448
0.169CysPhe: 0.169 ± 0.182
1.523CysGly: 1.523 ± 0.559
0.169CysHis: 0.169 ± 0.166
0.0CysIle: 0.0 ± 0.0
0.169CysLys: 0.169 ± 0.18
0.508CysLeu: 0.508 ± 0.283
0.338CysMet: 0.338 ± 0.285
0.0CysAsn: 0.0 ± 0.0
1.523CysPro: 1.523 ± 0.476
0.677CysGln: 0.677 ± 0.351
0.338CysArg: 0.338 ± 0.207
0.338CysSer: 0.338 ± 0.226
0.169CysThr: 0.169 ± 0.155
0.338CysVal: 0.338 ± 0.23
0.169CysTrp: 0.169 ± 0.143
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.046AspAla: 3.046 ± 0.75
1.015AspCys: 1.015 ± 0.452
1.692AspAsp: 1.692 ± 0.516
1.015AspGlu: 1.015 ± 0.397
1.692AspPhe: 1.692 ± 0.484
5.076AspGly: 5.076 ± 0.794
0.338AspHis: 0.338 ± 0.286
2.538AspIle: 2.538 ± 0.59
0.338AspLys: 0.338 ± 0.218
5.076AspLeu: 5.076 ± 0.824
0.846AspMet: 0.846 ± 0.285
0.677AspAsn: 0.677 ± 0.34
4.738AspPro: 4.738 ± 0.917
1.692AspGln: 1.692 ± 0.473
2.2AspArg: 2.2 ± 0.711
2.2AspSer: 2.2 ± 0.604
2.707AspThr: 2.707 ± 0.472
4.061AspVal: 4.061 ± 0.805
1.184AspTrp: 1.184 ± 0.538
1.015AspTyr: 1.015 ± 0.308
0.0AspXaa: 0.0 ± 0.0
Glu
11.844GluAla: 11.844 ± 1.827
0.508GluCys: 0.508 ± 0.311
4.061GluAsp: 4.061 ± 0.759
6.43GluGlu: 6.43 ± 1.634
1.692GluPhe: 1.692 ± 0.628
5.076GluGly: 5.076 ± 0.932
0.508GluHis: 0.508 ± 0.296
2.03GluIle: 2.03 ± 0.715
1.523GluLys: 1.523 ± 0.539
4.738GluLeu: 4.738 ± 0.901
1.184GluMet: 1.184 ± 0.431
1.523GluAsn: 1.523 ± 0.407
3.046GluPro: 3.046 ± 0.831
0.846GluGln: 0.846 ± 0.325
5.245GluArg: 5.245 ± 0.979
1.354GluSer: 1.354 ± 0.364
1.523GluThr: 1.523 ± 0.419
7.107GluVal: 7.107 ± 1.075
1.015GluTrp: 1.015 ± 0.479
2.2GluTyr: 2.2 ± 0.628
0.0GluXaa: 0.0 ± 0.0
Phe
3.384PheAla: 3.384 ± 0.707
0.338PheCys: 0.338 ± 0.23
1.184PheAsp: 1.184 ± 0.433
1.184PheGlu: 1.184 ± 0.391
0.846PhePhe: 0.846 ± 0.333
2.2PheGly: 2.2 ± 0.69
1.184PheHis: 1.184 ± 0.476
0.846PheIle: 0.846 ± 0.375
0.338PheLys: 0.338 ± 0.203
3.892PheLeu: 3.892 ± 0.834
0.338PheMet: 0.338 ± 0.236
0.846PheAsn: 0.846 ± 0.348
2.876PhePro: 2.876 ± 0.687
1.015PheGln: 1.015 ± 0.394
2.707PheArg: 2.707 ± 0.732
1.692PheSer: 1.692 ± 0.424
2.369PheThr: 2.369 ± 0.747
1.184PheVal: 1.184 ± 0.432
0.677PheTrp: 0.677 ± 0.293
0.338PheTyr: 0.338 ± 0.194
0.0PheXaa: 0.0 ± 0.0
Gly
6.937GlyAla: 6.937 ± 1.112
0.846GlyCys: 0.846 ± 0.374
4.399GlyAsp: 4.399 ± 0.768
6.261GlyGlu: 6.261 ± 1.513
2.369GlyPhe: 2.369 ± 0.704
10.998GlyGly: 10.998 ± 1.742
1.861GlyHis: 1.861 ± 0.561
2.538GlyIle: 2.538 ± 0.551
3.384GlyLys: 3.384 ± 0.738
9.306GlyLeu: 9.306 ± 1.215
2.03GlyMet: 2.03 ± 0.609
1.184GlyAsn: 1.184 ± 0.5
4.738GlyPro: 4.738 ± 0.806
4.399GlyGln: 4.399 ± 1.091
9.137GlyArg: 9.137 ± 1.313
6.261GlySer: 6.261 ± 1.122
4.907GlyThr: 4.907 ± 1.287
7.445GlyVal: 7.445 ± 1.096
1.692GlyTrp: 1.692 ± 0.548
3.215GlyTyr: 3.215 ± 0.706
0.0GlyXaa: 0.0 ± 0.0
His
1.692HisAla: 1.692 ± 0.378
0.0HisCys: 0.0 ± 0.0
0.677HisAsp: 0.677 ± 0.319
0.169HisGlu: 0.169 ± 0.164
1.015HisPhe: 1.015 ± 0.459
1.015HisGly: 1.015 ± 0.432
0.677HisHis: 0.677 ± 0.406
0.677HisIle: 0.677 ± 0.377
0.677HisLys: 0.677 ± 0.333
2.2HisLeu: 2.2 ± 0.632
0.169HisMet: 0.169 ± 0.162
0.0HisAsn: 0.0 ± 0.0
1.354HisPro: 1.354 ± 0.551
0.0HisGln: 0.0 ± 0.0
1.184HisArg: 1.184 ± 0.652
0.508HisSer: 0.508 ± 0.266
1.015HisThr: 1.015 ± 0.421
2.2HisVal: 2.2 ± 0.66
0.169HisTrp: 0.169 ± 0.16
0.508HisTyr: 0.508 ± 0.288
0.0HisXaa: 0.0 ± 0.0
Ile
3.723IleAla: 3.723 ± 0.678
0.508IleCys: 0.508 ± 0.267
1.184IleAsp: 1.184 ± 0.492
2.03IleGlu: 2.03 ± 0.431
0.338IlePhe: 0.338 ± 0.227
3.046IleGly: 3.046 ± 0.72
0.508IleHis: 0.508 ± 0.263
1.015IleIle: 1.015 ± 0.518
1.184IleLys: 1.184 ± 0.429
3.892IleLeu: 3.892 ± 0.914
0.169IleMet: 0.169 ± 0.167
1.354IleAsn: 1.354 ± 0.55
2.03IlePro: 2.03 ± 0.645
1.523IleGln: 1.523 ± 0.522
3.215IleArg: 3.215 ± 0.7
1.861IleSer: 1.861 ± 0.707
1.523IleThr: 1.523 ± 0.811
2.538IleVal: 2.538 ± 0.65
0.169IleTrp: 0.169 ± 0.172
1.861IleTyr: 1.861 ± 0.454
0.0IleXaa: 0.0 ± 0.0
Lys
3.723LysAla: 3.723 ± 0.83
0.169LysCys: 0.169 ± 0.179
1.692LysAsp: 1.692 ± 0.401
2.2LysGlu: 2.2 ± 0.525
0.169LysPhe: 0.169 ± 0.17
3.723LysGly: 3.723 ± 0.642
0.338LysHis: 0.338 ± 0.232
0.169LysIle: 0.169 ± 0.17
0.677LysLys: 0.677 ± 0.252
2.876LysLeu: 2.876 ± 0.661
1.015LysMet: 1.015 ± 0.344
1.184LysAsn: 1.184 ± 0.498
2.369LysPro: 2.369 ± 0.72
1.184LysGln: 1.184 ± 0.387
3.892LysArg: 3.892 ± 0.768
1.354LysSer: 1.354 ± 0.461
1.523LysThr: 1.523 ± 0.5
2.2LysVal: 2.2 ± 0.588
0.508LysTrp: 0.508 ± 0.317
0.338LysTyr: 0.338 ± 0.242
0.0LysXaa: 0.0 ± 0.0
Leu
13.875LeuAla: 13.875 ± 1.602
0.677LeuCys: 0.677 ± 0.382
5.076LeuAsp: 5.076 ± 0.639
8.122LeuGlu: 8.122 ± 1.149
2.876LeuPhe: 2.876 ± 0.524
10.321LeuGly: 10.321 ± 1.485
1.354LeuHis: 1.354 ± 0.37
3.215LeuIle: 3.215 ± 0.911
4.061LeuLys: 4.061 ± 0.617
9.306LeuLeu: 9.306 ± 1.361
2.2LeuMet: 2.2 ± 0.69
2.538LeuAsn: 2.538 ± 0.556
5.584LeuPro: 5.584 ± 0.944
3.384LeuGln: 3.384 ± 0.751
10.321LeuArg: 10.321 ± 1.653
2.538LeuSer: 2.538 ± 0.751
3.892LeuThr: 3.892 ± 0.71
8.46LeuVal: 8.46 ± 1.179
2.876LeuTrp: 2.876 ± 0.726
3.215LeuTyr: 3.215 ± 0.745
0.0LeuXaa: 0.0 ± 0.0
Met
2.707MetAla: 2.707 ± 0.745
0.0MetCys: 0.0 ± 0.0
1.354MetAsp: 1.354 ± 0.355
1.354MetGlu: 1.354 ± 0.469
0.338MetPhe: 0.338 ± 0.23
2.2MetGly: 2.2 ± 0.597
0.169MetHis: 0.169 ± 0.164
0.169MetIle: 0.169 ± 0.176
0.846MetLys: 0.846 ± 0.358
1.523MetLeu: 1.523 ± 0.402
0.169MetMet: 0.169 ± 0.156
0.508MetAsn: 0.508 ± 0.287
1.354MetPro: 1.354 ± 0.437
0.846MetGln: 0.846 ± 0.36
2.03MetArg: 2.03 ± 0.498
0.846MetSer: 0.846 ± 0.343
1.692MetThr: 1.692 ± 0.496
1.523MetVal: 1.523 ± 0.509
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.2AsnAla: 2.2 ± 0.61
0.338AsnCys: 0.338 ± 0.215
1.015AsnAsp: 1.015 ± 0.363
1.015AsnGlu: 1.015 ± 0.497
0.508AsnPhe: 0.508 ± 0.347
1.861AsnGly: 1.861 ± 0.657
0.0AsnHis: 0.0 ± 0.0
0.338AsnIle: 0.338 ± 0.215
0.677AsnLys: 0.677 ± 0.304
3.723AsnLeu: 3.723 ± 0.668
0.677AsnMet: 0.677 ± 0.367
1.184AsnAsn: 1.184 ± 0.571
2.538AsnPro: 2.538 ± 0.642
1.184AsnGln: 1.184 ± 0.483
1.523AsnArg: 1.523 ± 0.421
1.523AsnSer: 1.523 ± 0.741
0.846AsnThr: 0.846 ± 0.366
0.508AsnVal: 0.508 ± 0.293
0.338AsnTrp: 0.338 ± 0.211
0.846AsnTyr: 0.846 ± 0.399
0.0AsnXaa: 0.0 ± 0.0
Pro
7.276ProAla: 7.276 ± 1.367
0.508ProCys: 0.508 ± 0.282
3.553ProAsp: 3.553 ± 0.745
5.584ProGlu: 5.584 ± 1.106
1.861ProPhe: 1.861 ± 0.585
7.445ProGly: 7.445 ± 0.929
1.184ProHis: 1.184 ± 0.48
1.861ProIle: 1.861 ± 0.616
2.2ProLys: 2.2 ± 0.59
6.937ProLeu: 6.937 ± 1.269
1.861ProMet: 1.861 ± 0.548
1.523ProAsn: 1.523 ± 0.611
7.445ProPro: 7.445 ± 1.713
3.046ProGln: 3.046 ± 0.731
4.23ProArg: 4.23 ± 1.137
4.399ProSer: 4.399 ± 0.857
3.553ProThr: 3.553 ± 1.188
5.415ProVal: 5.415 ± 1.029
1.523ProTrp: 1.523 ± 0.395
1.692ProTyr: 1.692 ± 0.604
0.0ProXaa: 0.0 ± 0.0
Gln
6.599GlnAla: 6.599 ± 1.134
0.338GlnCys: 0.338 ± 0.231
2.03GlnAsp: 2.03 ± 0.571
2.538GlnGlu: 2.538 ± 0.688
1.354GlnPhe: 1.354 ± 0.492
3.215GlnGly: 3.215 ± 0.641
0.169GlnHis: 0.169 ± 0.164
3.046GlnIle: 3.046 ± 0.764
1.861GlnLys: 1.861 ± 0.476
2.03GlnLeu: 2.03 ± 0.637
0.846GlnMet: 0.846 ± 0.327
1.354GlnAsn: 1.354 ± 0.463
2.876GlnPro: 2.876 ± 0.752
1.692GlnGln: 1.692 ± 0.471
2.369GlnArg: 2.369 ± 0.542
2.707GlnSer: 2.707 ± 0.705
2.2GlnThr: 2.2 ± 0.569
2.876GlnVal: 2.876 ± 0.784
0.338GlnTrp: 0.338 ± 0.233
1.354GlnTyr: 1.354 ± 0.458
0.0GlnXaa: 0.0 ± 0.0
Arg
8.799ArgAla: 8.799 ± 1.373
0.338ArgCys: 0.338 ± 0.21
3.215ArgAsp: 3.215 ± 0.671
6.261ArgGlu: 6.261 ± 1.387
2.03ArgPhe: 2.03 ± 0.571
4.907ArgGly: 4.907 ± 0.761
2.2ArgHis: 2.2 ± 0.816
4.061ArgIle: 4.061 ± 0.595
2.876ArgLys: 2.876 ± 0.808
9.645ArgLeu: 9.645 ± 1.411
1.184ArgMet: 1.184 ± 0.407
1.692ArgAsn: 1.692 ± 0.597
6.768ArgPro: 6.768 ± 1.178
3.384ArgGln: 3.384 ± 0.746
7.445ArgArg: 7.445 ± 1.371
1.523ArgSer: 1.523 ± 0.465
2.876ArgThr: 2.876 ± 0.639
9.137ArgVal: 9.137 ± 1.239
1.523ArgTrp: 1.523 ± 0.438
3.384ArgTyr: 3.384 ± 0.791
0.0ArgXaa: 0.0 ± 0.0
Ser
2.538SerAla: 2.538 ± 0.513
0.338SerCys: 0.338 ± 0.218
2.03SerAsp: 2.03 ± 0.623
2.707SerGlu: 2.707 ± 0.681
1.354SerPhe: 1.354 ± 0.431
7.107SerGly: 7.107 ± 1.128
0.677SerHis: 0.677 ± 0.309
1.523SerIle: 1.523 ± 0.583
1.015SerLys: 1.015 ± 0.378
2.369SerLeu: 2.369 ± 0.67
1.015SerMet: 1.015 ± 0.389
1.692SerAsn: 1.692 ± 0.722
3.215SerPro: 3.215 ± 0.741
2.03SerGln: 2.03 ± 0.673
2.538SerArg: 2.538 ± 0.774
4.907SerSer: 4.907 ± 1.356
2.369SerThr: 2.369 ± 0.917
3.046SerVal: 3.046 ± 0.653
1.354SerTrp: 1.354 ± 0.45
1.354SerTyr: 1.354 ± 0.509
0.0SerXaa: 0.0 ± 0.0
Thr
3.892ThrAla: 3.892 ± 1.205
0.508ThrCys: 0.508 ± 0.275
1.861ThrAsp: 1.861 ± 0.656
2.538ThrGlu: 2.538 ± 0.843
1.184ThrPhe: 1.184 ± 0.436
5.245ThrGly: 5.245 ± 0.974
0.677ThrHis: 0.677 ± 0.392
1.692ThrIle: 1.692 ± 0.515
1.184ThrLys: 1.184 ± 0.493
3.892ThrLeu: 3.892 ± 0.653
0.677ThrMet: 0.677 ± 0.399
2.03ThrAsn: 2.03 ± 0.726
5.584ThrPro: 5.584 ± 1.436
1.523ThrGln: 1.523 ± 0.451
2.538ThrArg: 2.538 ± 0.515
2.369ThrSer: 2.369 ± 0.705
1.692ThrThr: 1.692 ± 0.556
2.2ThrVal: 2.2 ± 0.6
1.692ThrTrp: 1.692 ± 0.413
1.861ThrTyr: 1.861 ± 0.508
0.0ThrXaa: 0.0 ± 0.0
Val
10.152ValAla: 10.152 ± 1.183
0.508ValCys: 0.508 ± 0.284
2.707ValAsp: 2.707 ± 0.591
5.922ValGlu: 5.922 ± 1.196
3.046ValPhe: 3.046 ± 0.702
6.091ValGly: 6.091 ± 0.763
1.015ValHis: 1.015 ± 0.497
2.876ValIle: 2.876 ± 0.749
2.707ValLys: 2.707 ± 0.622
8.968ValLeu: 8.968 ± 0.946
1.861ValMet: 1.861 ± 0.504
1.184ValAsn: 1.184 ± 0.369
6.091ValPro: 6.091 ± 0.796
2.876ValGln: 2.876 ± 0.915
7.445ValArg: 7.445 ± 1.067
2.369ValSer: 2.369 ± 0.649
2.369ValThr: 2.369 ± 0.668
6.091ValVal: 6.091 ± 0.911
2.707ValTrp: 2.707 ± 0.794
1.861ValTyr: 1.861 ± 0.683
0.0ValXaa: 0.0 ± 0.0
Trp
3.046TrpAla: 3.046 ± 0.648
0.338TrpCys: 0.338 ± 0.221
0.846TrpAsp: 0.846 ± 0.365
2.03TrpGlu: 2.03 ± 0.838
0.338TrpPhe: 0.338 ± 0.238
1.692TrpGly: 1.692 ± 0.468
0.338TrpHis: 0.338 ± 0.332
1.015TrpIle: 1.015 ± 0.341
0.508TrpLys: 0.508 ± 0.303
2.538TrpLeu: 2.538 ± 0.646
0.338TrpMet: 0.338 ± 0.243
0.677TrpAsn: 0.677 ± 0.279
1.523TrpPro: 1.523 ± 0.477
1.184TrpGln: 1.184 ± 0.449
1.861TrpArg: 1.861 ± 0.643
0.846TrpSer: 0.846 ± 0.331
1.184TrpThr: 1.184 ± 0.443
1.692TrpVal: 1.692 ± 0.666
0.508TrpTrp: 0.508 ± 0.362
0.338TrpTyr: 0.338 ± 0.235
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.738TyrAla: 4.738 ± 0.937
0.169TyrCys: 0.169 ± 0.166
1.184TyrAsp: 1.184 ± 0.438
1.523TyrGlu: 1.523 ± 0.536
1.184TyrPhe: 1.184 ± 0.5
2.538TyrGly: 2.538 ± 0.619
0.508TyrHis: 0.508 ± 0.251
0.338TyrIle: 0.338 ± 0.213
0.846TyrLys: 0.846 ± 0.412
4.23TyrLeu: 4.23 ± 0.672
0.169TyrMet: 0.169 ± 0.168
0.846TyrAsn: 0.846 ± 0.328
1.015TyrPro: 1.015 ± 0.376
1.692TyrGln: 1.692 ± 0.502
2.876TyrArg: 2.876 ± 0.726
0.846TyrSer: 0.846 ± 0.319
1.523TyrThr: 1.523 ± 0.679
2.707TyrVal: 2.707 ± 0.491
0.677TyrTrp: 0.677 ± 0.426
1.523TyrTyr: 1.523 ± 0.61
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 39 proteins (5911 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski