Amino acid dipeptide frequency for Mycobacterium virus Heldan

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
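As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides within each protein, expresses them in per mille, and compares a value against the uniform expectation of 1/400. The function names, the toy sequences, the one-letter residue codes, and the choice to normalize by the total number of counted dipeptides are assumptions made for illustration; the page does not describe the exact procedure used by the database.

```python
from collections import Counter

UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Toy input (hypothetical sequences, not taken from the Heldan proteome).
toy = ["MAAAL", "MAL"]
freqs = dipeptide_per_mille(toy)
```

With the values from the table, `classify(10.757)` (AlaAla) gives "over" and `classify(0.629)` (AlaCys) gives "under".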

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
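To make the unit conversion concrete, here is a minimal Python sketch that turns a per-mille value into a fraction and a percentage, and relates it to the uniform expectation of 2.5‰ (1/400). The helper names are hypothetical; the AlaAla value is taken from the table.

```python
UNIFORM_PER_MILLE = 2.5  # 1/400 expressed in per mille


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value, expected=UNIFORM_PER_MILLE):
    """Ratio of an observed per-mille frequency to the uniform expectation."""
    return value / expected


ala_ala = 10.757  # AlaAla value from the table, in per mille
print(f"fraction: {per_mille_to_fraction(ala_ala):.6f}")        # fraction: 0.010757
print(f"percent:  {per_mille_to_fraction(ala_ala) * 100:.4f}")  # percent:  1.0757
print(f"fold:     {fold_vs_uniform(ala_ala):.2f}")              # fold:     4.30
```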

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
10.757AlaAla: 10.757 ± 1.061
0.629AlaCys: 0.629 ± 0.24
5.599AlaAsp: 5.599 ± 0.775
6.291AlaGlu: 6.291 ± 0.75
2.957AlaPhe: 2.957 ± 0.532
7.612AlaGly: 7.612 ± 0.64
1.699AlaHis: 1.699 ± 0.378
3.712AlaIle: 3.712 ± 0.519
4.215AlaLys: 4.215 ± 0.627
9.248AlaLeu: 9.248 ± 0.881
2.516AlaMet: 2.516 ± 0.342
2.453AlaAsn: 2.453 ± 0.514
5.913AlaPro: 5.913 ± 0.88
3.712AlaGln: 3.712 ± 0.461
5.976AlaArg: 5.976 ± 0.597
4.404AlaSer: 4.404 ± 0.515
5.347AlaThr: 5.347 ± 0.453
7.864AlaVal: 7.864 ± 0.681
2.139AlaTrp: 2.139 ± 0.321
2.076AlaTyr: 2.076 ± 0.436
0.0AlaXaa: 0.0 ± 0.0
Cys
0.566CysAla: 0.566 ± 0.173
0.126CysCys: 0.126 ± 0.128
0.755CysAsp: 0.755 ± 0.245
0.692CysGlu: 0.692 ± 0.218
0.126CysPhe: 0.126 ± 0.094
0.881CysGly: 0.881 ± 0.234
0.252CysHis: 0.252 ± 0.126
0.566CysIle: 0.566 ± 0.207
0.44CysLys: 0.44 ± 0.159
0.692CysLeu: 0.692 ± 0.271
0.189CysMet: 0.189 ± 0.117
0.503CysAsn: 0.503 ± 0.18
0.566CysPro: 0.566 ± 0.243
0.44CysGln: 0.44 ± 0.164
1.069CysArg: 1.069 ± 0.248
0.44CysSer: 0.44 ± 0.171
0.44CysThr: 0.44 ± 0.183
0.44CysVal: 0.44 ± 0.178
0.063CysTrp: 0.063 ± 0.053
0.44CysTyr: 0.44 ± 0.2
0.0CysXaa: 0.0 ± 0.0
Asp
6.102AspAla: 6.102 ± 0.682
0.755AspCys: 0.755 ± 0.196
4.592AspAsp: 4.592 ± 0.575
4.655AspGlu: 4.655 ± 0.575
2.579AspPhe: 2.579 ± 0.381
5.788AspGly: 5.788 ± 0.565
1.447AspHis: 1.447 ± 0.333
2.705AspIle: 2.705 ± 0.419
2.139AspLys: 2.139 ± 0.376
5.221AspLeu: 5.221 ± 0.612
1.321AspMet: 1.321 ± 0.251
2.516AspAsn: 2.516 ± 0.45
4.592AspPro: 4.592 ± 0.565
1.761AspGln: 1.761 ± 0.323
4.089AspArg: 4.089 ± 0.708
3.271AspSer: 3.271 ± 0.472
3.775AspThr: 3.775 ± 0.416
4.089AspVal: 4.089 ± 0.396
1.51AspTrp: 1.51 ± 0.303
2.328AspTyr: 2.328 ± 0.435
0.0AspXaa: 0.0 ± 0.0
Glu
7.36GluAla: 7.36 ± 0.758
0.629GluCys: 0.629 ± 0.191
4.467GluAsp: 4.467 ± 0.653
4.655GluGlu: 4.655 ± 0.627
2.831GluPhe: 2.831 ± 0.441
4.907GluGly: 4.907 ± 0.654
1.51GluHis: 1.51 ± 0.29
2.705GluIle: 2.705 ± 0.43
2.265GluLys: 2.265 ± 0.372
6.731GluLeu: 6.731 ± 0.792
1.636GluMet: 1.636 ± 0.367
2.265GluAsn: 2.265 ± 0.381
2.957GluPro: 2.957 ± 0.567
2.579GluGln: 2.579 ± 0.405
4.278GluArg: 4.278 ± 0.598
3.523GluSer: 3.523 ± 0.442
3.334GluThr: 3.334 ± 0.452
5.41GluVal: 5.41 ± 0.645
1.069GluTrp: 1.069 ± 0.23
2.453GluTyr: 2.453 ± 0.359
0.0GluXaa: 0.0 ± 0.0
Phe
3.145PheAla: 3.145 ± 0.43
0.315PheCys: 0.315 ± 0.188
2.831PheAsp: 2.831 ± 0.459
2.579PheGlu: 2.579 ± 0.362
0.44PhePhe: 0.44 ± 0.17
3.46PheGly: 3.46 ± 0.458
0.755PheHis: 0.755 ± 0.235
1.636PheIle: 1.636 ± 0.294
0.818PheLys: 0.818 ± 0.233
2.894PheLeu: 2.894 ± 0.534
0.755PheMet: 0.755 ± 0.193
1.761PheAsn: 1.761 ± 0.356
1.95PhePro: 1.95 ± 0.389
1.258PheGln: 1.258 ± 0.256
2.076PheArg: 2.076 ± 0.334
1.824PheSer: 1.824 ± 0.351
1.887PheThr: 1.887 ± 0.39
2.265PheVal: 2.265 ± 0.391
0.44PheTrp: 0.44 ± 0.166
0.881PheTyr: 0.881 ± 0.22
0.0PheXaa: 0.0 ± 0.0
Gly
6.228GlyAla: 6.228 ± 0.73
1.069GlyCys: 1.069 ± 0.29
5.976GlyAsp: 5.976 ± 0.767
4.278GlyGlu: 4.278 ± 0.547
3.46GlyPhe: 3.46 ± 0.555
7.423GlyGly: 7.423 ± 1.328
2.013GlyHis: 2.013 ± 0.362
3.334GlyIle: 3.334 ± 0.496
3.397GlyLys: 3.397 ± 0.534
6.039GlyLeu: 6.039 ± 0.639
1.95GlyMet: 1.95 ± 0.324
3.271GlyAsn: 3.271 ± 0.58
6.983GlyPro: 6.983 ± 2.373
2.894GlyGln: 2.894 ± 0.411
5.033GlyArg: 5.033 ± 0.617
5.159GlySer: 5.159 ± 0.612
5.284GlyThr: 5.284 ± 0.583
5.976GlyVal: 5.976 ± 0.572
1.384GlyTrp: 1.384 ± 0.31
2.894GlyTyr: 2.894 ± 0.401
0.0GlyXaa: 0.0 ± 0.0
His
1.824HisAla: 1.824 ± 0.301
0.189HisCys: 0.189 ± 0.113
1.573HisAsp: 1.573 ± 0.301
1.132HisGlu: 1.132 ± 0.271
0.692HisPhe: 0.692 ± 0.185
2.328HisGly: 2.328 ± 0.54
0.377HisHis: 0.377 ± 0.147
1.384HisIle: 1.384 ± 0.299
0.944HisLys: 0.944 ± 0.245
1.384HisLeu: 1.384 ± 0.329
0.44HisMet: 0.44 ± 0.158
0.566HisAsn: 0.566 ± 0.168
1.195HisPro: 1.195 ± 0.286
0.881HisGln: 0.881 ± 0.234
1.887HisArg: 1.887 ± 0.468
0.881HisSer: 0.881 ± 0.29
1.132HisThr: 1.132 ± 0.272
0.881HisVal: 0.881 ± 0.225
0.44HisTrp: 0.44 ± 0.176
0.881HisTyr: 0.881 ± 0.226
0.0HisXaa: 0.0 ± 0.0
Ile
5.284IleAla: 5.284 ± 0.677
0.315IleCys: 0.315 ± 0.142
3.397IleAsp: 3.397 ± 0.438
4.278IleGlu: 4.278 ± 0.58
1.384IlePhe: 1.384 ± 0.385
4.404IleGly: 4.404 ± 0.472
1.258IleHis: 1.258 ± 0.278
1.636IleIle: 1.636 ± 0.293
1.384IleLys: 1.384 ± 0.321
2.768IleLeu: 2.768 ± 0.409
0.818IleMet: 0.818 ± 0.305
1.384IleAsn: 1.384 ± 0.266
3.46IlePro: 3.46 ± 0.396
1.573IleGln: 1.573 ± 0.403
3.523IleArg: 3.523 ± 0.466
2.076IleSer: 2.076 ± 0.477
3.145IleThr: 3.145 ± 0.437
3.083IleVal: 3.083 ± 0.399
0.755IleTrp: 0.755 ± 0.193
1.132IleTyr: 1.132 ± 0.274
0.0IleXaa: 0.0 ± 0.0
Lys
4.592LysAla: 4.592 ± 0.568
0.126LysCys: 0.126 ± 0.104
2.391LysAsp: 2.391 ± 0.411
2.328LysGlu: 2.328 ± 0.445
0.944LysPhe: 0.944 ± 0.196
4.152LysGly: 4.152 ± 0.652
0.944LysHis: 0.944 ± 0.249
2.139LysIle: 2.139 ± 0.418
2.391LysLys: 2.391 ± 0.441
3.145LysLeu: 3.145 ± 0.357
1.069LysMet: 1.069 ± 0.269
1.132LysAsn: 1.132 ± 0.275
2.265LysPro: 2.265 ± 0.483
1.384LysGln: 1.384 ± 0.298
2.768LysArg: 2.768 ± 0.522
2.516LysSer: 2.516 ± 0.439
2.265LysThr: 2.265 ± 0.406
3.271LysVal: 3.271 ± 0.479
0.818LysTrp: 0.818 ± 0.263
1.007LysTyr: 1.007 ± 0.238
0.0LysXaa: 0.0 ± 0.0
Leu
8.304LeuAla: 8.304 ± 0.683
0.692LeuCys: 0.692 ± 0.311
5.033LeuAsp: 5.033 ± 0.645
6.165LeuGlu: 6.165 ± 0.479
1.95LeuPhe: 1.95 ± 0.3
5.851LeuGly: 5.851 ± 0.705
1.447LeuHis: 1.447 ± 0.382
4.467LeuIle: 4.467 ± 0.533
3.837LeuLys: 3.837 ± 0.439
5.725LeuLeu: 5.725 ± 0.588
2.328LeuMet: 2.328 ± 0.326
2.894LeuAsn: 2.894 ± 0.4
3.9LeuPro: 3.9 ± 0.525
2.265LeuGln: 2.265 ± 0.446
6.668LeuArg: 6.668 ± 0.614
4.215LeuSer: 4.215 ± 0.54
5.221LeuThr: 5.221 ± 0.53
5.221LeuVal: 5.221 ± 0.515
1.321LeuTrp: 1.321 ± 0.324
2.453LeuTyr: 2.453 ± 0.489
0.0LeuXaa: 0.0 ± 0.0
Met
2.831MetAla: 2.831 ± 0.468
0.063MetCys: 0.063 ± 0.064
1.95MetAsp: 1.95 ± 0.339
1.384MetGlu: 1.384 ± 0.287
0.692MetPhe: 0.692 ± 0.218
1.636MetGly: 1.636 ± 0.375
0.377MetHis: 0.377 ± 0.184
1.51MetIle: 1.51 ± 0.28
1.51MetLys: 1.51 ± 0.263
1.384MetLeu: 1.384 ± 0.242
0.44MetMet: 0.44 ± 0.197
1.007MetAsn: 1.007 ± 0.255
1.573MetPro: 1.573 ± 0.383
0.818MetGln: 0.818 ± 0.251
1.384MetArg: 1.384 ± 0.343
2.265MetSer: 2.265 ± 0.347
2.139MetThr: 2.139 ± 0.411
1.384MetVal: 1.384 ± 0.297
0.252MetTrp: 0.252 ± 0.124
0.692MetTyr: 0.692 ± 0.195
0.0MetXaa: 0.0 ± 0.0
Asn
3.397AsnAla: 3.397 ± 0.476
0.503AsnCys: 0.503 ± 0.173
1.824AsnAsp: 1.824 ± 0.322
1.887AsnGlu: 1.887 ± 0.303
1.384AsnPhe: 1.384 ± 0.225
3.46AsnGly: 3.46 ± 0.582
0.881AsnHis: 0.881 ± 0.212
2.265AsnIle: 2.265 ± 0.434
0.881AsnLys: 0.881 ± 0.223
2.265AsnLeu: 2.265 ± 0.381
0.818AsnMet: 0.818 ± 0.291
0.944AsnAsn: 0.944 ± 0.257
2.139AsnPro: 2.139 ± 0.405
1.007AsnGln: 1.007 ± 0.241
1.887AsnArg: 1.887 ± 0.364
1.447AsnSer: 1.447 ± 0.306
2.013AsnThr: 2.013 ± 0.324
2.705AsnVal: 2.705 ± 0.517
0.629AsnTrp: 0.629 ± 0.194
1.384AsnTyr: 1.384 ± 0.261
0.0AsnXaa: 0.0 ± 0.0
Pro
5.41ProAla: 5.41 ± 0.527
0.503ProCys: 0.503 ± 0.176
3.712ProAsp: 3.712 ± 0.519
3.963ProGlu: 3.963 ± 0.539
1.636ProPhe: 1.636 ± 0.332
5.347ProGly: 5.347 ± 0.768
1.195ProHis: 1.195 ± 0.209
2.265ProIle: 2.265 ± 0.304
3.02ProLys: 3.02 ± 0.602
3.963ProLeu: 3.963 ± 0.607
1.321ProMet: 1.321 ± 0.343
2.579ProAsn: 2.579 ± 0.475
2.894ProPro: 2.894 ± 0.569
3.523ProGln: 3.523 ± 1.192
3.271ProArg: 3.271 ± 0.558
2.957ProSer: 2.957 ± 0.392
3.523ProThr: 3.523 ± 0.439
4.278ProVal: 4.278 ± 0.521
0.881ProTrp: 0.881 ± 0.348
1.258ProTyr: 1.258 ± 0.292
0.0ProXaa: 0.0 ± 0.0
Gln
3.775GlnAla: 3.775 ± 0.585
0.126GlnCys: 0.126 ± 0.075
1.636GlnAsp: 1.636 ± 0.266
2.265GlnGlu: 2.265 ± 0.363
1.636GlnPhe: 1.636 ± 0.364
4.404GlnGly: 4.404 ± 1.859
0.755GlnHis: 0.755 ± 0.2
1.761GlnIle: 1.761 ± 0.354
1.195GlnLys: 1.195 ± 0.252
3.46GlnLeu: 3.46 ± 0.611
1.258GlnMet: 1.258 ± 0.304
0.692GlnAsn: 0.692 ± 0.199
1.699GlnPro: 1.699 ± 0.335
1.95GlnGln: 1.95 ± 0.526
3.02GlnArg: 3.02 ± 0.483
2.013GlnSer: 2.013 ± 0.389
1.447GlnThr: 1.447 ± 0.338
2.705GlnVal: 2.705 ± 0.42
0.566GlnTrp: 0.566 ± 0.205
0.755GlnTyr: 0.755 ± 0.193
0.0GlnXaa: 0.0 ± 0.0
Arg
5.599ArgAla: 5.599 ± 0.635
1.195ArgCys: 1.195 ± 0.366
3.963ArgAsp: 3.963 ± 0.491
4.97ArgGlu: 4.97 ± 0.553
2.202ArgPhe: 2.202 ± 0.341
4.718ArgGly: 4.718 ± 0.551
1.447ArgHis: 1.447 ± 0.313
3.9ArgIle: 3.9 ± 0.508
3.649ArgLys: 3.649 ± 0.525
5.284ArgLeu: 5.284 ± 0.625
2.642ArgMet: 2.642 ± 0.401
2.328ArgAsn: 2.328 ± 0.389
2.894ArgPro: 2.894 ± 0.483
2.391ArgGln: 2.391 ± 0.39
5.913ArgArg: 5.913 ± 0.817
4.215ArgSer: 4.215 ± 0.6
3.083ArgThr: 3.083 ± 0.497
5.536ArgVal: 5.536 ± 0.594
1.699ArgTrp: 1.699 ± 0.374
1.761ArgTyr: 1.761 ± 0.36
0.0ArgXaa: 0.0 ± 0.0
Ser
3.649SerAla: 3.649 ± 0.448
0.189SerCys: 0.189 ± 0.121
2.957SerAsp: 2.957 ± 0.49
3.649SerGlu: 3.649 ± 0.476
3.271SerPhe: 3.271 ± 0.429
5.096SerGly: 5.096 ± 0.649
1.321SerHis: 1.321 ± 0.291
2.579SerIle: 2.579 ± 0.338
2.076SerLys: 2.076 ± 0.53
4.467SerLeu: 4.467 ± 0.573
1.636SerMet: 1.636 ± 0.33
1.51SerAsn: 1.51 ± 0.294
2.202SerPro: 2.202 ± 0.313
2.579SerGln: 2.579 ± 0.44
3.837SerArg: 3.837 ± 0.413
2.768SerSer: 2.768 ± 0.444
2.265SerThr: 2.265 ± 0.473
3.9SerVal: 3.9 ± 0.468
1.258SerTrp: 1.258 ± 0.225
1.887SerTyr: 1.887 ± 0.339
0.0SerXaa: 0.0 ± 0.0
Thr
5.221ThrAla: 5.221 ± 0.651
0.881ThrCys: 0.881 ± 0.281
3.145ThrAsp: 3.145 ± 0.422
4.089ThrGlu: 4.089 ± 0.538
2.139ThrPhe: 2.139 ± 0.373
5.536ThrGly: 5.536 ± 0.878
0.755ThrHis: 0.755 ± 0.22
2.265ThrIle: 2.265 ± 0.471
2.139ThrLys: 2.139 ± 0.413
5.41ThrLeu: 5.41 ± 0.629
1.007ThrMet: 1.007 ± 0.185
1.384ThrAsn: 1.384 ± 0.414
3.649ThrPro: 3.649 ± 0.421
2.391ThrGln: 2.391 ± 0.466
3.9ThrArg: 3.9 ± 0.626
1.699ThrSer: 1.699 ± 0.257
2.768ThrThr: 2.768 ± 0.373
5.033ThrVal: 5.033 ± 0.664
1.258ThrTrp: 1.258 ± 0.274
1.699ThrTyr: 1.699 ± 0.325
0.0ThrXaa: 0.0 ± 0.0
Val
6.417ValAla: 6.417 ± 0.681
0.755ValCys: 0.755 ± 0.181
5.913ValAsp: 5.913 ± 0.615
5.033ValGlu: 5.033 ± 0.578
2.202ValPhe: 2.202 ± 0.393
4.026ValGly: 4.026 ± 0.54
1.51ValHis: 1.51 ± 0.369
3.649ValIle: 3.649 ± 0.506
3.775ValLys: 3.775 ± 0.54
5.725ValLeu: 5.725 ± 0.594
1.447ValMet: 1.447 ± 0.293
2.705ValAsn: 2.705 ± 0.439
4.467ValPro: 4.467 ± 0.588
1.887ValGln: 1.887 ± 0.341
5.347ValArg: 5.347 ± 0.615
5.159ValSer: 5.159 ± 0.624
4.215ValThr: 4.215 ± 0.592
5.976ValVal: 5.976 ± 0.585
1.195ValTrp: 1.195 ± 0.359
2.139ValTyr: 2.139 ± 0.412
0.0ValXaa: 0.0 ± 0.0
Trp
1.887TrpAla: 1.887 ± 0.355
0.377TrpCys: 0.377 ± 0.159
1.258TrpAsp: 1.258 ± 0.305
1.195TrpGlu: 1.195 ± 0.236
0.692TrpPhe: 0.692 ± 0.205
1.321TrpGly: 1.321 ± 0.305
0.315TrpHis: 0.315 ± 0.142
1.447TrpIle: 1.447 ± 0.303
0.629TrpLys: 0.629 ± 0.163
1.321TrpLeu: 1.321 ± 0.316
0.629TrpMet: 0.629 ± 0.169
0.629TrpAsn: 0.629 ± 0.279
0.692TrpPro: 0.692 ± 0.217
0.755TrpGln: 0.755 ± 0.212
0.944TrpArg: 0.944 ± 0.219
1.069TrpSer: 1.069 ± 0.257
1.195TrpThr: 1.195 ± 0.308
1.447TrpVal: 1.447 ± 0.28
0.629TrpTrp: 0.629 ± 0.196
0.503TrpTyr: 0.503 ± 0.211
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.768TyrAla: 2.768 ± 0.47
0.252TyrCys: 0.252 ± 0.112
2.265TyrAsp: 2.265 ± 0.446
2.076TyrGlu: 2.076 ± 0.389
0.755TyrPhe: 0.755 ± 0.21
1.699TyrGly: 1.699 ± 0.321
0.755TyrHis: 0.755 ± 0.223
1.132TyrIle: 1.132 ± 0.263
0.944TyrLys: 0.944 ± 0.302
2.894TyrLeu: 2.894 ± 0.42
1.007TyrMet: 1.007 ± 0.244
1.195TyrAsn: 1.195 ± 0.245
1.51TyrPro: 1.51 ± 0.289
1.069TyrGln: 1.069 ± 0.253
2.516TyrArg: 2.516 ± 0.468
1.195TyrSer: 1.195 ± 0.286
1.887TyrThr: 1.887 ± 0.313
2.076TyrVal: 2.076 ± 0.384
0.629TyrTrp: 0.629 ± 0.223
1.132TyrTyr: 1.132 ± 0.32
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 89 proteins (15897 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
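A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. This is only a sketch under stated assumptions: the use of the standard deviation as the error, the fixed seed, the function names, and the toy sequences are not taken from the page, which states only that 100 resamples were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, one-letter codes).
toy = ["MAAAL", "MALK", "MKKAA", "MGPGA"]
err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.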

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski