Amino acid dipepetide frequency for Escherichia virus Lambda_4C10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.321AlaAla: 10.321 ± 2.102
1.079AlaCys: 1.079 ± 0.435
4.655AlaAsp: 4.655 ± 0.605
7.893AlaGlu: 7.893 ± 0.902
3.373AlaPhe: 3.373 ± 0.494
7.42AlaGly: 7.42 ± 0.817
1.417AlaHis: 1.417 ± 0.3
6.206AlaIle: 6.206 ± 0.709
3.778AlaLys: 3.778 ± 0.447
7.893AlaLeu: 7.893 ± 0.924
2.631AlaMet: 2.631 ± 0.496
3.98AlaAsn: 3.98 ± 0.577
2.563AlaPro: 2.563 ± 0.415
5.059AlaGln: 5.059 ± 0.933
6.004AlaArg: 6.004 ± 0.769
5.869AlaSer: 5.869 ± 0.931
5.599AlaThr: 5.599 ± 0.934
6.409AlaVal: 6.409 ± 0.66
1.754AlaTrp: 1.754 ± 0.358
2.563AlaTyr: 2.563 ± 0.319
0.0AlaXaa: 0.0 ± 0.0
Cys
1.214CysAla: 1.214 ± 0.34
0.472CysCys: 0.472 ± 0.225
0.809CysAsp: 0.809 ± 0.227
0.742CysGlu: 0.742 ± 0.262
0.135CysPhe: 0.135 ± 0.107
1.079CysGly: 1.079 ± 0.325
0.27CysHis: 0.27 ± 0.172
0.54CysIle: 0.54 ± 0.2
0.472CysLys: 0.472 ± 0.175
0.809CysLeu: 0.809 ± 0.227
0.337CysMet: 0.337 ± 0.137
0.405CysAsn: 0.405 ± 0.15
0.54CysPro: 0.54 ± 0.188
0.405CysGln: 0.405 ± 0.157
0.944CysArg: 0.944 ± 0.288
1.079CysSer: 1.079 ± 0.277
0.675CysThr: 0.675 ± 0.228
0.675CysVal: 0.675 ± 0.207
0.202CysTrp: 0.202 ± 0.11
0.675CysTyr: 0.675 ± 0.193
0.0CysXaa: 0.0 ± 0.0
Asp
5.194AspAla: 5.194 ± 0.451
0.27AspCys: 0.27 ± 0.142
4.047AspAsp: 4.047 ± 0.526
3.643AspGlu: 3.643 ± 0.586
1.889AspPhe: 1.889 ± 0.288
5.194AspGly: 5.194 ± 0.703
0.675AspHis: 0.675 ± 0.26
4.115AspIle: 4.115 ± 0.657
3.305AspLys: 3.305 ± 0.459
4.452AspLeu: 4.452 ± 0.654
1.484AspMet: 1.484 ± 0.332
2.361AspAsn: 2.361 ± 0.375
2.226AspPro: 2.226 ± 0.429
1.282AspGln: 1.282 ± 0.311
2.833AspArg: 2.833 ± 0.489
3.103AspSer: 3.103 ± 0.407
2.698AspThr: 2.698 ± 0.344
4.25AspVal: 4.25 ± 0.769
0.944AspTrp: 0.944 ± 0.364
1.889AspTyr: 1.889 ± 0.385
0.0AspXaa: 0.0 ± 0.0
Glu
5.801GluAla: 5.801 ± 0.968
0.809GluCys: 0.809 ± 0.285
2.968GluAsp: 2.968 ± 0.402
3.44GluGlu: 3.44 ± 0.642
1.956GluPhe: 1.956 ± 0.432
3.71GluGly: 3.71 ± 0.505
1.282GluHis: 1.282 ± 0.312
3.643GluIle: 3.643 ± 0.51
3.913GluLys: 3.913 ± 0.567
5.801GluLeu: 5.801 ± 0.714
1.686GluMet: 1.686 ± 0.343
2.091GluAsn: 2.091 ± 0.32
2.226GluPro: 2.226 ± 0.338
3.98GluGln: 3.98 ± 0.641
3.71GluArg: 3.71 ± 0.791
4.047GluSer: 4.047 ± 0.572
3.508GluThr: 3.508 ± 0.493
3.575GluVal: 3.575 ± 0.412
1.147GluTrp: 1.147 ± 0.266
1.956GluTyr: 1.956 ± 0.364
0.0GluXaa: 0.0 ± 0.0
Phe
2.024PheAla: 2.024 ± 0.405
0.405PheCys: 0.405 ± 0.144
2.631PheAsp: 2.631 ± 0.405
1.889PheGlu: 1.889 ± 0.382
1.214PhePhe: 1.214 ± 0.334
2.766PheGly: 2.766 ± 0.589
0.877PheHis: 0.877 ± 0.225
1.889PheIle: 1.889 ± 0.425
2.294PheLys: 2.294 ± 0.371
2.901PheLeu: 2.901 ± 0.446
1.282PheMet: 1.282 ± 0.264
1.484PheAsn: 1.484 ± 0.296
1.686PhePro: 1.686 ± 0.299
0.809PheGln: 0.809 ± 0.242
2.766PheArg: 2.766 ± 0.444
3.508PheSer: 3.508 ± 0.487
2.496PheThr: 2.496 ± 0.398
2.361PheVal: 2.361 ± 0.363
0.405PheTrp: 0.405 ± 0.148
0.809PheTyr: 0.809 ± 0.257
0.0PheXaa: 0.0 ± 0.0
Gly
6.139GlyAla: 6.139 ± 0.822
0.675GlyCys: 0.675 ± 0.189
4.924GlyAsp: 4.924 ± 0.571
3.643GlyGlu: 3.643 ± 0.604
2.698GlyPhe: 2.698 ± 0.513
5.194GlyGly: 5.194 ± 0.94
1.484GlyHis: 1.484 ± 0.388
4.992GlyIle: 4.992 ± 0.636
5.599GlyLys: 5.599 ± 0.646
6.004GlyLeu: 6.004 ± 0.787
3.036GlyMet: 3.036 ± 0.483
2.563GlyAsn: 2.563 ± 0.373
1.349GlyPro: 1.349 ± 0.242
3.036GlyGln: 3.036 ± 0.458
4.182GlyArg: 4.182 ± 0.421
4.047GlySer: 4.047 ± 0.521
4.182GlyThr: 4.182 ± 0.632
5.869GlyVal: 5.869 ± 0.53
1.147GlyTrp: 1.147 ± 0.21
2.024GlyTyr: 2.024 ± 0.341
0.0GlyXaa: 0.0 ± 0.0
His
1.821HisAla: 1.821 ± 0.425
0.337HisCys: 0.337 ± 0.15
0.877HisAsp: 0.877 ± 0.228
0.607HisGlu: 0.607 ± 0.189
0.675HisPhe: 0.675 ± 0.195
1.821HisGly: 1.821 ± 0.344
0.405HisHis: 0.405 ± 0.194
1.282HisIle: 1.282 ± 0.295
1.282HisLys: 1.282 ± 0.264
1.956HisLeu: 1.956 ± 0.456
0.742HisMet: 0.742 ± 0.231
0.944HisAsn: 0.944 ± 0.268
0.742HisPro: 0.742 ± 0.249
0.54HisGln: 0.54 ± 0.184
1.147HisArg: 1.147 ± 0.232
0.472HisSer: 0.472 ± 0.204
1.079HisThr: 1.079 ± 0.309
1.079HisVal: 1.079 ± 0.266
0.135HisTrp: 0.135 ± 0.103
1.012HisTyr: 1.012 ± 0.207
0.0HisXaa: 0.0 ± 0.0
Ile
5.464IleAla: 5.464 ± 0.687
0.877IleCys: 0.877 ± 0.239
3.103IleAsp: 3.103 ± 0.424
3.845IleGlu: 3.845 ± 0.58
1.754IlePhe: 1.754 ± 0.3
3.305IleGly: 3.305 ± 0.521
0.809IleHis: 0.809 ± 0.292
3.44IleIle: 3.44 ± 0.63
2.698IleLys: 2.698 ± 0.541
3.373IleLeu: 3.373 ± 0.484
1.079IleMet: 1.079 ± 0.22
3.238IleAsn: 3.238 ± 0.53
2.159IlePro: 2.159 ± 0.448
1.754IleGln: 1.754 ± 0.361
3.238IleArg: 3.238 ± 0.423
4.52IleSer: 4.52 ± 0.558
3.778IleThr: 3.778 ± 0.63
3.778IleVal: 3.778 ± 0.547
0.675IleTrp: 0.675 ± 0.179
1.686IleTyr: 1.686 ± 0.488
0.0IleXaa: 0.0 ± 0.0
Lys
5.599LysAla: 5.599 ± 0.702
0.809LysCys: 0.809 ± 0.271
3.103LysAsp: 3.103 ± 0.459
3.373LysGlu: 3.373 ± 0.567
1.619LysPhe: 1.619 ± 0.39
3.44LysGly: 3.44 ± 0.456
1.619LysHis: 1.619 ± 0.32
3.238LysIle: 3.238 ± 0.547
3.575LysLys: 3.575 ± 0.551
3.913LysLeu: 3.913 ± 0.533
1.147LysMet: 1.147 ± 0.266
2.496LysAsn: 2.496 ± 0.391
2.091LysPro: 2.091 ± 0.428
1.889LysGln: 1.889 ± 0.346
3.44LysArg: 3.44 ± 0.448
3.238LysSer: 3.238 ± 0.488
3.44LysThr: 3.44 ± 0.49
3.508LysVal: 3.508 ± 0.494
1.012LysTrp: 1.012 ± 0.24
1.821LysTyr: 1.821 ± 0.333
0.0LysXaa: 0.0 ± 0.0
Leu
8.837LeuAla: 8.837 ± 0.905
1.282LeuCys: 1.282 ± 0.301
4.52LeuAsp: 4.52 ± 0.622
3.44LeuGlu: 3.44 ± 0.379
2.968LeuPhe: 2.968 ± 0.585
4.924LeuGly: 4.924 ± 0.479
1.417LeuHis: 1.417 ± 0.323
3.913LeuIle: 3.913 ± 0.64
5.734LeuLys: 5.734 ± 0.676
7.353LeuLeu: 7.353 ± 0.943
2.226LeuMet: 2.226 ± 0.37
3.305LeuAsn: 3.305 ± 0.421
4.52LeuPro: 4.52 ± 0.601
3.238LeuGln: 3.238 ± 0.567
5.464LeuArg: 5.464 ± 0.748
6.139LeuSer: 6.139 ± 0.672
5.936LeuThr: 5.936 ± 0.867
5.532LeuVal: 5.532 ± 0.525
1.349LeuTrp: 1.349 ± 0.265
2.294LeuTyr: 2.294 ± 0.428
0.0LeuXaa: 0.0 ± 0.0
Met
3.508MetAla: 3.508 ± 0.63
0.067MetCys: 0.067 ± 0.066
1.147MetAsp: 1.147 ± 0.338
0.809MetGlu: 0.809 ± 0.253
1.417MetPhe: 1.417 ± 0.336
1.686MetGly: 1.686 ± 0.329
0.337MetHis: 0.337 ± 0.215
0.944MetIle: 0.944 ± 0.223
2.024MetLys: 2.024 ± 0.384
3.103MetLeu: 3.103 ± 0.425
0.877MetMet: 0.877 ± 0.224
1.147MetAsn: 1.147 ± 0.261
1.417MetPro: 1.417 ± 0.344
0.877MetGln: 0.877 ± 0.265
1.956MetArg: 1.956 ± 0.36
2.294MetSer: 2.294 ± 0.476
2.226MetThr: 2.226 ± 0.379
2.294MetVal: 2.294 ± 0.358
0.405MetTrp: 0.405 ± 0.145
0.337MetTyr: 0.337 ± 0.144
0.0MetXaa: 0.0 ± 0.0
Asn
3.98AsnAla: 3.98 ± 0.654
0.337AsnCys: 0.337 ± 0.129
2.294AsnAsp: 2.294 ± 0.399
2.428AsnGlu: 2.428 ± 0.451
1.214AsnPhe: 1.214 ± 0.316
4.317AsnGly: 4.317 ± 0.49
1.214AsnHis: 1.214 ± 0.274
1.889AsnIle: 1.889 ± 0.348
2.428AsnLys: 2.428 ± 0.442
2.496AsnLeu: 2.496 ± 0.374
1.282AsnMet: 1.282 ± 0.318
2.024AsnAsn: 2.024 ± 0.49
1.889AsnPro: 1.889 ± 0.337
1.552AsnGln: 1.552 ± 0.325
2.563AsnArg: 2.563 ± 0.521
1.754AsnSer: 1.754 ± 0.339
2.024AsnThr: 2.024 ± 0.401
2.226AsnVal: 2.226 ± 0.432
0.472AsnTrp: 0.472 ± 0.144
0.877AsnTyr: 0.877 ± 0.247
0.0AsnXaa: 0.0 ± 0.0
Pro
4.047ProAla: 4.047 ± 0.495
0.405ProCys: 0.405 ± 0.169
3.238ProAsp: 3.238 ± 0.499
3.103ProGlu: 3.103 ± 0.615
1.754ProPhe: 1.754 ± 0.313
3.305ProGly: 3.305 ± 0.429
0.54ProHis: 0.54 ± 0.144
1.484ProIle: 1.484 ± 0.405
1.214ProLys: 1.214 ± 0.307
2.496ProLeu: 2.496 ± 0.405
0.607ProMet: 0.607 ± 0.219
1.214ProAsn: 1.214 ± 0.292
1.417ProPro: 1.417 ± 0.355
1.754ProGln: 1.754 ± 0.378
2.159ProArg: 2.159 ± 0.34
2.428ProSer: 2.428 ± 0.397
2.361ProThr: 2.361 ± 0.409
3.778ProVal: 3.778 ± 0.523
0.809ProTrp: 0.809 ± 0.255
0.877ProTyr: 0.877 ± 0.297
0.0ProXaa: 0.0 ± 0.0
Gln
4.317GlnAla: 4.317 ± 0.804
0.405GlnCys: 0.405 ± 0.145
1.417GlnAsp: 1.417 ± 0.278
2.024GlnGlu: 2.024 ± 0.448
1.619GlnPhe: 1.619 ± 0.359
2.294GlnGly: 2.294 ± 0.382
1.012GlnHis: 1.012 ± 0.297
2.294GlnIle: 2.294 ± 0.386
2.294GlnLys: 2.294 ± 0.34
3.238GlnLeu: 3.238 ± 0.455
1.214GlnMet: 1.214 ± 0.301
1.686GlnAsn: 1.686 ± 0.39
1.552GlnPro: 1.552 ± 0.266
3.171GlnGln: 3.171 ± 0.687
3.575GlnArg: 3.575 ± 0.487
3.036GlnSer: 3.036 ± 0.488
2.361GlnThr: 2.361 ± 0.471
3.238GlnVal: 3.238 ± 0.445
0.607GlnTrp: 0.607 ± 0.206
1.147GlnTyr: 1.147 ± 0.278
0.0GlnXaa: 0.0 ± 0.0
Arg
4.587ArgAla: 4.587 ± 0.506
0.607ArgCys: 0.607 ± 0.233
3.103ArgAsp: 3.103 ± 0.547
5.059ArgGlu: 5.059 ± 0.65
2.563ArgPhe: 2.563 ± 0.439
3.845ArgGly: 3.845 ± 0.545
1.349ArgHis: 1.349 ± 0.296
3.575ArgIle: 3.575 ± 0.57
2.833ArgLys: 2.833 ± 0.409
6.341ArgLeu: 6.341 ± 0.751
2.159ArgMet: 2.159 ± 0.318
2.496ArgAsn: 2.496 ± 0.515
2.294ArgPro: 2.294 ± 0.415
3.643ArgGln: 3.643 ± 0.585
4.79ArgArg: 4.79 ± 0.858
2.563ArgSer: 2.563 ± 0.391
3.036ArgThr: 3.036 ± 0.362
3.913ArgVal: 3.913 ± 0.599
1.417ArgTrp: 1.417 ± 0.297
2.091ArgTyr: 2.091 ± 0.396
0.0ArgXaa: 0.0 ± 0.0
Ser
6.881SerAla: 6.881 ± 0.655
0.877SerCys: 0.877 ± 0.211
3.238SerAsp: 3.238 ± 0.374
4.385SerGlu: 4.385 ± 0.496
2.361SerPhe: 2.361 ± 0.35
6.611SerGly: 6.611 ± 0.917
1.282SerHis: 1.282 ± 0.291
2.631SerIle: 2.631 ± 0.441
2.698SerLys: 2.698 ± 0.5
4.857SerLeu: 4.857 ± 0.649
2.091SerMet: 2.091 ± 0.459
1.821SerAsn: 1.821 ± 0.308
2.631SerPro: 2.631 ± 0.455
2.766SerGln: 2.766 ± 0.345
3.71SerArg: 3.71 ± 0.46
3.845SerSer: 3.845 ± 0.47
3.171SerThr: 3.171 ± 0.449
5.397SerVal: 5.397 ± 0.574
0.675SerTrp: 0.675 ± 0.232
2.024SerTyr: 2.024 ± 0.316
0.0SerXaa: 0.0 ± 0.0
Thr
6.274ThrAla: 6.274 ± 0.724
0.877ThrCys: 0.877 ± 0.201
2.631ThrAsp: 2.631 ± 0.424
4.317ThrGlu: 4.317 ± 0.561
2.361ThrPhe: 2.361 ± 0.323
4.79ThrGly: 4.79 ± 0.646
1.079ThrHis: 1.079 ± 0.273
2.901ThrIle: 2.901 ± 0.463
2.159ThrLys: 2.159 ± 0.387
6.004ThrLeu: 6.004 ± 0.671
1.012ThrMet: 1.012 ± 0.228
1.282ThrAsn: 1.282 ± 0.366
3.373ThrPro: 3.373 ± 0.667
2.294ThrGln: 2.294 ± 0.382
3.373ThrArg: 3.373 ± 0.429
3.44ThrSer: 3.44 ± 0.497
3.575ThrThr: 3.575 ± 0.635
4.452ThrVal: 4.452 ± 0.839
1.012ThrTrp: 1.012 ± 0.278
2.563ThrTyr: 2.563 ± 0.508
0.0ThrXaa: 0.0 ± 0.0
Val
6.746ValAla: 6.746 ± 0.689
1.147ValCys: 1.147 ± 0.353
4.385ValAsp: 4.385 ± 0.426
4.385ValGlu: 4.385 ± 0.583
2.901ValPhe: 2.901 ± 0.368
3.71ValGly: 3.71 ± 0.645
1.147ValHis: 1.147 ± 0.245
3.238ValIle: 3.238 ± 0.521
4.115ValLys: 4.115 ± 0.597
6.274ValLeu: 6.274 ± 0.65
2.361ValMet: 2.361 ± 0.347
3.575ValAsn: 3.575 ± 0.428
2.698ValPro: 2.698 ± 0.487
2.226ValGln: 2.226 ± 0.472
2.766ValArg: 2.766 ± 0.397
5.127ValSer: 5.127 ± 0.619
5.059ValThr: 5.059 ± 0.727
4.79ValVal: 4.79 ± 0.604
1.282ValTrp: 1.282 ± 0.269
2.159ValTyr: 2.159 ± 0.422
0.0ValXaa: 0.0 ± 0.0
Trp
1.282TrpAla: 1.282 ± 0.25
0.202TrpCys: 0.202 ± 0.106
1.147TrpAsp: 1.147 ± 0.273
0.877TrpGlu: 0.877 ± 0.276
0.472TrpPhe: 0.472 ± 0.161
0.944TrpGly: 0.944 ± 0.221
0.405TrpHis: 0.405 ± 0.222
0.809TrpIle: 0.809 ± 0.206
0.607TrpLys: 0.607 ± 0.168
2.091TrpLeu: 2.091 ± 0.409
0.607TrpMet: 0.607 ± 0.19
0.27TrpAsn: 0.27 ± 0.151
0.607TrpPro: 0.607 ± 0.218
0.607TrpGln: 0.607 ± 0.21
1.079TrpArg: 1.079 ± 0.319
1.214TrpSer: 1.214 ± 0.247
1.079TrpThr: 1.079 ± 0.279
1.012TrpVal: 1.012 ± 0.249
0.202TrpTrp: 0.202 ± 0.172
0.607TrpTyr: 0.607 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.428TyrAla: 2.428 ± 0.346
0.54TyrCys: 0.54 ± 0.187
1.619TyrAsp: 1.619 ± 0.238
1.349TyrGlu: 1.349 ± 0.347
1.552TyrPhe: 1.552 ± 0.357
2.428TyrGly: 2.428 ± 0.463
0.337TyrHis: 0.337 ± 0.146
1.552TyrIle: 1.552 ± 0.331
1.349TyrLys: 1.349 ± 0.294
3.036TyrLeu: 3.036 ± 0.5
0.944TyrMet: 0.944 ± 0.322
0.877TyrAsn: 0.877 ± 0.201
1.012TyrPro: 1.012 ± 0.315
1.619TyrGln: 1.619 ± 0.436
2.631TyrArg: 2.631 ± 0.425
2.226TyrSer: 2.226 ± 0.418
1.484TyrThr: 1.484 ± 0.321
1.956TyrVal: 1.956 ± 0.328
0.472TyrTrp: 0.472 ± 0.192
1.147TyrTyr: 1.147 ± 0.292
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (14825 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski