Amino acid dipepetide frequency for Escherichia virus Lambda_2B8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.554AlaAla: 11.554 ± 1.511
0.667AlaCys: 0.667 ± 0.281
4.888AlaAsp: 4.888 ± 0.637
7.628AlaGlu: 7.628 ± 1.044
3.407AlaPhe: 3.407 ± 0.472
8.221AlaGly: 8.221 ± 1.139
1.185AlaHis: 1.185 ± 0.297
5.481AlaIle: 5.481 ± 0.598
3.925AlaLys: 3.925 ± 0.509
7.703AlaLeu: 7.703 ± 0.783
2.963AlaMet: 2.963 ± 0.456
2.888AlaAsn: 2.888 ± 0.417
2.666AlaPro: 2.666 ± 0.521
4.296AlaGln: 4.296 ± 0.769
5.851AlaArg: 5.851 ± 0.647
6.443AlaSer: 6.443 ± 0.604
5.481AlaThr: 5.481 ± 0.784
6.592AlaVal: 6.592 ± 0.63
1.703AlaTrp: 1.703 ± 0.473
2.666AlaTyr: 2.666 ± 0.414
0.0AlaXaa: 0.0 ± 0.0
Cys
0.815CysAla: 0.815 ± 0.291
0.296CysCys: 0.296 ± 0.171
0.296CysAsp: 0.296 ± 0.134
0.593CysGlu: 0.593 ± 0.216
0.444CysPhe: 0.444 ± 0.204
1.259CysGly: 1.259 ± 0.36
0.593CysHis: 0.593 ± 0.218
1.037CysIle: 1.037 ± 0.344
0.518CysLys: 0.518 ± 0.182
1.111CysLeu: 1.111 ± 0.275
0.074CysMet: 0.074 ± 0.065
0.667CysAsn: 0.667 ± 0.232
0.296CysPro: 0.296 ± 0.152
0.148CysGln: 0.148 ± 0.103
1.185CysArg: 1.185 ± 0.319
1.111CysSer: 1.111 ± 0.278
0.815CysThr: 0.815 ± 0.245
0.741CysVal: 0.741 ± 0.237
0.296CysTrp: 0.296 ± 0.13
0.593CysTyr: 0.593 ± 0.204
0.0CysXaa: 0.0 ± 0.0
Asp
5.629AspAla: 5.629 ± 0.644
0.815AspCys: 0.815 ± 0.218
4.148AspAsp: 4.148 ± 0.59
4.148AspGlu: 4.148 ± 0.588
1.778AspPhe: 1.778 ± 0.407
6.295AspGly: 6.295 ± 0.678
0.444AspHis: 0.444 ± 0.163
3.999AspIle: 3.999 ± 0.621
2.963AspLys: 2.963 ± 0.527
3.999AspLeu: 3.999 ± 0.722
1.926AspMet: 1.926 ± 0.356
2.0AspAsn: 2.0 ± 0.369
2.74AspPro: 2.74 ± 0.613
1.037AspGln: 1.037 ± 0.263
2.666AspArg: 2.666 ± 0.465
3.555AspSer: 3.555 ± 0.483
3.481AspThr: 3.481 ± 0.573
3.925AspVal: 3.925 ± 0.612
1.259AspTrp: 1.259 ± 0.359
1.703AspTyr: 1.703 ± 0.317
0.0AspXaa: 0.0 ± 0.0
Glu
5.703GluAla: 5.703 ± 0.814
0.889GluCys: 0.889 ± 0.27
3.037GluAsp: 3.037 ± 0.46
4.37GluGlu: 4.37 ± 0.682
2.518GluPhe: 2.518 ± 0.457
3.555GluGly: 3.555 ± 0.535
1.778GluHis: 1.778 ± 0.399
3.185GluIle: 3.185 ± 0.405
3.629GluLys: 3.629 ± 0.424
6.147GluLeu: 6.147 ± 0.731
2.074GluMet: 2.074 ± 0.442
2.814GluAsn: 2.814 ± 0.559
1.926GluPro: 1.926 ± 0.412
4.666GluGln: 4.666 ± 0.558
4.148GluArg: 4.148 ± 0.584
4.518GluSer: 4.518 ± 0.661
3.703GluThr: 3.703 ± 0.6
3.185GluVal: 3.185 ± 0.545
1.037GluTrp: 1.037 ± 0.285
1.926GluTyr: 1.926 ± 0.333
0.0GluXaa: 0.0 ± 0.0
Phe
1.926PheAla: 1.926 ± 0.339
0.667PheCys: 0.667 ± 0.197
2.814PheAsp: 2.814 ± 0.504
1.407PheGlu: 1.407 ± 0.3
1.185PhePhe: 1.185 ± 0.321
2.518PheGly: 2.518 ± 0.311
0.667PheHis: 0.667 ± 0.196
1.703PheIle: 1.703 ± 0.376
1.481PheLys: 1.481 ± 0.361
2.37PheLeu: 2.37 ± 0.457
1.037PheMet: 1.037 ± 0.276
1.333PheAsn: 1.333 ± 0.394
1.555PhePro: 1.555 ± 0.383
0.889PheGln: 0.889 ± 0.189
2.814PheArg: 2.814 ± 0.365
2.666PheSer: 2.666 ± 0.499
2.814PheThr: 2.814 ± 0.431
2.518PheVal: 2.518 ± 0.433
0.518PheTrp: 0.518 ± 0.176
1.185PheTyr: 1.185 ± 0.256
0.0PheXaa: 0.0 ± 0.0
Gly
5.777GlyAla: 5.777 ± 0.895
0.889GlyCys: 0.889 ± 0.236
4.962GlyAsp: 4.962 ± 0.546
4.814GlyGlu: 4.814 ± 0.727
1.852GlyPhe: 1.852 ± 0.315
6.073GlyGly: 6.073 ± 0.885
1.259GlyHis: 1.259 ± 0.416
4.518GlyIle: 4.518 ± 0.473
4.814GlyLys: 4.814 ± 0.719
5.481GlyLeu: 5.481 ± 0.593
2.888GlyMet: 2.888 ± 0.551
4.592GlyAsn: 4.592 ± 0.62
2.888GlyPro: 2.888 ± 1.449
2.963GlyGln: 2.963 ± 0.494
4.222GlyArg: 4.222 ± 0.495
4.444GlySer: 4.444 ± 0.592
4.222GlyThr: 4.222 ± 0.709
5.036GlyVal: 5.036 ± 0.628
1.778GlyTrp: 1.778 ± 0.324
2.666GlyTyr: 2.666 ± 0.46
0.0GlyXaa: 0.0 ± 0.0
His
1.333HisAla: 1.333 ± 0.378
0.222HisCys: 0.222 ± 0.125
0.963HisAsp: 0.963 ± 0.252
1.185HisGlu: 1.185 ± 0.364
0.815HisPhe: 0.815 ± 0.231
1.407HisGly: 1.407 ± 0.354
0.37HisHis: 0.37 ± 0.183
0.963HisIle: 0.963 ± 0.24
1.333HisLys: 1.333 ± 0.317
1.703HisLeu: 1.703 ± 0.382
0.296HisMet: 0.296 ± 0.161
0.815HisAsn: 0.815 ± 0.224
0.963HisPro: 0.963 ± 0.276
0.593HisGln: 0.593 ± 0.208
1.111HisArg: 1.111 ± 0.263
0.815HisSer: 0.815 ± 0.271
0.741HisThr: 0.741 ± 0.246
1.185HisVal: 1.185 ± 0.322
0.296HisTrp: 0.296 ± 0.181
0.889HisTyr: 0.889 ± 0.259
0.0HisXaa: 0.0 ± 0.0
Ile
4.666IleAla: 4.666 ± 0.583
0.815IleCys: 0.815 ± 0.257
3.037IleAsp: 3.037 ± 0.57
3.037IleGlu: 3.037 ± 0.461
1.037IlePhe: 1.037 ± 0.286
2.888IleGly: 2.888 ± 0.492
0.667IleHis: 0.667 ± 0.234
2.592IleIle: 2.592 ± 0.498
3.037IleLys: 3.037 ± 0.504
3.555IleLeu: 3.555 ± 0.564
1.111IleMet: 1.111 ± 0.331
2.518IleAsn: 2.518 ± 0.476
2.814IlePro: 2.814 ± 0.385
2.592IleGln: 2.592 ± 0.43
3.259IleArg: 3.259 ± 0.48
4.222IleSer: 4.222 ± 0.445
4.148IleThr: 4.148 ± 0.724
3.037IleVal: 3.037 ± 0.409
0.593IleTrp: 0.593 ± 0.279
1.407IleTyr: 1.407 ± 0.359
0.0IleXaa: 0.0 ± 0.0
Lys
5.184LysAla: 5.184 ± 0.627
0.667LysCys: 0.667 ± 0.23
2.74LysAsp: 2.74 ± 0.492
3.629LysGlu: 3.629 ± 0.465
1.333LysPhe: 1.333 ± 0.295
3.925LysGly: 3.925 ± 0.747
1.555LysHis: 1.555 ± 0.323
2.518LysIle: 2.518 ± 0.54
3.703LysLys: 3.703 ± 0.648
4.073LysLeu: 4.073 ± 0.617
1.333LysMet: 1.333 ± 0.343
2.666LysAsn: 2.666 ± 0.606
2.222LysPro: 2.222 ± 0.35
3.185LysGln: 3.185 ± 0.5
3.555LysArg: 3.555 ± 0.513
2.666LysSer: 2.666 ± 0.414
3.111LysThr: 3.111 ± 0.592
2.592LysVal: 2.592 ± 0.453
1.407LysTrp: 1.407 ± 0.265
1.481LysTyr: 1.481 ± 0.335
0.0LysXaa: 0.0 ± 0.0
Leu
8.814LeuAla: 8.814 ± 0.828
0.889LeuCys: 0.889 ± 0.214
3.925LeuAsp: 3.925 ± 0.554
3.703LeuGlu: 3.703 ± 0.519
2.444LeuPhe: 2.444 ± 0.495
4.518LeuGly: 4.518 ± 0.603
1.703LeuHis: 1.703 ± 0.403
3.481LeuIle: 3.481 ± 0.609
4.073LeuLys: 4.073 ± 0.589
5.777LeuLeu: 5.777 ± 0.624
2.148LeuMet: 2.148 ± 0.42
3.407LeuAsn: 3.407 ± 0.515
3.925LeuPro: 3.925 ± 0.539
2.888LeuGln: 2.888 ± 0.477
5.258LeuArg: 5.258 ± 0.592
6.221LeuSer: 6.221 ± 0.753
6.666LeuThr: 6.666 ± 0.759
3.703LeuVal: 3.703 ± 0.492
1.481LeuTrp: 1.481 ± 0.276
2.37LeuTyr: 2.37 ± 0.465
0.0LeuXaa: 0.0 ± 0.0
Met
2.963MetAla: 2.963 ± 0.556
0.148MetCys: 0.148 ± 0.121
1.555MetAsp: 1.555 ± 0.383
1.111MetGlu: 1.111 ± 0.291
1.333MetPhe: 1.333 ± 0.335
1.481MetGly: 1.481 ± 0.236
0.518MetHis: 0.518 ± 0.228
0.963MetIle: 0.963 ± 0.31
2.592MetLys: 2.592 ± 0.491
2.0MetLeu: 2.0 ± 0.265
0.889MetMet: 0.889 ± 0.257
1.259MetAsn: 1.259 ± 0.32
1.185MetPro: 1.185 ± 0.285
1.629MetGln: 1.629 ± 0.376
1.778MetArg: 1.778 ± 0.366
1.926MetSer: 1.926 ± 0.358
2.74MetThr: 2.74 ± 0.542
2.0MetVal: 2.0 ± 0.39
0.296MetTrp: 0.296 ± 0.135
0.518MetTyr: 0.518 ± 0.217
0.0MetXaa: 0.0 ± 0.0
Asn
3.999AsnAla: 3.999 ± 0.659
0.667AsnCys: 0.667 ± 0.227
2.37AsnAsp: 2.37 ± 0.355
2.74AsnGlu: 2.74 ± 0.419
1.555AsnPhe: 1.555 ± 0.387
4.073AsnGly: 4.073 ± 0.525
1.037AsnHis: 1.037 ± 0.297
2.666AsnIle: 2.666 ± 0.472
2.296AsnLys: 2.296 ± 0.531
2.296AsnLeu: 2.296 ± 0.368
0.889AsnMet: 0.889 ± 0.223
1.778AsnAsn: 1.778 ± 0.311
1.926AsnPro: 1.926 ± 0.314
1.481AsnGln: 1.481 ± 0.377
2.666AsnArg: 2.666 ± 0.488
1.926AsnSer: 1.926 ± 0.36
2.444AsnThr: 2.444 ± 0.381
2.814AsnVal: 2.814 ± 0.546
0.37AsnTrp: 0.37 ± 0.133
1.259AsnTyr: 1.259 ± 0.276
0.0AsnXaa: 0.0 ± 0.0
Pro
4.37ProAla: 4.37 ± 0.631
0.518ProCys: 0.518 ± 0.186
4.073ProAsp: 4.073 ± 0.557
3.407ProGlu: 3.407 ± 0.671
1.185ProPhe: 1.185 ± 0.306
3.259ProGly: 3.259 ± 0.467
0.518ProHis: 0.518 ± 0.182
1.185ProIle: 1.185 ± 0.298
2.074ProLys: 2.074 ± 0.607
2.74ProLeu: 2.74 ± 0.511
0.889ProMet: 0.889 ± 0.224
1.481ProAsn: 1.481 ± 0.331
1.555ProPro: 1.555 ± 0.43
1.778ProGln: 1.778 ± 0.395
1.333ProArg: 1.333 ± 0.344
2.666ProSer: 2.666 ± 0.438
2.074ProThr: 2.074 ± 0.405
3.851ProVal: 3.851 ± 0.534
0.815ProTrp: 0.815 ± 0.26
0.815ProTyr: 0.815 ± 0.35
0.0ProXaa: 0.0 ± 0.0
Gln
4.444GlnAla: 4.444 ± 0.724
1.037GlnCys: 1.037 ± 0.269
1.481GlnAsp: 1.481 ± 0.341
3.037GlnGlu: 3.037 ± 0.443
1.407GlnPhe: 1.407 ± 0.327
3.037GlnGly: 3.037 ± 0.52
0.815GlnHis: 0.815 ± 0.206
2.222GlnIle: 2.222 ± 0.372
2.444GlnLys: 2.444 ± 0.44
3.555GlnLeu: 3.555 ± 0.46
1.333GlnMet: 1.333 ± 0.332
2.0GlnAsn: 2.0 ± 0.381
1.333GlnPro: 1.333 ± 0.35
2.37GlnGln: 2.37 ± 0.452
2.518GlnArg: 2.518 ± 0.434
3.555GlnSer: 3.555 ± 0.417
2.37GlnThr: 2.37 ± 0.468
3.333GlnVal: 3.333 ± 0.419
0.593GlnTrp: 0.593 ± 0.185
1.407GlnTyr: 1.407 ± 0.278
0.0GlnXaa: 0.0 ± 0.0
Arg
4.592ArgAla: 4.592 ± 0.602
0.741ArgCys: 0.741 ± 0.254
3.629ArgAsp: 3.629 ± 0.686
4.666ArgGlu: 4.666 ± 0.647
2.222ArgPhe: 2.222 ± 0.407
4.148ArgGly: 4.148 ± 0.595
1.185ArgHis: 1.185 ± 0.306
3.703ArgIle: 3.703 ± 0.474
3.259ArgLys: 3.259 ± 0.596
4.962ArgLeu: 4.962 ± 0.583
2.222ArgMet: 2.222 ± 0.348
2.74ArgAsn: 2.74 ± 0.478
1.852ArgPro: 1.852 ± 0.324
3.185ArgGln: 3.185 ± 0.496
5.407ArgArg: 5.407 ± 0.883
3.259ArgSer: 3.259 ± 0.538
3.259ArgThr: 3.259 ± 0.457
3.851ArgVal: 3.851 ± 0.695
1.185ArgTrp: 1.185 ± 0.31
2.074ArgTyr: 2.074 ± 0.421
0.0ArgXaa: 0.0 ± 0.0
Ser
8.221SerAla: 8.221 ± 1.023
0.593SerCys: 0.593 ± 0.231
4.222SerAsp: 4.222 ± 0.602
4.444SerGlu: 4.444 ± 0.561
2.296SerPhe: 2.296 ± 0.435
7.036SerGly: 7.036 ± 0.805
1.037SerHis: 1.037 ± 0.206
1.926SerIle: 1.926 ± 0.353
2.666SerLys: 2.666 ± 0.465
4.518SerLeu: 4.518 ± 0.61
2.074SerMet: 2.074 ± 0.36
1.703SerAsn: 1.703 ± 0.274
3.111SerPro: 3.111 ± 0.43
3.037SerGln: 3.037 ± 0.391
5.258SerArg: 5.258 ± 0.663
3.407SerSer: 3.407 ± 0.549
3.481SerThr: 3.481 ± 0.499
4.888SerVal: 4.888 ± 0.646
0.889SerTrp: 0.889 ± 0.261
2.0SerTyr: 2.0 ± 0.423
0.0SerXaa: 0.0 ± 0.0
Thr
6.666ThrAla: 6.666 ± 0.844
0.741ThrCys: 0.741 ± 0.222
3.629ThrAsp: 3.629 ± 0.481
4.148ThrGlu: 4.148 ± 0.671
2.37ThrPhe: 2.37 ± 0.562
5.333ThrGly: 5.333 ± 0.878
1.111ThrHis: 1.111 ± 0.284
2.74ThrIle: 2.74 ± 0.439
2.963ThrLys: 2.963 ± 0.41
6.295ThrLeu: 6.295 ± 0.641
1.037ThrMet: 1.037 ± 0.34
1.778ThrAsn: 1.778 ± 0.343
3.333ThrPro: 3.333 ± 0.619
2.814ThrGln: 2.814 ± 0.42
2.888ThrArg: 2.888 ± 0.335
4.148ThrSer: 4.148 ± 0.629
3.481ThrThr: 3.481 ± 0.592
3.777ThrVal: 3.777 ± 0.901
1.111ThrTrp: 1.111 ± 0.248
1.703ThrTyr: 1.703 ± 0.336
0.0ThrXaa: 0.0 ± 0.0
Val
5.407ValAla: 5.407 ± 0.58
0.815ValCys: 0.815 ± 0.274
3.777ValAsp: 3.777 ± 0.476
3.925ValGlu: 3.925 ± 0.485
2.592ValPhe: 2.592 ± 0.514
3.111ValGly: 3.111 ± 0.538
0.667ValHis: 0.667 ± 0.208
3.629ValIle: 3.629 ± 0.523
3.851ValLys: 3.851 ± 0.633
5.407ValLeu: 5.407 ± 0.715
2.222ValMet: 2.222 ± 0.412
3.333ValAsn: 3.333 ± 0.474
2.37ValPro: 2.37 ± 0.43
2.518ValGln: 2.518 ± 0.495
3.481ValArg: 3.481 ± 0.413
5.184ValSer: 5.184 ± 0.873
4.444ValThr: 4.444 ± 0.756
4.666ValVal: 4.666 ± 0.576
0.593ValTrp: 0.593 ± 0.25
1.926ValTyr: 1.926 ± 0.396
0.0ValXaa: 0.0 ± 0.0
Trp
1.481TrpAla: 1.481 ± 0.3
0.37TrpCys: 0.37 ± 0.136
0.963TrpAsp: 0.963 ± 0.312
0.667TrpGlu: 0.667 ± 0.213
0.667TrpPhe: 0.667 ± 0.228
1.259TrpGly: 1.259 ± 0.328
0.518TrpHis: 0.518 ± 0.201
0.963TrpIle: 0.963 ± 0.272
0.889TrpLys: 0.889 ± 0.31
1.481TrpLeu: 1.481 ± 0.37
0.889TrpMet: 0.889 ± 0.259
0.593TrpAsn: 0.593 ± 0.203
0.593TrpPro: 0.593 ± 0.222
0.815TrpGln: 0.815 ± 0.218
0.741TrpArg: 0.741 ± 0.26
1.259TrpSer: 1.259 ± 0.235
1.037TrpThr: 1.037 ± 0.219
0.741TrpVal: 0.741 ± 0.236
0.37TrpTrp: 0.37 ± 0.217
0.667TrpTyr: 0.667 ± 0.196
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.666TyrAla: 2.666 ± 0.432
0.444TyrCys: 0.444 ± 0.164
2.0TyrAsp: 2.0 ± 0.352
2.148TyrGlu: 2.148 ± 0.421
1.629TyrPhe: 1.629 ± 0.384
2.74TyrGly: 2.74 ± 0.54
0.37TyrHis: 0.37 ± 0.156
1.555TyrIle: 1.555 ± 0.367
1.037TyrLys: 1.037 ± 0.261
2.222TyrLeu: 2.222 ± 0.522
0.518TyrMet: 0.518 ± 0.226
0.889TyrAsn: 0.889 ± 0.275
1.333TyrPro: 1.333 ± 0.318
1.407TyrGln: 1.407 ± 0.265
2.074TyrArg: 2.074 ± 0.31
2.814TyrSer: 2.814 ± 0.494
1.555TyrThr: 1.555 ± 0.276
1.481TyrVal: 1.481 ± 0.3
0.37TyrTrp: 0.37 ± 0.133
1.259TyrTyr: 1.259 ± 0.28
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (13503 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski