Amino acid dipepetide frequency for Escherichia virus Lambda_4A7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.315AlaAla: 10.315 ± 1.646
1.079AlaCys: 1.079 ± 0.419
4.719AlaAsp: 4.719 ± 0.595
7.82AlaGlu: 7.82 ± 0.745
3.573AlaPhe: 3.573 ± 0.399
7.416AlaGly: 7.416 ± 0.925
1.348AlaHis: 1.348 ± 0.239
6.202AlaIle: 6.202 ± 0.732
3.775AlaLys: 3.775 ± 0.414
7.888AlaLeu: 7.888 ± 0.794
2.562AlaMet: 2.562 ± 0.469
3.978AlaAsn: 3.978 ± 0.5
2.629AlaPro: 2.629 ± 0.418
4.921AlaGln: 4.921 ± 0.746
6.068AlaArg: 6.068 ± 0.707
5.73AlaSer: 5.73 ± 0.675
5.596AlaThr: 5.596 ± 0.995
6.337AlaVal: 6.337 ± 0.65
1.753AlaTrp: 1.753 ± 0.376
2.562AlaTyr: 2.562 ± 0.317
0.0AlaXaa: 0.0 ± 0.0
Cys
1.146CysAla: 1.146 ± 0.36
0.472CysCys: 0.472 ± 0.225
0.809CysAsp: 0.809 ± 0.26
0.742CysGlu: 0.742 ± 0.236
0.202CysPhe: 0.202 ± 0.122
1.079CysGly: 1.079 ± 0.347
0.27CysHis: 0.27 ± 0.147
0.539CysIle: 0.539 ± 0.191
0.539CysLys: 0.539 ± 0.17
0.876CysLeu: 0.876 ± 0.258
0.337CysMet: 0.337 ± 0.138
0.405CysAsn: 0.405 ± 0.132
0.539CysPro: 0.539 ± 0.2
0.405CysGln: 0.405 ± 0.15
0.944CysArg: 0.944 ± 0.274
1.079CysSer: 1.079 ± 0.287
0.674CysThr: 0.674 ± 0.196
0.674CysVal: 0.674 ± 0.207
0.202CysTrp: 0.202 ± 0.107
0.607CysTyr: 0.607 ± 0.181
0.0CysXaa: 0.0 ± 0.0
Asp
5.124AspAla: 5.124 ± 0.456
0.27AspCys: 0.27 ± 0.131
4.045AspAsp: 4.045 ± 0.492
3.641AspGlu: 3.641 ± 0.548
1.82AspPhe: 1.82 ± 0.276
5.191AspGly: 5.191 ± 0.638
0.674AspHis: 0.674 ± 0.261
4.112AspIle: 4.112 ± 0.609
3.169AspLys: 3.169 ± 0.45
4.45AspLeu: 4.45 ± 0.668
1.685AspMet: 1.685 ± 0.368
2.427AspAsn: 2.427 ± 0.422
2.09AspPro: 2.09 ± 0.43
1.281AspGln: 1.281 ± 0.278
2.832AspArg: 2.832 ± 0.414
3.101AspSer: 3.101 ± 0.379
2.697AspThr: 2.697 ± 0.361
4.247AspVal: 4.247 ± 0.592
0.944AspTrp: 0.944 ± 0.294
1.888AspTyr: 1.888 ± 0.325
0.0AspXaa: 0.0 ± 0.0
Glu
5.798GluAla: 5.798 ± 0.81
0.809GluCys: 0.809 ± 0.241
2.966GluAsp: 2.966 ± 0.405
3.438GluGlu: 3.438 ± 0.712
1.955GluPhe: 1.955 ± 0.404
3.708GluGly: 3.708 ± 0.549
1.281GluHis: 1.281 ± 0.306
3.573GluIle: 3.573 ± 0.46
3.978GluLys: 3.978 ± 0.573
5.798GluLeu: 5.798 ± 0.835
1.618GluMet: 1.618 ± 0.367
2.157GluAsn: 2.157 ± 0.311
2.225GluPro: 2.225 ± 0.347
3.978GluGln: 3.978 ± 0.571
3.573GluArg: 3.573 ± 0.713
4.045GluSer: 4.045 ± 0.526
3.506GluThr: 3.506 ± 0.506
3.573GluVal: 3.573 ± 0.37
1.146GluTrp: 1.146 ± 0.27
1.955GluTyr: 1.955 ± 0.342
0.0GluXaa: 0.0 ± 0.0
Phe
2.023PheAla: 2.023 ± 0.489
0.405PheCys: 0.405 ± 0.135
2.697PheAsp: 2.697 ± 0.45
1.753PheGlu: 1.753 ± 0.328
1.214PhePhe: 1.214 ± 0.392
2.966PheGly: 2.966 ± 0.546
0.944PheHis: 0.944 ± 0.243
1.955PheIle: 1.955 ± 0.415
2.225PheLys: 2.225 ± 0.377
3.101PheLeu: 3.101 ± 0.559
1.146PheMet: 1.146 ± 0.223
1.483PheAsn: 1.483 ± 0.274
1.685PhePro: 1.685 ± 0.321
0.809PheGln: 0.809 ± 0.289
2.832PheArg: 2.832 ± 0.418
3.641PheSer: 3.641 ± 0.483
2.562PheThr: 2.562 ± 0.327
2.292PheVal: 2.292 ± 0.323
0.405PheTrp: 0.405 ± 0.131
0.809PheTyr: 0.809 ± 0.242
0.0PheXaa: 0.0 ± 0.0
Gly
6.27GlyAla: 6.27 ± 0.889
0.674GlyCys: 0.674 ± 0.176
4.921GlyAsp: 4.921 ± 0.667
3.708GlyGlu: 3.708 ± 0.578
2.697GlyPhe: 2.697 ± 0.453
5.326GlyGly: 5.326 ± 0.811
1.551GlyHis: 1.551 ± 0.326
4.989GlyIle: 4.989 ± 0.556
5.663GlyLys: 5.663 ± 0.633
6.0GlyLeu: 6.0 ± 0.778
3.034GlyMet: 3.034 ± 0.503
2.562GlyAsn: 2.562 ± 0.414
1.416GlyPro: 1.416 ± 0.267
3.034GlyGln: 3.034 ± 0.432
4.18GlyArg: 4.18 ± 0.476
3.978GlySer: 3.978 ± 0.498
4.18GlyThr: 4.18 ± 0.541
5.798GlyVal: 5.798 ± 0.554
1.146GlyTrp: 1.146 ± 0.236
2.023GlyTyr: 2.023 ± 0.343
0.0GlyXaa: 0.0 ± 0.0
His
1.82HisAla: 1.82 ± 0.373
0.27HisCys: 0.27 ± 0.133
0.809HisAsp: 0.809 ± 0.219
0.674HisGlu: 0.674 ± 0.196
0.674HisPhe: 0.674 ± 0.18
1.82HisGly: 1.82 ± 0.348
0.405HisHis: 0.405 ± 0.189
1.281HisIle: 1.281 ± 0.333
1.214HisLys: 1.214 ± 0.269
1.888HisLeu: 1.888 ± 0.374
0.742HisMet: 0.742 ± 0.263
0.944HisAsn: 0.944 ± 0.259
0.742HisPro: 0.742 ± 0.198
0.539HisGln: 0.539 ± 0.191
1.146HisArg: 1.146 ± 0.252
0.405HisSer: 0.405 ± 0.184
1.079HisThr: 1.079 ± 0.243
1.214HisVal: 1.214 ± 0.293
0.135HisTrp: 0.135 ± 0.098
1.011HisTyr: 1.011 ± 0.226
0.0HisXaa: 0.0 ± 0.0
Ile
5.528IleAla: 5.528 ± 0.762
0.876IleCys: 0.876 ± 0.259
3.101IleAsp: 3.101 ± 0.326
3.708IleGlu: 3.708 ± 0.532
1.753IlePhe: 1.753 ± 0.298
3.169IleGly: 3.169 ± 0.548
0.876IleHis: 0.876 ± 0.333
3.506IleIle: 3.506 ± 0.636
2.697IleLys: 2.697 ± 0.525
3.371IleLeu: 3.371 ± 0.447
1.146IleMet: 1.146 ± 0.229
3.169IleAsn: 3.169 ± 0.531
2.09IlePro: 2.09 ± 0.364
1.82IleGln: 1.82 ± 0.351
3.303IleArg: 3.303 ± 0.501
4.517IleSer: 4.517 ± 0.551
3.775IleThr: 3.775 ± 0.673
3.708IleVal: 3.708 ± 0.499
0.674IleTrp: 0.674 ± 0.181
1.685IleTyr: 1.685 ± 0.467
0.0IleXaa: 0.0 ± 0.0
Lys
5.528LysAla: 5.528 ± 0.675
0.809LysCys: 0.809 ± 0.276
3.101LysAsp: 3.101 ± 0.439
3.303LysGlu: 3.303 ± 0.511
1.618LysPhe: 1.618 ± 0.358
3.438LysGly: 3.438 ± 0.379
1.618LysHis: 1.618 ± 0.35
3.371LysIle: 3.371 ± 0.634
3.573LysLys: 3.573 ± 0.605
3.843LysLeu: 3.843 ± 0.47
1.146LysMet: 1.146 ± 0.259
2.629LysAsn: 2.629 ± 0.344
2.157LysPro: 2.157 ± 0.432
1.888LysGln: 1.888 ± 0.363
3.438LysArg: 3.438 ± 0.433
3.169LysSer: 3.169 ± 0.429
3.438LysThr: 3.438 ± 0.472
3.438LysVal: 3.438 ± 0.54
1.011LysTrp: 1.011 ± 0.244
1.685LysTyr: 1.685 ± 0.3
0.0LysXaa: 0.0 ± 0.0
Leu
8.629LeuAla: 8.629 ± 0.91
1.348LeuCys: 1.348 ± 0.292
4.517LeuAsp: 4.517 ± 0.53
3.438LeuGlu: 3.438 ± 0.374
3.101LeuPhe: 3.101 ± 0.533
4.921LeuGly: 4.921 ± 0.482
1.483LeuHis: 1.483 ± 0.317
3.91LeuIle: 3.91 ± 0.588
5.798LeuLys: 5.798 ± 0.619
7.618LeuLeu: 7.618 ± 0.901
2.292LeuMet: 2.292 ± 0.408
3.371LeuAsn: 3.371 ± 0.483
4.517LeuPro: 4.517 ± 0.589
3.169LeuGln: 3.169 ± 0.481
5.528LeuArg: 5.528 ± 0.754
6.27LeuSer: 6.27 ± 0.624
5.933LeuThr: 5.933 ± 0.662
5.528LeuVal: 5.528 ± 0.492
1.348LeuTrp: 1.348 ± 0.331
2.36LeuTyr: 2.36 ± 0.379
0.0LeuXaa: 0.0 ± 0.0
Met
3.438MetAla: 3.438 ± 0.732
0.067MetCys: 0.067 ± 0.069
1.079MetAsp: 1.079 ± 0.299
0.876MetGlu: 0.876 ± 0.273
1.416MetPhe: 1.416 ± 0.322
1.753MetGly: 1.753 ± 0.354
0.27MetHis: 0.27 ± 0.172
0.876MetIle: 0.876 ± 0.199
2.09MetLys: 2.09 ± 0.426
3.101MetLeu: 3.101 ± 0.452
0.876MetMet: 0.876 ± 0.266
1.146MetAsn: 1.146 ± 0.247
1.551MetPro: 1.551 ± 0.412
0.944MetGln: 0.944 ± 0.226
1.955MetArg: 1.955 ± 0.309
2.427MetSer: 2.427 ± 0.484
2.225MetThr: 2.225 ± 0.326
2.225MetVal: 2.225 ± 0.378
0.405MetTrp: 0.405 ± 0.132
0.337MetTyr: 0.337 ± 0.135
0.0MetXaa: 0.0 ± 0.0
Asn
3.978AsnAla: 3.978 ± 0.577
0.337AsnCys: 0.337 ± 0.135
2.225AsnAsp: 2.225 ± 0.368
2.427AsnGlu: 2.427 ± 0.436
1.281AsnPhe: 1.281 ± 0.306
4.315AsnGly: 4.315 ± 0.548
1.146AsnHis: 1.146 ± 0.259
1.888AsnIle: 1.888 ± 0.287
2.427AsnLys: 2.427 ± 0.452
2.494AsnLeu: 2.494 ± 0.402
1.348AsnMet: 1.348 ± 0.287
2.023AsnAsn: 2.023 ± 0.392
1.955AsnPro: 1.955 ± 0.327
1.551AsnGln: 1.551 ± 0.281
2.629AsnArg: 2.629 ± 0.622
1.753AsnSer: 1.753 ± 0.333
2.023AsnThr: 2.023 ± 0.398
2.225AsnVal: 2.225 ± 0.379
0.472AsnTrp: 0.472 ± 0.157
0.876AsnTyr: 0.876 ± 0.224
0.0AsnXaa: 0.0 ± 0.0
Pro
4.112ProAla: 4.112 ± 0.523
0.405ProCys: 0.405 ± 0.168
3.236ProAsp: 3.236 ± 0.488
3.169ProGlu: 3.169 ± 0.508
1.82ProPhe: 1.82 ± 0.353
3.303ProGly: 3.303 ± 0.388
0.539ProHis: 0.539 ± 0.176
1.416ProIle: 1.416 ± 0.358
1.214ProLys: 1.214 ± 0.314
2.697ProLeu: 2.697 ± 0.439
0.607ProMet: 0.607 ± 0.204
1.214ProAsn: 1.214 ± 0.29
1.483ProPro: 1.483 ± 0.328
1.753ProGln: 1.753 ± 0.383
2.157ProArg: 2.157 ± 0.367
2.427ProSer: 2.427 ± 0.429
2.292ProThr: 2.292 ± 0.444
3.708ProVal: 3.708 ± 0.597
0.876ProTrp: 0.876 ± 0.25
0.876ProTyr: 0.876 ± 0.287
0.0ProXaa: 0.0 ± 0.0
Gln
4.382GlnAla: 4.382 ± 0.775
0.405GlnCys: 0.405 ± 0.138
1.483GlnAsp: 1.483 ± 0.301
2.023GlnGlu: 2.023 ± 0.423
1.618GlnPhe: 1.618 ± 0.299
2.292GlnGly: 2.292 ± 0.388
0.944GlnHis: 0.944 ± 0.306
2.225GlnIle: 2.225 ± 0.356
2.292GlnLys: 2.292 ± 0.411
3.236GlnLeu: 3.236 ± 0.435
1.214GlnMet: 1.214 ± 0.298
1.685GlnAsn: 1.685 ± 0.27
1.551GlnPro: 1.551 ± 0.273
3.236GlnGln: 3.236 ± 0.658
3.573GlnArg: 3.573 ± 0.561
2.899GlnSer: 2.899 ± 0.404
2.36GlnThr: 2.36 ± 0.435
3.236GlnVal: 3.236 ± 0.437
0.607GlnTrp: 0.607 ± 0.191
1.146GlnTyr: 1.146 ± 0.22
0.0GlnXaa: 0.0 ± 0.0
Arg
4.584ArgAla: 4.584 ± 0.536
0.607ArgCys: 0.607 ± 0.234
3.101ArgAsp: 3.101 ± 0.488
5.124ArgGlu: 5.124 ± 0.638
2.427ArgPhe: 2.427 ± 0.446
3.843ArgGly: 3.843 ± 0.484
1.348ArgHis: 1.348 ± 0.309
3.641ArgIle: 3.641 ± 0.432
2.832ArgLys: 2.832 ± 0.44
6.405ArgLeu: 6.405 ± 0.745
2.225ArgMet: 2.225 ± 0.365
2.427ArgAsn: 2.427 ± 0.508
2.494ArgPro: 2.494 ± 0.424
3.641ArgGln: 3.641 ± 0.642
4.787ArgArg: 4.787 ± 0.784
2.562ArgSer: 2.562 ± 0.41
2.899ArgThr: 2.899 ± 0.443
3.978ArgVal: 3.978 ± 0.628
1.348ArgTrp: 1.348 ± 0.325
2.09ArgTyr: 2.09 ± 0.34
0.0ArgXaa: 0.0 ± 0.0
Ser
6.877SerAla: 6.877 ± 0.645
0.876SerCys: 0.876 ± 0.193
3.236SerAsp: 3.236 ± 0.376
4.382SerGlu: 4.382 ± 0.57
2.427SerPhe: 2.427 ± 0.345
6.674SerGly: 6.674 ± 0.805
1.214SerHis: 1.214 ± 0.269
2.562SerIle: 2.562 ± 0.398
2.629SerLys: 2.629 ± 0.461
4.787SerLeu: 4.787 ± 0.647
2.09SerMet: 2.09 ± 0.45
1.82SerAsn: 1.82 ± 0.266
2.494SerPro: 2.494 ± 0.396
2.764SerGln: 2.764 ± 0.381
3.775SerArg: 3.775 ± 0.434
3.843SerSer: 3.843 ± 0.435
3.101SerThr: 3.101 ± 0.491
5.393SerVal: 5.393 ± 0.604
0.674SerTrp: 0.674 ± 0.205
2.023SerTyr: 2.023 ± 0.36
0.0SerXaa: 0.0 ± 0.0
Thr
6.27ThrAla: 6.27 ± 0.705
0.876ThrCys: 0.876 ± 0.218
2.629ThrAsp: 2.629 ± 0.393
4.315ThrGlu: 4.315 ± 0.646
2.292ThrPhe: 2.292 ± 0.317
4.787ThrGly: 4.787 ± 0.595
1.011ThrHis: 1.011 ± 0.22
2.832ThrIle: 2.832 ± 0.436
2.157ThrLys: 2.157 ± 0.323
6.135ThrLeu: 6.135 ± 0.64
1.011ThrMet: 1.011 ± 0.23
1.281ThrAsn: 1.281 ± 0.391
3.438ThrPro: 3.438 ± 0.639
2.292ThrGln: 2.292 ± 0.409
3.303ThrArg: 3.303 ± 0.399
3.371ThrSer: 3.371 ± 0.383
3.506ThrThr: 3.506 ± 0.561
4.45ThrVal: 4.45 ± 0.872
1.011ThrTrp: 1.011 ± 0.26
2.562ThrTyr: 2.562 ± 0.46
0.0ThrXaa: 0.0 ± 0.0
Val
6.742ValAla: 6.742 ± 0.681
1.146ValCys: 1.146 ± 0.33
4.247ValAsp: 4.247 ± 0.482
4.382ValGlu: 4.382 ± 0.552
3.034ValPhe: 3.034 ± 0.388
3.775ValGly: 3.775 ± 0.542
1.146ValHis: 1.146 ± 0.237
3.169ValIle: 3.169 ± 0.451
4.045ValLys: 4.045 ± 0.522
6.337ValLeu: 6.337 ± 0.564
2.292ValMet: 2.292 ± 0.35
3.506ValAsn: 3.506 ± 0.433
2.629ValPro: 2.629 ± 0.48
2.225ValGln: 2.225 ± 0.495
2.697ValArg: 2.697 ± 0.331
5.056ValSer: 5.056 ± 0.654
5.124ValThr: 5.124 ± 0.702
4.719ValVal: 4.719 ± 0.596
1.281ValTrp: 1.281 ± 0.262
2.157ValTyr: 2.157 ± 0.385
0.0ValXaa: 0.0 ± 0.0
Trp
1.281TrpAla: 1.281 ± 0.289
0.27TrpCys: 0.27 ± 0.122
1.146TrpAsp: 1.146 ± 0.257
0.876TrpGlu: 0.876 ± 0.278
0.472TrpPhe: 0.472 ± 0.153
0.944TrpGly: 0.944 ± 0.226
0.405TrpHis: 0.405 ± 0.151
0.809TrpIle: 0.809 ± 0.237
0.607TrpLys: 0.607 ± 0.202
2.09TrpLeu: 2.09 ± 0.421
0.607TrpMet: 0.607 ± 0.254
0.27TrpAsn: 0.27 ± 0.13
0.607TrpPro: 0.607 ± 0.176
0.539TrpGln: 0.539 ± 0.193
1.079TrpArg: 1.079 ± 0.278
1.214TrpSer: 1.214 ± 0.236
1.079TrpThr: 1.079 ± 0.272
1.011TrpVal: 1.011 ± 0.281
0.202TrpTrp: 0.202 ± 0.158
0.607TrpTyr: 0.607 ± 0.166
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.427TyrAla: 2.427 ± 0.318
0.539TyrCys: 0.539 ± 0.184
1.618TyrAsp: 1.618 ± 0.271
1.348TyrGlu: 1.348 ± 0.325
1.551TyrPhe: 1.551 ± 0.323
2.427TyrGly: 2.427 ± 0.429
0.337TyrHis: 0.337 ± 0.152
1.618TyrIle: 1.618 ± 0.33
1.281TyrLys: 1.281 ± 0.339
2.966TyrLeu: 2.966 ± 0.501
1.011TyrMet: 1.011 ± 0.301
0.876TyrAsn: 0.876 ± 0.201
1.011TyrPro: 1.011 ± 0.32
1.618TyrGln: 1.618 ± 0.385
2.562TyrArg: 2.562 ± 0.4
2.157TyrSer: 2.157 ± 0.361
1.483TyrThr: 1.483 ± 0.33
1.955TyrVal: 1.955 ± 0.356
0.472TyrTrp: 0.472 ± 0.176
1.011TyrTyr: 1.011 ± 0.24
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (14834 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski