Amino acid dipeptide frequency for Butyrivibrio virus Ceridwen

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would be present with a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
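As an illustration of how such a table could be produced, here is a minimal Python sketch that counts overlapping dipeptides within each protein and reports them in per mille. The function name, the toy sequences, and the normalization (dividing by the total number of counted pairs) are assumptions made for this example; the page does not state how the database normalizes its counts.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard one-letter codes

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs.

    Assumption: overlapping pairs are counted within each protein and
    normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Pairs never span two proteins: count inside each sequence only.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}

# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)

# Expected frequency if all 400 dipeptides were equally likely: 2.5 per mille.
expected = 1000.0 / len(AMINO_ACIDS) ** 2
```

In this sketch, pairs containing a non-standard letter still contribute to the total but are not reported; a 21-letter alphabet including X would be needed to reproduce the Xaa row and column.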

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
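The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and the classification rule (a plain comparison with 2.5‰, ignoring the error estimate) is an assumption; the page does not say exactly how the red and blue marking is decided.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"

# Two values taken from the table: AlaAla and GlyPro.
ala_ala = 4.88
gly_pro = 0.083
ala_ala_fraction = per_mille_to_fraction(ala_ala)  # about 0.00488
ala_ala_label = classify(ala_ala)
gly_pro_label = classify(gly_pro)
```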

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.88 ± 1.228
AlaCys: 0.496 ± 0.195
AlaAsp: 5.293 ± 0.584
AlaGlu: 6.616 ± 0.985
AlaPhe: 2.068 ± 0.421
AlaGly: 3.887 ± 0.7
AlaHis: 0.662 ± 0.22
AlaIle: 6.782 ± 1.153
AlaLys: 4.632 ± 0.831
AlaLeu: 6.451 ± 0.891
AlaMet: 2.729 ± 0.667
AlaAsn: 4.962 ± 0.574
AlaPro: 1.902 ± 0.263
AlaGln: 3.06 ± 0.454
AlaArg: 2.812 ± 0.424
AlaSer: 4.88 ± 0.664
AlaThr: 4.797 ± 0.465
AlaVal: 4.714 ± 0.772
AlaTrp: 1.323 ± 0.364
AlaTyr: 3.639 ± 0.453
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.992 ± 0.239
CysCys: 0.579 ± 0.259
CysAsp: 0.331 ± 0.197
CysGlu: 0.744 ± 0.356
CysPhe: 0.579 ± 0.213
CysGly: 2.068 ± 0.511
CysHis: 0.083 ± 0.085
CysIle: 0.992 ± 0.271
CysLys: 1.075 ± 0.275
CysLeu: 0.496 ± 0.187
CysMet: 0.331 ± 0.142
CysAsn: 0.744 ± 0.258
CysPro: 0.91 ± 0.282
CysGln: 0.414 ± 0.175
CysArg: 0.579 ± 0.257
CysSer: 0.91 ± 0.273
CysThr: 0.827 ± 0.289
CysVal: 1.158 ± 0.304
CysTrp: 0.248 ± 0.138
CysTyr: 1.158 ± 0.307
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.383 ± 0.611
AspCys: 0.579 ± 0.249
AspAsp: 4.714 ± 0.752
AspGlu: 3.887 ± 0.555
AspPhe: 2.729 ± 0.401
AspGly: 5.045 ± 0.654
AspHis: 0.91 ± 0.264
AspIle: 4.135 ± 0.67
AspLys: 5.459 ± 0.65
AspLeu: 4.466 ± 0.551
AspMet: 1.737 ± 0.422
AspAsn: 3.143 ± 0.498
AspPro: 0.91 ± 0.242
AspGln: 1.323 ± 0.34
AspArg: 2.647 ± 0.517
AspSer: 3.474 ± 0.46
AspThr: 3.887 ± 0.529
AspVal: 2.647 ± 0.441
AspTrp: 0.496 ± 0.192
AspTyr: 2.812 ± 0.496
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.534 ± 0.737
GluCys: 1.075 ± 0.362
GluAsp: 3.722 ± 0.5
GluGlu: 6.782 ± 1.065
GluPhe: 2.481 ± 0.368
GluGly: 5.21 ± 0.708
GluHis: 1.985 ± 0.401
GluIle: 4.714 ± 0.598
GluLys: 7.278 ± 0.984
GluLeu: 6.699 ± 0.795
GluMet: 2.895 ± 0.579
GluAsn: 3.556 ± 0.49
GluPro: 2.895 ± 0.54
GluGln: 3.887 ± 0.556
GluArg: 4.632 ± 0.752
GluSer: 3.474 ± 0.495
GluThr: 4.714 ± 0.586
GluVal: 5.541 ± 0.528
GluTrp: 1.406 ± 0.284
GluTyr: 4.549 ± 0.708
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.564 ± 0.545
PheCys: 0.662 ± 0.281
PheAsp: 1.82 ± 0.377
PheGlu: 2.564 ± 0.554
PhePhe: 1.241 ± 0.311
PheGly: 2.895 ± 0.48
PheHis: 0.414 ± 0.178
PheIle: 2.233 ± 0.488
PheLys: 2.15 ± 0.406
PheLeu: 2.564 ± 0.569
PheMet: 0.579 ± 0.193
PheAsn: 2.068 ± 0.406
PhePro: 1.737 ± 0.339
PheGln: 0.662 ± 0.279
PheArg: 1.406 ± 0.371
PheSer: 2.068 ± 0.428
PheThr: 3.06 ± 0.555
PheVal: 1.82 ± 0.324
PheTrp: 0.248 ± 0.139
PheTyr: 1.571 ± 0.348
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.286 ± 0.609
GlyCys: 0.91 ± 0.317
GlyAsp: 4.383 ± 0.658
GlyGlu: 4.962 ± 0.574
GlyPhe: 3.308 ± 0.589
GlyGly: 4.632 ± 0.89
GlyHis: 0.331 ± 0.217
GlyIle: 4.88 ± 0.889
GlyLys: 5.872 ± 0.669
GlyLeu: 4.218 ± 0.749
GlyMet: 2.068 ± 0.368
GlyAsn: 3.722 ± 0.594
GlyPro: 0.083 ± 0.073
GlyGln: 1.82 ± 0.333
GlyArg: 2.398 ± 0.492
GlySer: 3.226 ± 0.569
GlyThr: 4.962 ± 0.85
GlyVal: 5.955 ± 0.648
GlyTrp: 0.992 ± 0.247
GlyTyr: 2.812 ± 0.398
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.579 ± 0.171
HisCys: 0.414 ± 0.177
HisAsp: 0.496 ± 0.211
HisGlu: 1.323 ± 0.291
HisPhe: 0.496 ± 0.162
HisGly: 0.496 ± 0.191
HisHis: 0.165 ± 0.123
HisIle: 0.827 ± 0.288
HisLys: 0.992 ± 0.28
HisLeu: 1.075 ± 0.266
HisMet: 0.414 ± 0.184
HisAsn: 0.496 ± 0.218
HisPro: 0.662 ± 0.259
HisGln: 0.083 ± 0.081
HisArg: 0.827 ± 0.27
HisSer: 0.827 ± 0.261
HisThr: 1.075 ± 0.337
HisVal: 0.496 ± 0.184
HisTrp: 0.331 ± 0.161
HisTyr: 0.662 ± 0.232
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.128 ± 0.798
IleCys: 1.158 ± 0.327
IleAsp: 4.88 ± 0.845
IleGlu: 5.376 ± 0.65
IlePhe: 1.241 ± 0.252
IleGly: 3.639 ± 0.726
IleHis: 0.414 ± 0.176
IleIle: 3.226 ± 0.372
IleLys: 6.865 ± 0.815
IleLeu: 4.053 ± 0.666
IleMet: 1.406 ± 0.323
IleAsn: 4.053 ± 0.5
IlePro: 2.233 ± 0.44
IleGln: 2.812 ± 0.56
IleArg: 2.564 ± 0.511
IleSer: 3.887 ± 0.68
IleThr: 4.962 ± 0.901
IleVal: 3.391 ± 0.555
IleTrp: 1.158 ± 0.321
IleTyr: 2.729 ± 0.358
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.947 ± 0.739
LysCys: 1.158 ± 0.31
LysAsp: 4.135 ± 0.712
LysGlu: 7.526 ± 1.162
LysPhe: 2.481 ± 0.372
LysGly: 5.872 ± 1.034
LysHis: 1.075 ± 0.326
LysIle: 3.804 ± 0.36
LysLys: 6.699 ± 0.888
LysLeu: 5.707 ± 0.591
LysMet: 2.068 ± 0.387
LysAsn: 3.97 ± 0.526
LysPro: 2.564 ± 0.521
LysGln: 3.474 ± 0.661
LysArg: 3.97 ± 0.5
LysSer: 3.722 ± 0.5
LysThr: 5.789 ± 0.749
LysVal: 4.962 ± 0.614
LysTrp: 1.489 ± 0.356
LysTyr: 3.97 ± 0.557
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.128 ± 0.512
LeuCys: 0.91 ± 0.249
LeuAsp: 5.045 ± 0.632
LeuGlu: 5.872 ± 0.833
LeuPhe: 1.902 ± 0.441
LeuGly: 4.632 ± 0.671
LeuHis: 1.158 ± 0.35
LeuIle: 6.038 ± 0.626
LeuLys: 6.203 ± 0.758
LeuLeu: 3.97 ± 0.542
LeuMet: 1.323 ± 0.311
LeuAsn: 3.97 ± 0.556
LeuPro: 1.406 ± 0.319
LeuGln: 2.895 ± 0.553
LeuArg: 2.564 ± 0.424
LeuSer: 3.887 ± 0.498
LeuThr: 4.714 ± 0.583
LeuVal: 4.135 ± 0.716
LeuTrp: 0.496 ± 0.199
LeuTyr: 3.887 ± 0.509
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.481 ± 0.423
MetCys: 0.496 ± 0.208
MetAsp: 2.068 ± 0.444
MetGlu: 2.564 ± 0.508
MetPhe: 0.496 ± 0.218
MetGly: 1.902 ± 0.367
MetHis: 0.414 ± 0.171
MetIle: 1.241 ± 0.298
MetLys: 3.391 ± 0.593
MetLeu: 1.489 ± 0.382
MetMet: 0.827 ± 0.251
MetAsn: 1.323 ± 0.333
MetPro: 0.992 ± 0.307
MetGln: 0.992 ± 0.31
MetArg: 1.82 ± 0.474
MetSer: 1.737 ± 0.327
MetThr: 1.985 ± 0.448
MetVal: 0.91 ± 0.266
MetTrp: 0.248 ± 0.132
MetTyr: 0.496 ± 0.191
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.722 ± 0.513
AsnCys: 0.91 ± 0.305
AsnAsp: 2.895 ± 0.687
AsnGlu: 4.797 ± 0.575
AsnPhe: 1.489 ± 0.44
AsnGly: 4.714 ± 0.689
AsnHis: 1.158 ± 0.285
AsnIle: 3.722 ± 0.571
AsnLys: 3.556 ± 0.529
AsnLeu: 3.06 ± 0.568
AsnMet: 1.737 ± 0.368
AsnAsn: 3.226 ± 0.484
AsnPro: 2.398 ± 0.475
AsnGln: 2.398 ± 0.635
AsnArg: 3.06 ± 0.469
AsnSer: 2.977 ± 0.499
AsnThr: 3.308 ± 0.497
AsnVal: 3.06 ± 0.524
AsnTrp: 0.662 ± 0.259
AsnTyr: 1.654 ± 0.301
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.654 ± 0.443
ProCys: 0.331 ± 0.168
ProAsp: 1.571 ± 0.389
ProGlu: 2.481 ± 0.456
ProPhe: 0.91 ± 0.289
ProGly: 0.0 ± 0.0
ProHis: 0.579 ± 0.217
ProIle: 1.902 ± 0.356
ProLys: 1.902 ± 0.449
ProLeu: 2.895 ± 0.448
ProMet: 0.744 ± 0.276
ProAsn: 1.654 ± 0.401
ProPro: 0.744 ± 0.322
ProGln: 0.579 ± 0.253
ProArg: 1.241 ± 0.317
ProSer: 2.895 ± 0.455
ProThr: 2.316 ± 0.42
ProVal: 1.241 ± 0.402
ProTrp: 0.165 ± 0.115
ProTyr: 2.233 ± 0.503
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.316 ± 0.455
GlnCys: 0.083 ± 0.076
GlnAsp: 1.323 ± 0.305
GlnGlu: 2.564 ± 0.398
GlnPhe: 1.075 ± 0.278
GlnGly: 3.474 ± 0.521
GlnHis: 0.662 ± 0.242
GlnIle: 2.895 ± 0.483
GlnLys: 3.474 ± 0.628
GlnLeu: 2.481 ± 0.413
GlnMet: 1.737 ± 0.331
GlnAsn: 2.729 ± 0.687
GlnPro: 0.992 ± 0.337
GlnGln: 1.075 ± 0.356
GlnArg: 1.406 ± 0.313
GlnSer: 2.068 ± 0.363
GlnThr: 2.812 ± 0.587
GlnVal: 1.82 ± 0.377
GlnTrp: 0.248 ± 0.129
GlnTyr: 1.323 ± 0.345
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.143 ± 0.56
ArgCys: 0.992 ± 0.304
ArgAsp: 2.15 ± 0.526
ArgGlu: 5.128 ± 0.734
ArgPhe: 1.654 ± 0.341
ArgGly: 3.391 ± 0.561
ArgHis: 0.744 ± 0.245
ArgIle: 2.481 ± 0.424
ArgLys: 3.474 ± 0.492
ArgLeu: 3.804 ± 0.589
ArgMet: 0.91 ± 0.274
ArgAsn: 2.068 ± 0.389
ArgPro: 0.744 ± 0.227
ArgGln: 1.82 ± 0.401
ArgArg: 1.985 ± 0.45
ArgSer: 1.902 ± 0.392
ArgThr: 1.075 ± 0.315
ArgVal: 2.481 ± 0.578
ArgTrp: 0.662 ± 0.251
ArgTyr: 2.564 ± 0.464
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.714 ± 0.671
SerCys: 0.91 ± 0.301
SerAsp: 3.391 ± 0.46
SerGlu: 5.128 ± 0.683
SerPhe: 2.895 ± 0.416
SerGly: 4.383 ± 0.576
SerHis: 0.496 ± 0.22
SerIle: 3.308 ± 0.502
SerLys: 4.383 ± 0.584
SerLeu: 3.97 ± 0.764
SerMet: 1.406 ± 0.559
SerAsn: 3.97 ± 0.489
SerPro: 0.992 ± 0.362
SerGln: 2.233 ± 0.539
SerArg: 1.985 ± 0.381
SerSer: 2.316 ± 0.541
SerThr: 2.729 ± 0.616
SerVal: 3.887 ± 0.614
SerTrp: 0.744 ± 0.28
SerTyr: 2.729 ± 0.485
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.947 ± 1.122
ThrCys: 1.406 ± 0.297
ThrAsp: 3.639 ± 0.503
ThrGlu: 5.955 ± 0.589
ThrPhe: 2.647 ± 0.629
ThrGly: 3.804 ± 0.818
ThrHis: 0.496 ± 0.204
ThrIle: 5.21 ± 0.844
ThrLys: 4.88 ± 0.634
ThrLeu: 3.97 ± 0.573
ThrMet: 1.241 ± 0.283
ThrAsn: 2.895 ± 0.546
ThrPro: 2.233 ± 0.483
ThrGln: 2.068 ± 0.496
ThrArg: 2.398 ± 0.36
ThrSer: 4.301 ± 0.856
ThrThr: 4.301 ± 0.87
ThrVal: 4.549 ± 0.757
ThrTrp: 1.075 ± 0.227
ThrTyr: 2.729 ± 0.452
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.383 ± 0.574
ValCys: 0.992 ± 0.244
ValAsp: 3.639 ± 0.456
ValGlu: 5.128 ± 0.684
ValPhe: 2.481 ± 0.431
ValGly: 3.722 ± 0.527
ValHis: 0.496 ± 0.216
ValIle: 2.895 ± 0.519
ValLys: 3.722 ± 0.565
ValLeu: 3.308 ± 0.396
ValMet: 2.15 ± 0.395
ValAsn: 2.564 ± 0.541
ValPro: 2.481 ± 0.458
ValGln: 2.564 ± 0.474
ValArg: 2.068 ± 0.343
ValSer: 4.135 ± 0.513
ValThr: 6.12 ± 0.871
ValVal: 3.06 ± 0.598
ValTrp: 0.662 ± 0.226
ValTyr: 2.895 ± 0.526
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.91 ± 0.289
TrpCys: 0.0 ± 0.0
TrpAsp: 0.827 ± 0.266
TrpGlu: 0.992 ± 0.306
TrpPhe: 0.331 ± 0.153
TrpGly: 0.662 ± 0.224
TrpHis: 0.083 ± 0.085
TrpIle: 1.158 ± 0.264
TrpLys: 0.662 ± 0.251
TrpLeu: 1.571 ± 0.334
TrpMet: 0.248 ± 0.128
TrpAsn: 0.91 ± 0.239
TrpPro: 0.0 ± 0.0
TrpGln: 0.91 ± 0.265
TrpArg: 0.91 ± 0.426
TrpSer: 1.406 ± 0.313
TrpThr: 0.331 ± 0.149
TrpVal: 0.744 ± 0.203
TrpTrp: 0.414 ± 0.189
TrpTyr: 0.662 ± 0.242
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.895 ± 0.449
TyrCys: 1.158 ± 0.325
TyrAsp: 3.06 ± 0.548
TyrGlu: 3.639 ± 0.477
TyrPhe: 1.82 ± 0.405
TyrGly: 3.391 ± 0.484
TyrHis: 0.414 ± 0.183
TyrIle: 2.729 ± 0.483
TyrLys: 4.714 ± 0.549
TyrLeu: 3.887 ± 0.488
TyrMet: 1.241 ± 0.287
TyrAsn: 2.481 ± 0.421
TyrPro: 1.158 ± 0.319
TyrGln: 1.489 ± 0.349
TyrArg: 1.985 ± 0.383
TyrSer: 2.481 ± 0.589
TyrThr: 2.977 ± 0.476
TyrVal: 2.895 ± 0.555
TyrTrp: 0.662 ± 0.23
TyrTyr: 1.489 ± 0.353
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (12092 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
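A protein-level bootstrap of this kind can be sketched in Python as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting values is taken as the error. The function names, the toy sequences, and the choice of the standard deviation as the error measure are assumptions made for this example; the page only states that 100 bootstrap rounds at the protein level were used.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)

# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKKLA", "AKK", "GAKKT"]
kk_error = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins show a correspondingly larger error.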

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski