Amino acid dipepetide frequency for Rhodococcus virus RGL3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.097AlaAla: 10.097 ± 1.403
0.841AlaCys: 0.841 ± 0.257
6.381AlaAsp: 6.381 ± 0.711
7.853AlaGlu: 7.853 ± 0.834
4.137AlaPhe: 4.137 ± 0.521
7.012AlaGly: 7.012 ± 0.869
2.524AlaHis: 2.524 ± 0.425
4.067AlaIle: 4.067 ± 0.839
3.436AlaLys: 3.436 ± 0.562
9.466AlaLeu: 9.466 ± 0.831
1.893AlaMet: 1.893 ± 0.407
2.664AlaAsn: 2.664 ± 0.562
3.786AlaPro: 3.786 ± 0.579
3.997AlaGln: 3.997 ± 0.507
6.801AlaArg: 6.801 ± 0.762
5.679AlaSer: 5.679 ± 0.608
5.89AlaThr: 5.89 ± 0.602
7.222AlaVal: 7.222 ± 0.78
1.332AlaTrp: 1.332 ± 0.362
2.384AlaTyr: 2.384 ± 0.355
0.0AlaXaa: 0.0 ± 0.0
Cys
0.561CysAla: 0.561 ± 0.249
0.0CysCys: 0.0 ± 0.0
0.351CysAsp: 0.351 ± 0.137
1.052CysGlu: 1.052 ± 0.379
0.28CysPhe: 0.28 ± 0.126
0.771CysGly: 0.771 ± 0.21
0.14CysHis: 0.14 ± 0.098
0.351CysIle: 0.351 ± 0.182
0.421CysLys: 0.421 ± 0.166
0.771CysLeu: 0.771 ± 0.224
0.07CysMet: 0.07 ± 0.06
0.351CysAsn: 0.351 ± 0.142
0.421CysPro: 0.421 ± 0.166
0.421CysGln: 0.421 ± 0.174
0.701CysArg: 0.701 ± 0.254
0.701CysSer: 0.701 ± 0.242
0.21CysThr: 0.21 ± 0.119
0.491CysVal: 0.491 ± 0.218
0.07CysTrp: 0.07 ± 0.058
0.561CysTyr: 0.561 ± 0.217
0.0CysXaa: 0.0 ± 0.0
Asp
7.222AspAla: 7.222 ± 0.794
0.421AspCys: 0.421 ± 0.171
3.436AspAsp: 3.436 ± 0.326
5.75AspGlu: 5.75 ± 0.759
2.033AspPhe: 2.033 ± 0.359
6.521AspGly: 6.521 ± 0.639
1.472AspHis: 1.472 ± 0.328
1.262AspIle: 1.262 ± 0.276
1.893AspLys: 1.893 ± 0.429
6.661AspLeu: 6.661 ± 0.647
2.033AspMet: 2.033 ± 0.39
2.033AspAsn: 2.033 ± 0.368
4.628AspPro: 4.628 ± 0.875
2.103AspGln: 2.103 ± 0.328
2.384AspArg: 2.384 ± 0.405
3.506AspSer: 3.506 ± 0.727
3.015AspThr: 3.015 ± 0.381
4.978AspVal: 4.978 ± 0.572
1.122AspTrp: 1.122 ± 0.283
2.244AspTyr: 2.244 ± 0.371
0.0AspXaa: 0.0 ± 0.0
Glu
7.993GluAla: 7.993 ± 0.867
0.701GluCys: 0.701 ± 0.237
4.628GluAsp: 4.628 ± 0.64
4.908GluGlu: 4.908 ± 0.618
3.506GluPhe: 3.506 ± 0.502
6.661GluGly: 6.661 ± 0.765
1.753GluHis: 1.753 ± 0.308
3.366GluIle: 3.366 ± 0.555
2.454GluLys: 2.454 ± 0.42
7.853GluLeu: 7.853 ± 0.888
1.472GluMet: 1.472 ± 0.318
2.174GluAsn: 2.174 ± 0.47
2.384GluPro: 2.384 ± 0.433
2.314GluGln: 2.314 ± 0.493
4.137GluArg: 4.137 ± 0.642
4.277GluSer: 4.277 ± 0.641
4.628GluThr: 4.628 ± 0.537
6.381GluVal: 6.381 ± 0.727
1.613GluTrp: 1.613 ± 0.34
2.033GluTyr: 2.033 ± 0.419
0.0GluXaa: 0.0 ± 0.0
Phe
4.207PheAla: 4.207 ± 0.75
0.351PheCys: 0.351 ± 0.198
2.735PheAsp: 2.735 ± 0.48
2.384PheGlu: 2.384 ± 0.458
0.701PhePhe: 0.701 ± 0.196
3.155PheGly: 3.155 ± 0.484
0.631PheHis: 0.631 ± 0.21
1.472PheIle: 1.472 ± 0.433
1.052PheLys: 1.052 ± 0.252
2.454PheLeu: 2.454 ± 0.5
0.841PheMet: 0.841 ± 0.248
1.753PheAsn: 1.753 ± 0.287
1.192PhePro: 1.192 ± 0.282
1.543PheGln: 1.543 ± 0.305
2.174PheArg: 2.174 ± 0.369
2.103PheSer: 2.103 ± 0.382
1.683PheThr: 1.683 ± 0.31
3.155PheVal: 3.155 ± 0.46
0.841PheTrp: 0.841 ± 0.258
1.332PheTyr: 1.332 ± 0.294
0.0PheXaa: 0.0 ± 0.0
Gly
6.801GlyAla: 6.801 ± 0.916
0.701GlyCys: 0.701 ± 0.208
5.539GlyAsp: 5.539 ± 0.788
4.908GlyGlu: 4.908 ± 0.468
4.277GlyPhe: 4.277 ± 0.699
6.17GlyGly: 6.17 ± 0.692
1.963GlyHis: 1.963 ± 0.397
4.417GlyIle: 4.417 ± 0.54
2.735GlyLys: 2.735 ± 0.415
6.381GlyLeu: 6.381 ± 0.835
1.963GlyMet: 1.963 ± 0.641
3.155GlyAsn: 3.155 ± 0.573
4.347GlyPro: 4.347 ± 0.606
4.277GlyGln: 4.277 ± 0.534
5.048GlyArg: 5.048 ± 0.617
4.417GlySer: 4.417 ± 0.713
5.259GlyThr: 5.259 ± 0.648
6.521GlyVal: 6.521 ± 0.609
2.244GlyTrp: 2.244 ± 0.456
3.225GlyTyr: 3.225 ± 0.534
0.0GlyXaa: 0.0 ± 0.0
His
1.963HisAla: 1.963 ± 0.366
0.07HisCys: 0.07 ± 0.067
1.192HisAsp: 1.192 ± 0.26
2.314HisGlu: 2.314 ± 0.545
0.771HisPhe: 0.771 ± 0.213
1.262HisGly: 1.262 ± 0.338
0.421HisHis: 0.421 ± 0.178
1.192HisIle: 1.192 ± 0.285
0.771HisLys: 0.771 ± 0.278
1.543HisLeu: 1.543 ± 0.357
0.491HisMet: 0.491 ± 0.156
0.561HisAsn: 0.561 ± 0.157
1.402HisPro: 1.402 ± 0.388
0.631HisGln: 0.631 ± 0.208
2.033HisArg: 2.033 ± 0.466
1.122HisSer: 1.122 ± 0.286
1.122HisThr: 1.122 ± 0.337
1.472HisVal: 1.472 ± 0.308
0.421HisTrp: 0.421 ± 0.18
0.631HisTyr: 0.631 ± 0.193
0.0HisXaa: 0.0 ± 0.0
Ile
5.189IleAla: 5.189 ± 0.865
0.421IleCys: 0.421 ± 0.2
3.436IleAsp: 3.436 ± 0.583
3.716IleGlu: 3.716 ± 0.48
0.912IlePhe: 0.912 ± 0.326
4.067IleGly: 4.067 ± 0.663
1.332IleHis: 1.332 ± 0.345
1.893IleIle: 1.893 ± 0.41
1.543IleLys: 1.543 ± 0.35
2.875IleLeu: 2.875 ± 0.46
0.421IleMet: 0.421 ± 0.171
1.332IleAsn: 1.332 ± 0.323
2.805IlePro: 2.805 ± 0.502
0.841IleGln: 0.841 ± 0.218
3.366IleArg: 3.366 ± 0.412
1.963IleSer: 1.963 ± 0.39
3.927IleThr: 3.927 ± 0.519
2.945IleVal: 2.945 ± 0.541
0.421IleTrp: 0.421 ± 0.177
1.472IleTyr: 1.472 ± 0.314
0.0IleXaa: 0.0 ± 0.0
Lys
3.716LysAla: 3.716 ± 0.559
0.28LysCys: 0.28 ± 0.112
2.033LysAsp: 2.033 ± 0.316
2.454LysGlu: 2.454 ± 0.417
1.262LysPhe: 1.262 ± 0.298
3.155LysGly: 3.155 ± 0.433
0.771LysHis: 0.771 ± 0.212
1.893LysIle: 1.893 ± 0.345
1.402LysLys: 1.402 ± 0.345
3.225LysLeu: 3.225 ± 0.489
0.561LysMet: 0.561 ± 0.237
1.122LysAsn: 1.122 ± 0.313
1.893LysPro: 1.893 ± 0.427
0.421LysGln: 0.421 ± 0.154
2.875LysArg: 2.875 ± 0.504
2.384LysSer: 2.384 ± 0.374
1.963LysThr: 1.963 ± 0.435
3.436LysVal: 3.436 ± 0.598
0.561LysTrp: 0.561 ± 0.173
1.472LysTyr: 1.472 ± 0.374
0.0LysXaa: 0.0 ± 0.0
Leu
8.484LeuAla: 8.484 ± 0.919
0.421LeuCys: 0.421 ± 0.178
5.609LeuAsp: 5.609 ± 0.673
6.381LeuGlu: 6.381 ± 0.78
2.594LeuPhe: 2.594 ± 0.395
7.082LeuGly: 7.082 ± 1.044
1.963LeuHis: 1.963 ± 0.427
3.295LeuIle: 3.295 ± 0.437
3.295LeuLys: 3.295 ± 0.399
5.118LeuLeu: 5.118 ± 0.788
2.103LeuMet: 2.103 ± 0.453
2.454LeuAsn: 2.454 ± 0.461
3.927LeuPro: 3.927 ± 0.438
2.314LeuGln: 2.314 ± 0.305
4.628LeuArg: 4.628 ± 0.606
5.82LeuSer: 5.82 ± 0.676
6.871LeuThr: 6.871 ± 0.751
6.801LeuVal: 6.801 ± 0.772
1.683LeuTrp: 1.683 ± 0.399
2.103LeuTyr: 2.103 ± 0.343
0.0LeuXaa: 0.0 ± 0.0
Met
2.174MetAla: 2.174 ± 0.4
0.0MetCys: 0.0 ± 0.0
1.192MetAsp: 1.192 ± 0.295
1.262MetGlu: 1.262 ± 0.308
0.841MetPhe: 0.841 ± 0.255
2.174MetGly: 2.174 ± 0.325
0.491MetHis: 0.491 ± 0.192
1.402MetIle: 1.402 ± 0.334
1.052MetLys: 1.052 ± 0.254
1.543MetLeu: 1.543 ± 0.343
0.28MetMet: 0.28 ± 0.125
0.912MetAsn: 0.912 ± 0.281
1.122MetPro: 1.122 ± 0.3
0.491MetGln: 0.491 ± 0.292
1.613MetArg: 1.613 ± 0.431
1.753MetSer: 1.753 ± 0.321
2.735MetThr: 2.735 ± 0.431
1.192MetVal: 1.192 ± 0.317
0.421MetTrp: 0.421 ± 0.174
0.771MetTyr: 0.771 ± 0.25
0.0MetXaa: 0.0 ± 0.0
Asn
3.295AsnAla: 3.295 ± 0.632
0.561AsnCys: 0.561 ± 0.202
2.314AsnAsp: 2.314 ± 0.46
2.103AsnGlu: 2.103 ± 0.369
1.192AsnPhe: 1.192 ± 0.297
3.576AsnGly: 3.576 ± 0.586
0.912AsnHis: 0.912 ± 0.181
1.192AsnIle: 1.192 ± 0.286
1.122AsnLys: 1.122 ± 0.293
2.244AsnLeu: 2.244 ± 0.373
0.982AsnMet: 0.982 ± 0.247
1.472AsnAsn: 1.472 ± 0.317
2.103AsnPro: 2.103 ± 0.301
0.771AsnGln: 0.771 ± 0.266
2.103AsnArg: 2.103 ± 0.409
1.683AsnSer: 1.683 ± 0.301
1.893AsnThr: 1.893 ± 0.321
2.594AsnVal: 2.594 ± 0.362
0.491AsnTrp: 0.491 ± 0.182
1.122AsnTyr: 1.122 ± 0.268
0.0AsnXaa: 0.0 ± 0.0
Pro
4.487ProAla: 4.487 ± 0.543
0.491ProCys: 0.491 ± 0.206
2.945ProAsp: 2.945 ± 0.516
4.558ProGlu: 4.558 ± 0.581
1.262ProPhe: 1.262 ± 0.259
3.997ProGly: 3.997 ± 0.479
0.841ProHis: 0.841 ± 0.238
2.594ProIle: 2.594 ± 0.405
1.332ProLys: 1.332 ± 0.269
3.436ProLeu: 3.436 ± 0.426
1.332ProMet: 1.332 ± 0.247
1.683ProAsn: 1.683 ± 0.325
1.613ProPro: 1.613 ± 0.325
2.033ProGln: 2.033 ± 0.299
2.664ProArg: 2.664 ± 0.486
2.384ProSer: 2.384 ± 0.507
3.436ProThr: 3.436 ± 0.45
3.997ProVal: 3.997 ± 0.532
1.052ProTrp: 1.052 ± 0.278
1.543ProTyr: 1.543 ± 0.3
0.0ProXaa: 0.0 ± 0.0
Gln
3.576GlnAla: 3.576 ± 0.425
0.28GlnCys: 0.28 ± 0.117
1.823GlnAsp: 1.823 ± 0.354
1.613GlnGlu: 1.613 ± 0.332
1.052GlnPhe: 1.052 ± 0.283
2.244GlnGly: 2.244 ± 0.419
0.701GlnHis: 0.701 ± 0.247
2.664GlnIle: 2.664 ± 0.495
1.192GlnLys: 1.192 ± 0.276
2.524GlnLeu: 2.524 ± 0.559
0.982GlnMet: 0.982 ± 0.25
0.912GlnAsn: 0.912 ± 0.239
1.332GlnPro: 1.332 ± 0.383
1.262GlnGln: 1.262 ± 0.332
1.963GlnArg: 1.963 ± 0.356
1.963GlnSer: 1.963 ± 0.315
1.963GlnThr: 1.963 ± 0.315
3.576GlnVal: 3.576 ± 0.455
0.631GlnTrp: 0.631 ± 0.221
0.841GlnTyr: 0.841 ± 0.32
0.0GlnXaa: 0.0 ± 0.0
Arg
5.539ArgAla: 5.539 ± 0.661
1.052ArgCys: 1.052 ± 0.258
4.417ArgAsp: 4.417 ± 0.647
4.277ArgGlu: 4.277 ± 0.552
2.244ArgPhe: 2.244 ± 0.475
4.417ArgGly: 4.417 ± 0.482
0.841ArgHis: 0.841 ± 0.276
2.875ArgIle: 2.875 ± 0.468
3.225ArgLys: 3.225 ± 0.574
5.259ArgLeu: 5.259 ± 0.609
1.543ArgMet: 1.543 ± 0.273
2.384ArgAsn: 2.384 ± 0.416
2.664ArgPro: 2.664 ± 0.36
2.033ArgGln: 2.033 ± 0.362
4.768ArgArg: 4.768 ± 0.651
3.997ArgSer: 3.997 ± 0.504
3.225ArgThr: 3.225 ± 0.459
4.067ArgVal: 4.067 ± 0.549
1.613ArgTrp: 1.613 ± 0.31
1.963ArgTyr: 1.963 ± 0.531
0.0ArgXaa: 0.0 ± 0.0
Ser
5.259SerAla: 5.259 ± 0.809
0.561SerCys: 0.561 ± 0.206
4.067SerAsp: 4.067 ± 0.544
3.927SerGlu: 3.927 ± 0.653
2.384SerPhe: 2.384 ± 0.407
5.609SerGly: 5.609 ± 0.712
1.332SerHis: 1.332 ± 0.308
1.893SerIle: 1.893 ± 0.362
1.963SerLys: 1.963 ± 0.361
4.978SerLeu: 4.978 ± 0.784
2.103SerMet: 2.103 ± 0.326
2.174SerAsn: 2.174 ± 0.416
2.244SerPro: 2.244 ± 0.442
1.122SerGln: 1.122 ± 0.203
3.646SerArg: 3.646 ± 0.493
3.155SerSer: 3.155 ± 0.532
3.436SerThr: 3.436 ± 0.462
5.118SerVal: 5.118 ± 0.808
0.912SerTrp: 0.912 ± 0.371
1.402SerTyr: 1.402 ± 0.338
0.0SerXaa: 0.0 ± 0.0
Thr
5.048ThrAla: 5.048 ± 0.476
0.491ThrCys: 0.491 ± 0.188
4.137ThrAsp: 4.137 ± 0.635
4.417ThrGlu: 4.417 ± 0.664
2.033ThrPhe: 2.033 ± 0.373
6.24ThrGly: 6.24 ± 1.07
1.122ThrHis: 1.122 ± 0.313
3.295ThrIle: 3.295 ± 0.453
3.015ThrLys: 3.015 ± 0.436
5.75ThrLeu: 5.75 ± 0.444
0.771ThrMet: 0.771 ± 0.276
1.823ThrAsn: 1.823 ± 0.288
4.137ThrPro: 4.137 ± 0.555
2.244ThrGln: 2.244 ± 0.377
2.875ThrArg: 2.875 ± 0.496
2.945ThrSer: 2.945 ± 0.412
3.997ThrThr: 3.997 ± 0.764
4.908ThrVal: 4.908 ± 0.513
1.192ThrTrp: 1.192 ± 0.255
2.454ThrTyr: 2.454 ± 0.399
0.0ThrXaa: 0.0 ± 0.0
Val
7.222ValAla: 7.222 ± 0.778
0.701ValCys: 0.701 ± 0.255
4.628ValAsp: 4.628 ± 0.486
7.362ValGlu: 7.362 ± 0.793
3.015ValPhe: 3.015 ± 0.472
5.609ValGly: 5.609 ± 0.662
1.122ValHis: 1.122 ± 0.339
3.225ValIle: 3.225 ± 0.512
3.646ValLys: 3.646 ± 0.461
6.661ValLeu: 6.661 ± 0.742
1.963ValMet: 1.963 ± 0.417
2.805ValAsn: 2.805 ± 0.361
4.207ValPro: 4.207 ± 0.568
2.875ValGln: 2.875 ± 0.373
5.399ValArg: 5.399 ± 0.59
4.417ValSer: 4.417 ± 0.604
4.978ValThr: 4.978 ± 0.571
4.698ValVal: 4.698 ± 0.618
1.262ValTrp: 1.262 ± 0.339
1.893ValTyr: 1.893 ± 0.455
0.0ValXaa: 0.0 ± 0.0
Trp
1.683TrpAla: 1.683 ± 0.392
0.14TrpCys: 0.14 ± 0.088
1.472TrpAsp: 1.472 ± 0.266
1.402TrpGlu: 1.402 ± 0.298
0.561TrpPhe: 0.561 ± 0.208
1.753TrpGly: 1.753 ± 0.289
0.631TrpHis: 0.631 ± 0.227
1.192TrpIle: 1.192 ± 0.271
0.561TrpLys: 0.561 ± 0.2
1.052TrpLeu: 1.052 ± 0.227
0.631TrpMet: 0.631 ± 0.234
1.122TrpAsn: 1.122 ± 0.345
0.631TrpPro: 0.631 ± 0.189
0.491TrpGln: 0.491 ± 0.176
0.631TrpArg: 0.631 ± 0.176
1.262TrpSer: 1.262 ± 0.333
0.982TrpThr: 0.982 ± 0.277
1.543TrpVal: 1.543 ± 0.329
0.21TrpTrp: 0.21 ± 0.106
0.561TrpTyr: 0.561 ± 0.184
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.945TyrAla: 2.945 ± 0.429
0.21TyrCys: 0.21 ± 0.122
2.384TyrAsp: 2.384 ± 0.363
2.594TyrGlu: 2.594 ± 0.515
0.701TyrPhe: 0.701 ± 0.19
2.945TyrGly: 2.945 ± 0.49
0.421TyrHis: 0.421 ± 0.132
1.332TyrIle: 1.332 ± 0.315
0.771TyrLys: 0.771 ± 0.239
3.155TyrLeu: 3.155 ± 0.452
0.912TyrMet: 0.912 ± 0.228
0.841TyrAsn: 0.841 ± 0.275
0.841TyrPro: 0.841 ± 0.248
0.912TyrGln: 0.912 ± 0.254
2.594TyrArg: 2.594 ± 0.461
1.753TyrSer: 1.753 ± 0.425
1.683TyrThr: 1.683 ± 0.346
2.594TyrVal: 2.594 ± 0.523
0.421TyrTrp: 0.421 ± 0.156
0.982TyrTyr: 0.982 ± 0.25
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (14263 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski