Amino acid dipepetide frequency for Broome virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.597AlaAla: 5.597 ± 1.184
1.229AlaCys: 1.229 ± 0.35
3.276AlaAsp: 3.276 ± 0.616
2.457AlaGlu: 2.457 ± 0.403
3.276AlaPhe: 3.276 ± 0.838
3.549AlaGly: 3.549 ± 0.24
0.683AlaHis: 0.683 ± 0.195
4.914AlaIle: 4.914 ± 0.769
2.73AlaLys: 2.73 ± 0.69
7.098AlaLeu: 7.098 ± 0.808
3.413AlaMet: 3.413 ± 0.711
3.14AlaAsn: 3.14 ± 0.571
4.641AlaPro: 4.641 ± 0.919
1.911AlaGln: 1.911 ± 0.446
6.143AlaArg: 6.143 ± 0.632
4.778AlaSer: 4.778 ± 0.651
3.959AlaThr: 3.959 ± 0.396
5.187AlaVal: 5.187 ± 0.487
0.956AlaTrp: 0.956 ± 0.401
3.276AlaTyr: 3.276 ± 0.466
0.0AlaXaa: 0.0 ± 0.0
Cys
0.956CysAla: 0.956 ± 0.2
0.273CysCys: 0.273 ± 0.286
1.229CysAsp: 1.229 ± 0.468
0.683CysGlu: 0.683 ± 0.202
0.819CysPhe: 0.819 ± 0.417
1.092CysGly: 1.092 ± 0.346
0.41CysHis: 0.41 ± 0.206
0.546CysIle: 0.546 ± 0.204
0.819CysLys: 0.819 ± 0.315
2.594CysLeu: 2.594 ± 0.862
0.546CysMet: 0.546 ± 0.278
0.683CysAsn: 0.683 ± 0.178
0.819CysPro: 0.819 ± 0.248
0.546CysGln: 0.546 ± 0.371
0.819CysArg: 0.819 ± 0.373
0.956CysSer: 0.956 ± 0.311
1.092CysThr: 1.092 ± 0.404
1.092CysVal: 1.092 ± 0.287
0.273CysTrp: 0.273 ± 0.19
0.956CysTyr: 0.956 ± 0.283
0.0CysXaa: 0.0 ± 0.0
Asp
5.46AspAla: 5.46 ± 0.427
0.683AspCys: 0.683 ± 0.284
4.232AspAsp: 4.232 ± 0.477
3.276AspGlu: 3.276 ± 0.834
3.14AspPhe: 3.14 ± 0.403
5.597AspGly: 5.597 ± 0.841
1.775AspHis: 1.775 ± 0.339
3.276AspIle: 3.276 ± 0.583
2.048AspLys: 2.048 ± 0.498
5.187AspLeu: 5.187 ± 0.971
2.321AspMet: 2.321 ± 0.436
1.502AspAsn: 1.502 ± 0.429
3.14AspPro: 3.14 ± 0.501
1.229AspGln: 1.229 ± 0.376
3.959AspArg: 3.959 ± 0.614
2.594AspSer: 2.594 ± 0.448
2.594AspThr: 2.594 ± 0.66
7.781AspVal: 7.781 ± 1.119
0.546AspTrp: 0.546 ± 0.182
2.457AspTyr: 2.457 ± 0.615
0.0AspXaa: 0.0 ± 0.0
Glu
2.321GluAla: 2.321 ± 0.359
0.819GluCys: 0.819 ± 0.628
1.775GluAsp: 1.775 ± 0.542
1.775GluGlu: 1.775 ± 0.638
1.502GluPhe: 1.502 ± 0.495
2.457GluGly: 2.457 ± 0.476
1.502GluHis: 1.502 ± 0.593
2.73GluIle: 2.73 ± 0.332
1.775GluLys: 1.775 ± 0.557
4.914GluLeu: 4.914 ± 0.453
1.229GluMet: 1.229 ± 0.541
0.683GluAsn: 0.683 ± 0.299
1.911GluPro: 1.911 ± 0.533
2.184GluGln: 2.184 ± 0.657
3.959GluArg: 3.959 ± 0.628
3.686GluSer: 3.686 ± 0.826
2.321GluThr: 2.321 ± 0.548
4.095GluVal: 4.095 ± 0.948
1.638GluTrp: 1.638 ± 0.486
1.911GluTyr: 1.911 ± 0.276
0.0GluXaa: 0.0 ± 0.0
Phe
1.502PheAla: 1.502 ± 0.445
0.819PheCys: 0.819 ± 0.329
3.003PheAsp: 3.003 ± 0.411
2.867PheGlu: 2.867 ± 0.496
0.956PhePhe: 0.956 ± 0.286
3.14PheGly: 3.14 ± 0.439
0.683PheHis: 0.683 ± 0.407
2.321PheIle: 2.321 ± 0.528
1.502PheLys: 1.502 ± 0.336
4.232PheLeu: 4.232 ± 0.314
1.229PheMet: 1.229 ± 0.337
3.413PheAsn: 3.413 ± 0.544
2.184PhePro: 2.184 ± 0.423
1.365PheGln: 1.365 ± 0.268
1.911PheArg: 1.911 ± 0.487
3.549PheSer: 3.549 ± 0.509
1.638PheThr: 1.638 ± 0.259
2.048PheVal: 2.048 ± 0.606
0.683PheTrp: 0.683 ± 0.284
1.638PheTyr: 1.638 ± 0.32
0.0PheXaa: 0.0 ± 0.0
Gly
4.778GlyAla: 4.778 ± 0.594
0.819GlyCys: 0.819 ± 0.353
4.095GlyAsp: 4.095 ± 0.28
3.276GlyGlu: 3.276 ± 0.701
2.457GlyPhe: 2.457 ± 0.32
2.73GlyGly: 2.73 ± 0.535
0.956GlyHis: 0.956 ± 0.371
3.14GlyIle: 3.14 ± 0.396
1.638GlyLys: 1.638 ± 0.429
6.279GlyLeu: 6.279 ± 0.765
2.184GlyMet: 2.184 ± 0.522
2.867GlyAsn: 2.867 ± 0.368
2.048GlyPro: 2.048 ± 0.505
3.003GlyGln: 3.003 ± 0.536
3.14GlyArg: 3.14 ± 0.509
3.549GlySer: 3.549 ± 0.458
1.775GlyThr: 1.775 ± 0.451
5.051GlyVal: 5.051 ± 1.29
1.092GlyTrp: 1.092 ± 0.384
2.184GlyTyr: 2.184 ± 0.474
0.0GlyXaa: 0.0 ± 0.0
His
1.638HisAla: 1.638 ± 0.365
0.546HisCys: 0.546 ± 0.18
2.457HisAsp: 2.457 ± 0.532
1.775HisGlu: 1.775 ± 0.698
0.41HisPhe: 0.41 ± 0.267
1.638HisGly: 1.638 ± 0.389
0.41HisHis: 0.41 ± 0.279
0.683HisIle: 0.683 ± 0.139
0.273HisLys: 0.273 ± 0.168
2.594HisLeu: 2.594 ± 0.884
1.229HisMet: 1.229 ± 0.339
0.683HisAsn: 0.683 ± 0.251
1.775HisPro: 1.775 ± 0.543
1.365HisGln: 1.365 ± 0.436
1.092HisArg: 1.092 ± 0.452
1.229HisSer: 1.229 ± 0.331
0.956HisThr: 0.956 ± 0.502
2.457HisVal: 2.457 ± 0.389
0.41HisTrp: 0.41 ± 0.265
0.819HisTyr: 0.819 ± 0.433
0.0HisXaa: 0.0 ± 0.0
Ile
5.187IleAla: 5.187 ± 0.72
1.775IleCys: 1.775 ± 0.328
5.324IleAsp: 5.324 ± 1.063
2.594IleGlu: 2.594 ± 0.398
1.365IlePhe: 1.365 ± 0.514
3.549IleGly: 3.549 ± 0.589
1.092IleHis: 1.092 ± 0.352
2.184IleIle: 2.184 ± 0.537
2.048IleLys: 2.048 ± 0.489
5.187IleLeu: 5.187 ± 0.661
2.321IleMet: 2.321 ± 0.572
3.14IleAsn: 3.14 ± 0.434
3.276IlePro: 3.276 ± 0.591
1.365IleGln: 1.365 ± 0.429
3.959IleArg: 3.959 ± 0.444
3.822IleSer: 3.822 ± 0.498
2.867IleThr: 2.867 ± 0.44
2.867IleVal: 2.867 ± 0.453
0.546IleTrp: 0.546 ± 0.206
1.775IleTyr: 1.775 ± 0.426
0.0IleXaa: 0.0 ± 0.0
Lys
2.048LysAla: 2.048 ± 0.559
0.273LysCys: 0.273 ± 0.147
3.003LysAsp: 3.003 ± 0.935
1.775LysGlu: 1.775 ± 0.538
1.092LysPhe: 1.092 ± 0.334
1.775LysGly: 1.775 ± 0.349
0.956LysHis: 0.956 ± 0.336
1.229LysIle: 1.229 ± 0.444
1.911LysLys: 1.911 ± 0.821
3.822LysLeu: 3.822 ± 0.842
1.092LysMet: 1.092 ± 0.24
1.229LysAsn: 1.229 ± 0.295
1.229LysPro: 1.229 ± 0.277
1.365LysGln: 1.365 ± 0.359
1.775LysArg: 1.775 ± 0.738
2.321LysSer: 2.321 ± 0.703
1.775LysThr: 1.775 ± 0.25
3.003LysVal: 3.003 ± 0.922
0.819LysTrp: 0.819 ± 0.241
1.911LysTyr: 1.911 ± 0.716
0.0LysXaa: 0.0 ± 0.0
Leu
7.235LeuAla: 7.235 ± 0.708
1.502LeuCys: 1.502 ± 0.372
5.733LeuAsp: 5.733 ± 1.042
5.051LeuGlu: 5.051 ± 0.736
3.822LeuPhe: 3.822 ± 0.503
4.368LeuGly: 4.368 ± 0.351
3.003LeuHis: 3.003 ± 0.528
4.914LeuIle: 4.914 ± 0.833
4.505LeuLys: 4.505 ± 1.316
8.736LeuLeu: 8.736 ± 1.263
3.276LeuMet: 3.276 ± 0.804
5.187LeuAsn: 5.187 ± 0.568
6.143LeuPro: 6.143 ± 0.935
4.368LeuGln: 4.368 ± 0.71
7.644LeuArg: 7.644 ± 1.193
8.19LeuSer: 8.19 ± 0.995
6.689LeuThr: 6.689 ± 0.981
4.505LeuVal: 4.505 ± 0.91
0.956LeuTrp: 0.956 ± 0.291
3.14LeuTyr: 3.14 ± 0.935
0.0LeuXaa: 0.0 ± 0.0
Met
3.003MetAla: 3.003 ± 0.463
0.683MetCys: 0.683 ± 0.32
1.775MetAsp: 1.775 ± 0.419
1.092MetGlu: 1.092 ± 0.588
1.775MetPhe: 1.775 ± 0.427
2.048MetGly: 2.048 ± 0.888
0.41MetHis: 0.41 ± 0.171
2.184MetIle: 2.184 ± 0.598
0.546MetLys: 0.546 ± 0.293
3.003MetLeu: 3.003 ± 0.608
1.365MetMet: 1.365 ± 0.655
1.638MetAsn: 1.638 ± 0.262
1.365MetPro: 1.365 ± 0.359
0.956MetGln: 0.956 ± 0.39
2.594MetArg: 2.594 ± 0.548
3.549MetSer: 3.549 ± 0.484
2.184MetThr: 2.184 ± 0.81
1.502MetVal: 1.502 ± 0.457
0.41MetTrp: 0.41 ± 0.213
1.365MetTyr: 1.365 ± 0.428
0.0MetXaa: 0.0 ± 0.0
Asn
4.368AsnAla: 4.368 ± 1.116
0.546AsnCys: 0.546 ± 0.206
3.549AsnAsp: 3.549 ± 0.755
3.003AsnGlu: 3.003 ± 0.97
1.502AsnPhe: 1.502 ± 0.404
2.73AsnGly: 2.73 ± 0.446
1.911AsnHis: 1.911 ± 0.597
2.321AsnIle: 2.321 ± 0.721
1.092AsnLys: 1.092 ± 0.401
3.959AsnLeu: 3.959 ± 0.491
1.229AsnMet: 1.229 ± 0.362
2.048AsnAsn: 2.048 ± 0.317
2.594AsnPro: 2.594 ± 0.837
2.184AsnGln: 2.184 ± 0.414
1.911AsnArg: 1.911 ± 0.435
2.594AsnSer: 2.594 ± 0.688
2.457AsnThr: 2.457 ± 0.25
4.914AsnVal: 4.914 ± 0.672
1.092AsnTrp: 1.092 ± 0.464
2.321AsnTyr: 2.321 ± 0.538
0.0AsnXaa: 0.0 ± 0.0
Pro
2.457ProAla: 2.457 ± 0.647
0.819ProCys: 0.819 ± 0.332
3.413ProAsp: 3.413 ± 0.448
2.321ProGlu: 2.321 ± 0.606
3.14ProPhe: 3.14 ± 0.628
1.911ProGly: 1.911 ± 0.372
0.546ProHis: 0.546 ± 0.18
4.778ProIle: 4.778 ± 0.454
1.092ProLys: 1.092 ± 0.37
4.914ProLeu: 4.914 ± 1.125
1.092ProMet: 1.092 ± 0.292
3.14ProAsn: 3.14 ± 0.534
3.14ProPro: 3.14 ± 0.447
1.092ProGln: 1.092 ± 0.433
3.276ProArg: 3.276 ± 0.467
4.095ProSer: 4.095 ± 0.537
4.095ProThr: 4.095 ± 0.884
3.959ProVal: 3.959 ± 0.747
0.683ProTrp: 0.683 ± 0.252
0.819ProTyr: 0.819 ± 0.226
0.0ProXaa: 0.0 ± 0.0
Gln
2.321GlnAla: 2.321 ± 0.704
0.41GlnCys: 0.41 ± 0.179
1.775GlnAsp: 1.775 ± 0.601
0.819GlnGlu: 0.819 ± 0.434
1.229GlnPhe: 1.229 ± 0.417
1.229GlnGly: 1.229 ± 0.411
0.956GlnHis: 0.956 ± 0.227
1.638GlnIle: 1.638 ± 0.342
1.229GlnLys: 1.229 ± 0.526
4.778GlnLeu: 4.778 ± 0.643
0.819GlnMet: 0.819 ± 0.461
1.092GlnAsn: 1.092 ± 0.484
1.229GlnPro: 1.229 ± 0.531
1.229GlnGln: 1.229 ± 0.375
3.003GlnArg: 3.003 ± 0.588
3.003GlnSer: 3.003 ± 0.796
2.73GlnThr: 2.73 ± 0.409
2.048GlnVal: 2.048 ± 0.368
0.956GlnTrp: 0.956 ± 0.185
2.321GlnTyr: 2.321 ± 0.672
0.0GlnXaa: 0.0 ± 0.0
Arg
5.324ArgAla: 5.324 ± 0.666
1.502ArgCys: 1.502 ± 0.382
3.14ArgAsp: 3.14 ± 0.91
2.594ArgGlu: 2.594 ± 0.551
2.321ArgPhe: 2.321 ± 0.687
4.095ArgGly: 4.095 ± 0.465
1.775ArgHis: 1.775 ± 0.422
3.413ArgIle: 3.413 ± 0.593
1.775ArgLys: 1.775 ± 0.401
6.006ArgLeu: 6.006 ± 0.619
2.457ArgMet: 2.457 ± 0.533
3.003ArgAsn: 3.003 ± 0.362
3.14ArgPro: 3.14 ± 0.685
2.594ArgGln: 2.594 ± 0.475
4.778ArgArg: 4.778 ± 0.877
4.914ArgSer: 4.914 ± 0.635
2.867ArgThr: 2.867 ± 0.616
4.914ArgVal: 4.914 ± 0.543
1.229ArgTrp: 1.229 ± 0.289
3.14ArgTyr: 3.14 ± 0.674
0.0ArgXaa: 0.0 ± 0.0
Ser
5.597SerAla: 5.597 ± 1.155
1.775SerCys: 1.775 ± 0.447
3.959SerAsp: 3.959 ± 0.674
2.73SerGlu: 2.73 ± 0.55
2.457SerPhe: 2.457 ± 0.583
5.733SerGly: 5.733 ± 0.663
1.638SerHis: 1.638 ± 0.449
4.095SerIle: 4.095 ± 0.728
2.048SerLys: 2.048 ± 0.424
6.006SerLeu: 6.006 ± 0.529
2.048SerMet: 2.048 ± 0.413
3.413SerAsn: 3.413 ± 0.481
2.73SerPro: 2.73 ± 0.492
2.048SerGln: 2.048 ± 0.641
3.822SerArg: 3.822 ± 0.517
6.006SerSer: 6.006 ± 0.714
3.959SerThr: 3.959 ± 0.905
5.87SerVal: 5.87 ± 1.233
1.229SerTrp: 1.229 ± 0.438
3.549SerTyr: 3.549 ± 0.517
0.0SerXaa: 0.0 ± 0.0
Thr
4.368ThrAla: 4.368 ± 0.76
0.546ThrCys: 0.546 ± 0.317
3.549ThrAsp: 3.549 ± 0.599
2.048ThrGlu: 2.048 ± 0.561
1.775ThrPhe: 1.775 ± 0.397
2.457ThrGly: 2.457 ± 0.753
1.092ThrHis: 1.092 ± 0.302
3.959ThrIle: 3.959 ± 0.616
1.911ThrLys: 1.911 ± 0.503
6.143ThrLeu: 6.143 ± 1.006
1.911ThrMet: 1.911 ± 0.457
3.003ThrAsn: 3.003 ± 0.89
2.73ThrPro: 2.73 ± 0.837
2.048ThrGln: 2.048 ± 0.453
3.413ThrArg: 3.413 ± 0.76
3.822ThrSer: 3.822 ± 0.954
4.778ThrThr: 4.778 ± 1.258
3.549ThrVal: 3.549 ± 0.606
0.546ThrTrp: 0.546 ± 0.206
1.638ThrTyr: 1.638 ± 0.445
0.0ThrXaa: 0.0 ± 0.0
Val
5.46ValAla: 5.46 ± 0.746
1.775ValCys: 1.775 ± 0.26
3.14ValAsp: 3.14 ± 0.516
3.276ValGlu: 3.276 ± 0.626
3.959ValPhe: 3.959 ± 0.734
3.549ValGly: 3.549 ± 0.917
3.276ValHis: 3.276 ± 0.635
5.051ValIle: 5.051 ± 0.692
3.14ValLys: 3.14 ± 0.436
7.917ValLeu: 7.917 ± 0.832
1.775ValMet: 1.775 ± 0.416
4.641ValAsn: 4.641 ± 0.548
4.641ValPro: 4.641 ± 0.579
1.775ValGln: 1.775 ± 0.493
4.095ValArg: 4.095 ± 0.458
4.914ValSer: 4.914 ± 0.553
4.095ValThr: 4.095 ± 0.658
5.87ValVal: 5.87 ± 1.006
1.365ValTrp: 1.365 ± 0.389
2.184ValTyr: 2.184 ± 0.74
0.0ValXaa: 0.0 ± 0.0
Trp
0.956TrpAla: 0.956 ± 0.46
0.273TrpCys: 0.273 ± 0.158
1.229TrpAsp: 1.229 ± 0.414
0.273TrpGlu: 0.273 ± 0.242
1.229TrpPhe: 1.229 ± 0.46
0.273TrpGly: 0.273 ± 0.192
0.41TrpHis: 0.41 ± 0.206
1.229TrpIle: 1.229 ± 0.495
0.546TrpLys: 0.546 ± 0.295
1.775TrpLeu: 1.775 ± 0.675
0.683TrpMet: 0.683 ± 0.463
0.819TrpAsn: 0.819 ± 0.372
0.683TrpPro: 0.683 ± 0.231
0.683TrpGln: 0.683 ± 0.202
1.911TrpArg: 1.911 ± 0.359
0.546TrpSer: 0.546 ± 0.19
0.273TrpThr: 0.273 ± 0.28
1.365TrpVal: 1.365 ± 0.521
0.137TrpTrp: 0.137 ± 0.108
0.819TrpTyr: 0.819 ± 0.368
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.638TyrAla: 1.638 ± 0.507
0.137TyrCys: 0.137 ± 0.108
3.14TyrAsp: 3.14 ± 0.83
1.502TyrGlu: 1.502 ± 0.456
2.457TyrPhe: 2.457 ± 0.621
3.14TyrGly: 3.14 ± 0.516
0.956TyrHis: 0.956 ± 0.332
1.638TyrIle: 1.638 ± 0.354
1.775TyrLys: 1.775 ± 0.491
3.822TyrLeu: 3.822 ± 0.885
1.092TyrMet: 1.092 ± 0.302
3.003TyrAsn: 3.003 ± 0.845
1.365TyrPro: 1.365 ± 0.361
1.502TyrGln: 1.502 ± 0.385
1.775TyrArg: 1.775 ± 0.601
2.594TyrSer: 2.594 ± 0.596
2.184TyrThr: 2.184 ± 0.545
3.959TyrVal: 3.959 ± 0.675
0.546TyrTrp: 0.546 ± 0.348
2.321TyrTyr: 2.321 ± 0.524
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (7327 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski