Amino acid dipeptide frequency for Koyama Hill virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
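
As a rough illustration of the calculation, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter sequence encoding, and the normalisation by the total number of counted pairs are assumptions made for this example; the database may count or normalise differently (for instance, by total residue count).

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return pooled dipeptide frequencies in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences for illustration only, not taken from the Koyama Hill virus proteome.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
```

Here `freqs` maps each observed pair (for example `"KK"`) to its share of all counted pairs; pairs that never occur are simply absent from the dictionary.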

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so each value must be multiplied by 10⁻³ to obtain a fraction.
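
The arithmetic behind the expected frequency and the unit conversion can be sketched as follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the sketch assumes the uniform 1/400 baseline is the cut-off between over- and underrepresented dipeptides; the exact rule used to colour the table is not stated on this page. The three example values are taken from the table.

```python
# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the assumed uniform baseline."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values copied from the table, in per mille.
examples = {"LeuLeu": 8.039, "AlaGln": 3.61, "CysCys": 0.328}
labels = {name: classify(value) for name, value in examples.items()}
```

Under these assumptions LeuLeu and AlaGln fall above the 2.5‰ baseline and CysCys falls below it.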

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.594 ± 1.412 | 0.82 ± 0.238 | 1.477 ± 0.528 | 4.266 ± 0.701 | 1.969 ± 0.499 | 2.297 ± 0.818 | 1.313 ± 0.286 | 4.922 ± 0.92 | 3.774 ± 0.862 | 6.399 ± 0.922 | 2.133 ± 0.531 | 3.61 ± 0.672 | 2.461 ± 0.665 | 3.61 ± 0.963 | 3.938 ± 1.085 | 3.281 ± 0.762 | 4.266 ± 0.82 | 4.922 ± 1.112 | 0.82 ± 0.331 | 1.969 ± 0.451 | 0.0 ± 0.0
Cys | 1.313 ± 0.706 | 0.328 ± 0.235 | 0.984 ± 0.54 | 0.656 ± 0.37 | 0.82 ± 0.485 | 1.477 ± 0.249 | 0.164 ± 0.185 | 0.656 ± 0.238 | 0.164 ± 0.158 | 0.656 ± 0.356 | 0.328 ± 0.193 | 0.492 ± 0.289 | 0.492 ± 0.298 | 0.328 ± 0.351 | 0.656 ± 0.392 | 0.82 ± 0.39 | 0.164 ± 0.187 | 0.82 ± 0.32 | 0.0 ± 0.0 | 0.984 ± 0.393 | 0.0 ± 0.0
Asp | 3.117 ± 0.582 | 0.656 ± 0.291 | 3.445 ± 0.593 | 4.922 ± 0.956 | 1.148 ± 0.414 | 3.938 ± 1.39 | 0.984 ± 0.357 | 3.938 ± 0.725 | 2.461 ± 0.509 | 6.399 ± 1.202 | 0.82 ± 0.344 | 1.148 ± 0.33 | 3.61 ± 0.634 | 2.625 ± 0.657 | 4.594 ± 1.098 | 3.938 ± 0.635 | 1.641 ± 0.678 | 5.742 ± 0.882 | 0.656 ± 0.251 | 0.984 ± 0.296 | 0.0 ± 0.0
Glu | 3.61 ± 0.511 | 0.164 ± 0.118 | 2.625 ± 0.646 | 5.25 ± 1.185 | 3.117 ± 0.843 | 3.774 ± 0.846 | 1.641 ± 0.48 | 4.758 ± 1.064 | 3.938 ± 1.054 | 5.414 ± 0.885 | 2.625 ± 0.527 | 3.938 ± 0.712 | 2.297 ± 0.43 | 2.789 ± 1.0 | 5.578 ± 1.299 | 5.086 ± 0.741 | 4.266 ± 0.819 | 3.938 ± 0.818 | 1.477 ± 0.583 | 1.969 ± 0.882 | 0.0 ± 0.0
Phe | 1.477 ± 0.29 | 0.328 ± 0.211 | 2.297 ± 0.431 | 2.297 ± 0.638 | 1.148 ± 0.435 | 2.953 ± 0.76 | 0.656 ± 0.343 | 2.625 ± 0.818 | 2.297 ± 0.643 | 3.61 ± 1.013 | 0.492 ± 0.287 | 2.953 ± 0.784 | 0.984 ± 0.283 | 1.148 ± 0.366 | 2.953 ± 0.733 | 3.61 ± 0.966 | 2.789 ± 0.683 | 1.805 ± 0.297 | 0.164 ± 0.187 | 0.82 ± 0.35 | 0.0 ± 0.0
Gly | 2.789 ± 0.844 | 0.164 ± 0.187 | 4.594 ± 1.551 | 3.774 ± 0.734 | 2.133 ± 0.451 | 2.953 ± 0.759 | 2.297 ± 0.642 | 3.117 ± 0.485 | 3.61 ± 1.239 | 5.086 ± 0.641 | 1.477 ± 0.619 | 2.297 ± 0.576 | 1.313 ± 0.55 | 2.297 ± 0.647 | 4.266 ± 1.147 | 5.742 ± 1.157 | 2.133 ± 0.631 | 4.594 ± 1.244 | 0.164 ± 0.185 | 3.117 ± 0.824 | 0.164 ± 0.158
His | 1.641 ± 0.514 | 0.164 ± 0.118 | 1.313 ± 0.311 | 0.328 ± 0.192 | 0.656 ± 0.266 | 1.641 ± 0.806 | 0.82 ± 0.178 | 1.969 ± 0.352 | 1.148 ± 0.562 | 3.938 ± 0.946 | 1.148 ± 0.4 | 0.984 ± 0.291 | 0.82 ± 0.476 | 1.477 ± 0.576 | 2.133 ± 0.498 | 1.641 ± 0.534 | 1.148 ± 0.335 | 2.297 ± 0.894 | 0.82 ± 0.273 | 0.492 ± 0.246 | 0.0 ± 0.0
Ile | 3.774 ± 1.05 | 0.984 ± 0.493 | 3.281 ± 0.752 | 5.742 ± 1.012 | 1.969 ± 0.594 | 3.774 ± 0.631 | 2.789 ± 0.646 | 3.61 ± 0.643 | 2.461 ± 0.862 | 6.891 ± 1.055 | 1.969 ± 0.44 | 2.297 ± 0.552 | 3.117 ± 0.742 | 2.625 ± 0.591 | 4.102 ± 0.608 | 5.578 ± 0.775 | 3.61 ± 0.315 | 2.625 ± 0.631 | 0.656 ± 0.314 | 3.117 ± 0.852 | 0.0 ± 0.0
Lys | 4.43 ± 0.774 | 0.656 ± 0.226 | 2.461 ± 0.712 | 3.774 ± 0.879 | 2.133 ± 0.491 | 2.789 ± 0.714 | 0.82 ± 0.293 | 5.086 ± 0.445 | 3.117 ± 0.905 | 4.594 ± 1.012 | 1.313 ± 0.537 | 1.641 ± 0.516 | 1.805 ± 0.684 | 2.297 ± 0.847 | 3.445 ± 0.847 | 3.774 ± 0.537 | 2.789 ± 0.735 | 3.61 ± 0.914 | 0.328 ± 0.315 | 2.133 ± 0.68 | 0.0 ± 0.0
Leu | 5.742 ± 1.046 | 0.82 ± 0.356 | 5.414 ± 0.719 | 4.758 ± 0.617 | 4.102 ± 0.851 | 4.43 ± 0.724 | 3.117 ± 0.991 | 5.742 ± 1.058 | 5.578 ± 0.974 | 8.039 ± 1.093 | 3.117 ± 0.761 | 4.594 ± 0.364 | 5.25 ± 1.309 | 3.774 ± 0.827 | 7.711 ± 1.406 | 6.071 ± 1.113 | 6.563 ± 1.2 | 4.922 ± 0.732 | 1.313 ± 0.55 | 2.297 ± 0.592 | 0.0 ± 0.0
Met | 2.297 ± 0.73 | 0.492 ± 0.273 | 1.969 ± 0.599 | 1.313 ± 0.456 | 1.805 ± 0.372 | 1.641 ± 0.754 | 1.313 ± 0.309 | 0.984 ± 0.348 | 0.984 ± 0.272 | 3.774 ± 0.72 | 0.984 ± 0.411 | 0.492 ± 0.189 | 0.82 ± 0.401 | 0.328 ± 0.239 | 2.461 ± 0.817 | 1.805 ± 0.605 | 2.133 ± 0.626 | 0.984 ± 0.6 | 0.328 ± 0.211 | 1.969 ± 0.432 | 0.0 ± 0.0
Asn | 3.281 ± 0.951 | 0.328 ± 0.211 | 1.641 ± 0.637 | 3.61 ± 0.518 | 1.805 ± 0.603 | 3.281 ± 0.794 | 1.313 ± 0.453 | 2.461 ± 0.377 | 2.297 ± 0.49 | 3.445 ± 0.397 | 1.477 ± 0.492 | 1.641 ± 0.296 | 1.313 ± 0.438 | 1.477 ± 0.536 | 4.102 ± 0.957 | 3.61 ± 0.809 | 2.297 ± 0.761 | 3.774 ± 0.664 | 0.492 ± 0.253 | 2.133 ± 0.822 | 0.0 ± 0.0
Pro | 2.297 ± 0.641 | 0.0 ± 0.0 | 2.297 ± 0.692 | 2.461 ± 0.658 | 1.805 ± 0.336 | 2.625 ± 0.961 | 0.82 ± 0.367 | 1.969 ± 0.393 | 1.969 ± 0.556 | 2.461 ± 0.547 | 0.82 ± 0.206 | 1.148 ± 0.507 | 1.641 ± 0.677 | 1.148 ± 0.553 | 2.789 ± 0.465 | 2.133 ± 0.672 | 2.953 ± 1.013 | 1.969 ± 0.436 | 0.82 ± 0.352 | 1.805 ± 0.436 | 0.0 ± 0.0
Gln | 2.953 ± 0.774 | 0.492 ± 0.263 | 1.805 ± 0.563 | 2.789 ± 0.617 | 1.477 ± 0.521 | 1.805 ± 0.689 | 1.641 ± 0.402 | 3.61 ± 0.71 | 2.297 ± 0.678 | 4.102 ± 0.678 | 1.805 ± 0.779 | 2.953 ± 0.798 | 1.148 ± 0.547 | 1.477 ± 0.568 | 2.625 ± 0.505 | 1.641 ± 0.395 | 3.281 ± 0.576 | 1.969 ± 0.339 | 0.0 ± 0.0 | 0.984 ± 0.251 | 0.0 ± 0.0
Arg | 5.906 ± 1.051 | 1.969 ± 0.838 | 5.578 ± 0.633 | 4.922 ± 0.739 | 3.445 ± 0.61 | 3.281 ± 0.682 | 1.313 ± 0.473 | 5.742 ± 0.581 | 1.969 ± 0.654 | 4.922 ± 0.697 | 2.789 ± 0.608 | 4.758 ± 0.515 | 1.969 ± 0.814 | 2.789 ± 0.48 | 3.61 ± 1.116 | 4.922 ± 0.61 | 3.938 ± 0.328 | 3.774 ± 0.452 | 0.492 ± 0.275 | 2.297 ± 0.917 | 0.0 ± 0.0
Ser | 4.594 ± 0.48 | 0.656 ± 0.403 | 4.43 ± 0.555 | 5.578 ± 1.234 | 2.789 ± 0.903 | 5.742 ± 1.198 | 1.477 ± 0.318 | 3.445 ± 0.872 | 4.102 ± 0.658 | 6.563 ± 1.23 | 1.148 ± 0.321 | 2.297 ± 0.494 | 2.297 ± 0.749 | 2.625 ± 0.567 | 4.922 ± 0.908 | 5.25 ± 1.05 | 5.414 ± 0.898 | 3.281 ± 0.652 | 0.492 ± 0.259 | 2.789 ± 0.742 | 0.0 ± 0.0
Thr | 3.281 ± 0.917 | 1.477 ± 0.84 | 3.445 ± 0.778 | 4.102 ± 0.95 | 1.641 ± 0.375 | 3.938 ± 0.67 | 1.313 ± 0.419 | 3.117 ± 1.029 | 3.445 ± 0.849 | 8.039 ± 0.904 | 1.313 ± 0.443 | 2.789 ± 0.693 | 1.641 ± 0.527 | 2.953 ± 0.766 | 3.938 ± 0.648 | 3.938 ± 0.741 | 4.43 ± 1.198 | 4.266 ± 0.727 | 0.656 ± 0.269 | 2.461 ± 0.704 | 0.0 ± 0.0
Val | 3.281 ± 0.482 | 1.148 ± 0.334 | 3.61 ± 0.606 | 5.086 ± 0.945 | 1.805 ± 0.58 | 3.281 ± 1.217 | 1.641 ± 0.6 | 5.086 ± 0.728 | 4.594 ± 0.985 | 4.594 ± 0.688 | 2.133 ± 0.587 | 2.953 ± 1.004 | 1.641 ± 0.609 | 3.445 ± 0.787 | 4.266 ± 0.867 | 3.117 ± 0.788 | 3.938 ± 0.485 | 5.414 ± 1.306 | 0.328 ± 0.206 | 2.953 ± 1.211 | 0.0 ± 0.0
Trp | 0.492 ± 0.44 | 0.0 ± 0.0 | 0.656 ± 0.269 | 0.82 ± 0.351 | 0.328 ± 0.206 | 0.164 ± 0.158 | 0.328 ± 0.206 | 0.82 ± 0.512 | 0.984 ± 0.449 | 0.82 ± 0.251 | 0.328 ± 0.2 | 0.656 ± 0.26 | 0.164 ± 0.198 | 0.328 ± 0.211 | 0.492 ± 0.289 | 0.984 ± 0.257 | 0.82 ± 0.369 | 0.656 ± 0.384 | 0.164 ± 0.187 | 0.492 ± 0.276 | 0.0 ± 0.0
Tyr | 1.969 ± 0.621 | 0.656 ± 0.274 | 3.445 ± 0.697 | 1.805 ± 0.536 | 1.313 ± 0.451 | 2.297 ± 0.763 | 0.82 ± 0.362 | 1.477 ± 0.428 | 1.641 ± 0.549 | 3.61 ± 0.755 | 0.492 ± 0.256 | 2.133 ± 0.749 | 1.148 ± 0.319 | 1.313 ± 0.298 | 1.805 ± 0.451 | 2.953 ± 0.665 | 3.61 ± 1.076 | 2.953 ± 0.383 | 0.328 ± 0.185 | 1.805 ± 0.578 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.164 ± 0.158 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 10 proteins (6096 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
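
A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. This is only an illustration under stated assumptions, not the database's actual procedure. In particular, the use of the sample standard deviation as the error, the fixed random seed, and the helper names `pair_per_mille` and `bootstrap_error` are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Per mille frequency of one dipeptide, pooled over all sequences."""
    counts = Counter(
        seq[i:i + 2] for seq in sequences for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: report the sample standard deviation of the replicates.
    return statistics.stdev(estimates)


# Toy sequences for illustration only, not taken from the Koyama Hill virus proteome.
toy = ["MKKLA", "AKKGG", "LLKKA", "GAGAK"]
err = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the estimate reflects variation between proteins.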

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski