Amino acid dipepetide frequency for Karimabad virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.388AlaAla: 6.388 ± 2.152
0.983AlaCys: 0.983 ± 0.351
1.474AlaAsp: 1.474 ± 0.321
3.686AlaGlu: 3.686 ± 0.621
3.44AlaPhe: 3.44 ± 0.778
4.177AlaGly: 4.177 ± 0.094
2.948AlaHis: 2.948 ± 1.11
4.914AlaIle: 4.914 ± 1.185
2.457AlaLys: 2.457 ± 1.105
3.931AlaLeu: 3.931 ± 0.989
1.72AlaMet: 1.72 ± 0.758
1.229AlaAsn: 1.229 ± 0.394
1.474AlaPro: 1.474 ± 0.677
1.474AlaGln: 1.474 ± 0.783
3.931AlaArg: 3.931 ± 1.092
6.388AlaSer: 6.388 ± 1.998
2.457AlaThr: 2.457 ± 1.16
5.405AlaVal: 5.405 ± 1.271
0.0AlaTrp: 0.0 ± 0.0
1.229AlaTyr: 1.229 ± 0.808
0.0AlaXaa: 0.0 ± 0.0
Cys
1.229CysAla: 1.229 ± 0.701
0.737CysCys: 0.737 ± 0.2
0.983CysAsp: 0.983 ± 0.233
1.229CysGlu: 1.229 ± 0.513
1.966CysPhe: 1.966 ± 1.13
1.474CysGly: 1.474 ± 0.586
0.737CysHis: 0.737 ± 0.625
0.983CysIle: 0.983 ± 0.495
1.966CysLys: 1.966 ± 0.684
2.457CysLeu: 2.457 ± 0.758
0.246CysMet: 0.246 ± 0.168
1.229CysAsn: 1.229 ± 0.359
0.737CysPro: 0.737 ± 0.625
0.737CysGln: 0.737 ± 0.293
0.737CysArg: 0.737 ± 0.2
2.948CysSer: 2.948 ± 1.171
1.72CysThr: 1.72 ± 0.502
1.229CysVal: 1.229 ± 0.979
0.0CysTrp: 0.0 ± 0.0
0.491CysTyr: 0.491 ± 0.116
0.0CysXaa: 0.0 ± 0.0
Asp
3.686AspAla: 3.686 ± 1.547
1.229AspCys: 1.229 ± 0.394
4.668AspAsp: 4.668 ± 0.711
5.897AspGlu: 5.897 ± 0.373
1.474AspPhe: 1.474 ± 0.719
2.948AspGly: 2.948 ± 0.894
1.229AspHis: 1.229 ± 0.84
2.703AspIle: 2.703 ± 1.365
2.457AspLys: 2.457 ± 0.534
3.194AspLeu: 3.194 ± 0.815
2.211AspMet: 2.211 ± 0.613
1.966AspAsn: 1.966 ± 1.091
2.703AspPro: 2.703 ± 0.66
0.491AspGln: 0.491 ± 0.116
2.948AspArg: 2.948 ± 0.984
2.948AspSer: 2.948 ± 0.663
2.948AspThr: 2.948 ± 1.418
3.686AspVal: 3.686 ± 1.233
1.474AspTrp: 1.474 ± 0.401
1.229AspTyr: 1.229 ± 0.394
0.0AspXaa: 0.0 ± 0.0
Glu
5.405GluAla: 5.405 ± 1.208
1.966GluCys: 1.966 ± 0.684
4.177GluAsp: 4.177 ± 2.187
8.354GluGlu: 8.354 ± 1.123
4.914GluPhe: 4.914 ± 1.503
5.405GluGly: 5.405 ± 1.216
0.983GluHis: 0.983 ± 0.351
4.914GluIle: 4.914 ± 1.576
6.388GluLys: 6.388 ± 1.338
7.371GluLeu: 7.371 ± 1.292
3.194GluMet: 3.194 ± 1.551
2.703GluAsn: 2.703 ± 1.608
2.948GluPro: 2.948 ± 0.843
0.737GluGln: 0.737 ± 0.591
3.194GluArg: 3.194 ± 0.943
4.177GluSer: 4.177 ± 0.542
4.668GluThr: 4.668 ± 1.146
5.651GluVal: 5.651 ± 1.083
0.737GluTrp: 0.737 ± 0.293
0.983GluTyr: 0.983 ± 0.351
0.0GluXaa: 0.0 ± 0.0
Phe
4.177PheAla: 4.177 ± 1.835
1.474PheCys: 1.474 ± 0.349
2.703PheAsp: 2.703 ± 1.594
2.211PheGlu: 2.211 ± 0.678
1.72PhePhe: 1.72 ± 0.546
2.703PheGly: 2.703 ± 0.907
0.246PheHis: 0.246 ± 0.168
0.491PheIle: 0.491 ± 0.417
2.948PheLys: 2.948 ± 0.66
3.44PheLeu: 3.44 ± 0.327
2.211PheMet: 2.211 ± 0.653
2.211PheAsn: 2.211 ± 0.488
2.211PhePro: 2.211 ± 1.073
0.491PheGln: 0.491 ± 0.561
1.966PheArg: 1.966 ± 1.087
3.931PheSer: 3.931 ± 1.338
2.703PheThr: 2.703 ± 1.19
2.703PheVal: 2.703 ± 1.234
0.491PheTrp: 0.491 ± 0.116
1.229PheTyr: 1.229 ± 0.714
0.0PheXaa: 0.0 ± 0.0
Gly
4.668GlyAla: 4.668 ± 1.024
1.229GlyCys: 1.229 ± 0.701
4.423GlyAsp: 4.423 ± 1.202
4.177GlyGlu: 4.177 ± 1.017
4.177GlyPhe: 4.177 ± 1.149
5.897GlyGly: 5.897 ± 0.756
1.474GlyHis: 1.474 ± 0.527
3.44GlyIle: 3.44 ± 0.604
6.88GlyLys: 6.88 ± 0.508
6.634GlyLeu: 6.634 ± 1.187
1.474GlyMet: 1.474 ± 0.359
2.948GlyAsn: 2.948 ± 1.128
2.457GlyPro: 2.457 ± 0.305
1.229GlyGln: 1.229 ± 0.454
3.44GlyArg: 3.44 ± 0.424
4.668GlySer: 4.668 ± 0.543
2.457GlyThr: 2.457 ± 0.788
4.177GlyVal: 4.177 ± 0.939
0.737GlyTrp: 0.737 ± 0.293
2.211GlyTyr: 2.211 ± 0.455
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.983HisCys: 0.983 ± 0.459
1.72HisAsp: 1.72 ± 0.665
1.229HisGlu: 1.229 ± 0.454
1.966HisPhe: 1.966 ± 0.328
1.72HisGly: 1.72 ± 0.546
0.737HisHis: 0.737 ± 0.625
1.474HisIle: 1.474 ± 0.677
0.491HisLys: 0.491 ± 0.621
1.229HisLeu: 1.229 ± 0.394
0.737HisMet: 0.737 ± 0.2
2.211HisAsn: 2.211 ± 1.225
1.72HisPro: 1.72 ± 0.8
0.737HisGln: 0.737 ± 0.564
1.72HisArg: 1.72 ± 0.408
1.72HisSer: 1.72 ± 0.502
1.474HisThr: 1.474 ± 0.401
1.474HisVal: 1.474 ± 0.586
0.0HisTrp: 0.0 ± 0.0
0.737HisTyr: 0.737 ± 0.293
0.0HisXaa: 0.0 ± 0.0
Ile
2.948IleAla: 2.948 ± 0.61
0.737IleCys: 0.737 ± 0.2
4.914IleAsp: 4.914 ± 1.28
4.423IleGlu: 4.423 ± 0.383
2.703IlePhe: 2.703 ± 1.234
4.668IleGly: 4.668 ± 1.955
1.474IleHis: 1.474 ± 0.908
5.405IleIle: 5.405 ± 1.022
4.177IleLys: 4.177 ± 1.189
4.423IleLeu: 4.423 ± 0.552
2.457IleMet: 2.457 ± 0.582
2.211IleAsn: 2.211 ± 1.036
1.966IlePro: 1.966 ± 0.328
1.229IleGln: 1.229 ± 0.84
5.16IleArg: 5.16 ± 0.857
5.897IleSer: 5.897 ± 1.423
3.194IleThr: 3.194 ± 1.228
4.423IleVal: 4.423 ± 0.255
0.737IleTrp: 0.737 ± 0.2
1.229IleTyr: 1.229 ± 0.703
0.0IleXaa: 0.0 ± 0.0
Lys
3.194LysAla: 3.194 ± 0.535
2.211LysCys: 2.211 ± 0.878
3.194LysAsp: 3.194 ± 0.839
5.405LysGlu: 5.405 ± 1.311
3.931LysPhe: 3.931 ± 0.651
4.914LysGly: 4.914 ± 1.172
0.983LysHis: 0.983 ± 0.233
4.914LysIle: 4.914 ± 0.925
5.16LysLys: 5.16 ± 0.892
5.897LysLeu: 5.897 ± 2.213
3.686LysMet: 3.686 ± 1.361
2.211LysAsn: 2.211 ± 0.613
3.44LysPro: 3.44 ± 0.959
1.966LysGln: 1.966 ± 0.702
2.948LysArg: 2.948 ± 0.894
4.668LysSer: 4.668 ± 0.829
2.457LysThr: 2.457 ± 0.422
6.143LysVal: 6.143 ± 1.13
0.983LysTrp: 0.983 ± 0.615
2.948LysTyr: 2.948 ± 1.556
0.0LysXaa: 0.0 ± 0.0
Leu
5.405LeuAla: 5.405 ± 1.146
1.72LeuCys: 1.72 ± 0.387
3.194LeuAsp: 3.194 ± 0.502
6.388LeuGlu: 6.388 ± 1.305
2.948LeuPhe: 2.948 ± 1.066
4.914LeuGly: 4.914 ± 1.284
2.211LeuHis: 2.211 ± 1.073
5.897LeuIle: 5.897 ± 1.413
6.634LeuLys: 6.634 ± 1.497
5.405LeuLeu: 5.405 ± 1.622
3.44LeuMet: 3.44 ± 0.381
2.211LeuAsn: 2.211 ± 0.395
2.457LeuPro: 2.457 ± 0.82
3.686LeuGln: 3.686 ± 0.472
5.405LeuArg: 5.405 ± 1.266
6.88LeuSer: 6.88 ± 1.357
4.423LeuThr: 4.423 ± 1.221
5.405LeuVal: 5.405 ± 0.533
0.491LeuTrp: 0.491 ± 0.116
2.211LeuTyr: 2.211 ± 1.073
0.0LeuXaa: 0.0 ± 0.0
Met
1.229MetAla: 1.229 ± 0.281
0.0MetCys: 0.0 ± 0.0
2.211MetAsp: 2.211 ± 0.396
2.703MetGlu: 2.703 ± 0.984
0.491MetPhe: 0.491 ± 0.336
1.966MetGly: 1.966 ± 0.474
2.457MetHis: 2.457 ± 0.611
2.211MetIle: 2.211 ± 1.566
2.457MetLys: 2.457 ± 0.563
2.703MetLeu: 2.703 ± 0.672
1.966MetMet: 1.966 ± 0.357
2.457MetAsn: 2.457 ± 0.422
0.491MetPro: 0.491 ± 0.458
1.72MetGln: 1.72 ± 1.176
1.72MetArg: 1.72 ± 0.381
4.423MetSer: 4.423 ± 0.757
2.703MetThr: 2.703 ± 0.985
1.474MetVal: 1.474 ± 0.349
0.246MetTrp: 0.246 ± 0.607
0.737MetTyr: 0.737 ± 0.523
0.0MetXaa: 0.0 ± 0.0
Asn
1.72AsnAla: 1.72 ± 0.393
0.246AsnCys: 0.246 ± 0.208
1.474AsnAsp: 1.474 ± 0.487
3.686AsnGlu: 3.686 ± 1.193
1.229AsnPhe: 1.229 ± 0.373
2.211AsnGly: 2.211 ± 0.925
1.229AsnHis: 1.229 ± 0.454
2.703AsnIle: 2.703 ± 0.725
2.948AsnLys: 2.948 ± 0.759
5.897AsnLeu: 5.897 ± 0.546
1.229AsnMet: 1.229 ± 0.469
1.474AsnAsn: 1.474 ± 0.349
2.703AsnPro: 2.703 ± 0.535
1.474AsnGln: 1.474 ± 0.527
2.211AsnArg: 2.211 ± 0.391
3.686AsnSer: 3.686 ± 0.939
1.229AsnThr: 1.229 ± 0.513
1.72AsnVal: 1.72 ± 0.655
0.246AsnTrp: 0.246 ± 0.398
0.983AsnTyr: 0.983 ± 0.351
0.0AsnXaa: 0.0 ± 0.0
Pro
1.229ProAla: 1.229 ± 0.281
0.246ProCys: 0.246 ± 0.208
2.211ProAsp: 2.211 ± 0.396
4.914ProGlu: 4.914 ± 1.428
1.72ProPhe: 1.72 ± 0.502
3.44ProGly: 3.44 ± 0.671
0.737ProHis: 0.737 ± 0.523
2.703ProIle: 2.703 ± 0.535
1.966ProLys: 1.966 ± 0.471
2.457ProLeu: 2.457 ± 0.744
0.737ProMet: 0.737 ± 0.833
1.966ProAsn: 1.966 ± 1.062
0.737ProPro: 0.737 ± 0.2
1.474ProGln: 1.474 ± 0.387
2.211ProArg: 2.211 ± 0.806
3.931ProSer: 3.931 ± 1.482
1.966ProThr: 1.966 ± 1.581
1.966ProVal: 1.966 ± 0.435
0.737ProTrp: 0.737 ± 0.504
0.737ProTyr: 0.737 ± 0.2
0.0ProXaa: 0.0 ± 0.0
Gln
1.72GlnAla: 1.72 ± 0.659
0.983GlnCys: 0.983 ± 0.565
1.72GlnAsp: 1.72 ± 0.978
1.229GlnGlu: 1.229 ± 0.373
0.737GlnPhe: 0.737 ± 0.786
1.966GlnGly: 1.966 ± 1.023
0.737GlnHis: 0.737 ± 0.2
2.457GlnIle: 2.457 ± 1.345
3.931GlnLys: 3.931 ± 0.959
0.491GlnLeu: 0.491 ± 0.561
0.491GlnMet: 0.491 ± 0.369
1.229GlnAsn: 1.229 ± 0.84
1.229GlnPro: 1.229 ± 0.672
1.474GlnGln: 1.474 ± 0.527
1.229GlnArg: 1.229 ± 0.359
1.229GlnSer: 1.229 ± 0.359
1.474GlnThr: 1.474 ± 0.349
0.737GlnVal: 0.737 ± 0.2
0.0GlnTrp: 0.0 ± 0.0
0.983GlnTyr: 0.983 ± 0.351
0.0GlnXaa: 0.0 ± 0.0
Arg
4.423ArgAla: 4.423 ± 1.197
1.229ArgCys: 1.229 ± 0.701
3.686ArgAsp: 3.686 ± 1.261
5.16ArgGlu: 5.16 ± 1.064
0.737ArgPhe: 0.737 ± 0.523
3.931ArgGly: 3.931 ± 0.552
0.491ArgHis: 0.491 ± 0.116
3.686ArgIle: 3.686 ± 0.526
3.194ArgLys: 3.194 ± 0.754
3.931ArgLeu: 3.931 ± 0.629
2.703ArgMet: 2.703 ± 0.697
2.703ArgAsn: 2.703 ± 0.315
1.966ArgPro: 1.966 ± 0.357
1.229ArgGln: 1.229 ± 0.281
3.194ArgArg: 3.194 ± 1.45
3.44ArgSer: 3.44 ± 0.977
2.211ArgThr: 2.211 ± 0.862
4.668ArgVal: 4.668 ± 0.895
1.229ArgTrp: 1.229 ± 0.703
1.474ArgTyr: 1.474 ± 0.677
0.0ArgXaa: 0.0 ± 0.0
Ser
5.405SerAla: 5.405 ± 1.198
2.703SerCys: 2.703 ± 2.125
3.686SerAsp: 3.686 ± 0.122
5.405SerGlu: 5.405 ± 0.668
2.703SerPhe: 2.703 ± 0.664
6.88SerGly: 6.88 ± 0.658
1.72SerHis: 1.72 ± 0.387
6.388SerIle: 6.388 ± 1.061
5.405SerLys: 5.405 ± 0.678
7.371SerLeu: 7.371 ± 1.675
2.211SerMet: 2.211 ± 0.488
2.703SerAsn: 2.703 ± 0.907
4.423SerPro: 4.423 ± 0.966
0.983SerGln: 0.983 ± 0.459
3.686SerArg: 3.686 ± 1.985
8.354SerSer: 8.354 ± 1.838
5.405SerThr: 5.405 ± 1.23
3.686SerVal: 3.686 ± 0.526
1.966SerTrp: 1.966 ± 0.454
1.72SerTyr: 1.72 ± 0.546
0.0SerXaa: 0.0 ± 0.0
Thr
2.703ThrAla: 2.703 ± 0.985
1.966ThrCys: 1.966 ± 0.846
1.966ThrAsp: 1.966 ± 0.612
4.177ThrGlu: 4.177 ± 1.564
1.229ThrPhe: 1.229 ± 0.513
3.44ThrGly: 3.44 ± 0.778
0.491ThrHis: 0.491 ± 0.621
1.966ThrIle: 1.966 ± 0.702
4.177ThrLys: 4.177 ± 0.803
7.125ThrLeu: 7.125 ± 0.83
1.966ThrMet: 1.966 ± 1.656
1.966ThrAsn: 1.966 ± 0.519
1.474ThrPro: 1.474 ± 0.677
0.737ThrGln: 0.737 ± 0.504
2.703ThrArg: 2.703 ± 0.66
5.651ThrSer: 5.651 ± 0.749
2.703ThrThr: 2.703 ± 0.454
3.931ThrVal: 3.931 ± 1.018
0.491ThrTrp: 0.491 ± 0.796
1.72ThrTyr: 1.72 ± 0.408
0.0ThrXaa: 0.0 ± 0.0
Val
3.194ValAla: 3.194 ± 0.592
2.703ValCys: 2.703 ± 0.599
1.966ValAsp: 1.966 ± 0.684
6.634ValGlu: 6.634 ± 1.465
2.703ValPhe: 2.703 ± 0.599
2.948ValGly: 2.948 ± 0.213
2.211ValHis: 2.211 ± 0.391
4.423ValIle: 4.423 ± 1.1
4.914ValLys: 4.914 ± 1.199
4.668ValLeu: 4.668 ± 0.791
1.966ValMet: 1.966 ± 0.466
2.703ValAsn: 2.703 ± 0.792
0.983ValPro: 0.983 ± 0.351
3.194ValGln: 3.194 ± 1.031
4.423ValArg: 4.423 ± 0.552
5.405ValSer: 5.405 ± 0.723
3.686ValThr: 3.686 ± 0.122
3.44ValVal: 3.44 ± 0.815
0.246ValTrp: 0.246 ± 0.208
1.966ValTyr: 1.966 ± 1.062
0.0ValXaa: 0.0 ± 0.0
Trp
0.491TrpAla: 0.491 ± 0.116
0.0TrpCys: 0.0 ± 0.0
0.737TrpAsp: 0.737 ± 0.2
0.491TrpGlu: 0.491 ± 0.336
0.246TrpPhe: 0.246 ± 0.168
0.983TrpGly: 0.983 ± 0.468
0.0TrpHis: 0.0 ± 0.0
1.229TrpIle: 1.229 ± 0.469
0.246TrpLys: 0.246 ± 0.398
0.491TrpLeu: 0.491 ± 0.369
0.737TrpMet: 0.737 ± 0.2
0.491TrpAsn: 0.491 ± 0.336
0.491TrpPro: 0.491 ± 0.561
0.246TrpGln: 0.246 ± 0.168
0.983TrpArg: 0.983 ± 0.495
0.491TrpSer: 0.491 ± 0.417
1.229TrpThr: 1.229 ± 0.373
1.229TrpVal: 1.229 ± 0.281
0.246TrpTrp: 0.246 ± 0.168
0.491TrpTyr: 0.491 ± 0.561
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.737TyrAla: 0.737 ± 0.2
0.491TyrCys: 0.491 ± 0.417
0.983TyrAsp: 0.983 ± 1.127
1.474TyrGlu: 1.474 ± 0.401
1.229TyrPhe: 1.229 ± 0.703
2.211TyrGly: 2.211 ± 0.613
0.983TyrHis: 0.983 ± 0.233
1.474TyrIle: 1.474 ± 0.677
2.457TyrLys: 2.457 ± 0.82
2.211TyrLeu: 2.211 ± 0.326
0.737TyrMet: 0.737 ± 0.564
1.474TyrAsn: 1.474 ± 0.401
1.474TyrPro: 1.474 ± 0.55
0.737TyrGln: 0.737 ± 0.833
1.474TyrArg: 1.474 ± 0.387
1.966TyrSer: 1.966 ± 1.846
1.474TyrThr: 1.474 ± 0.677
1.229TyrVal: 1.229 ± 0.714
0.491TyrTrp: 0.491 ± 0.116
0.737TyrTyr: 0.737 ± 0.504
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4071 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski