Amino acid dipeptide frequency for Lone star tick rhabdovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
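As an illustration, dipeptide frequencies can be obtained by sliding a two-residue window along each protein and counting the pairs. The sketch below is a minimal example, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the toy sequences, and the choice not to count pairs that span two proteins are all assumptions.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return overlapping dipeptide frequencies in per mille.

    `proteins` is a list of one-letter amino acid strings (hypothetical input).
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2; pairs never span two proteins (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "MKLLA" gives MK, KL, LL, LA and "MKA" gives MK, KA (6 pairs in total).
freqs = dipeptide_per_mille(["MKLLA", "MKA"])
```

Here `freqs["MK"]` is two occurrences out of six pairs, and all returned values sum to 1000 per mille.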

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
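The expectation and the unit conversion can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function names are hypothetical, and comparing each value directly against the uniform baseline for 400 dipeptides is an assumption, since the page does not state the exact rule behind the red and blue marking.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform baseline (assumed rule)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "as expected"


baseline_20 = expected_uniform_per_mille(20)  # 400 dipeptides
baseline_21 = expected_uniform_per_mille(21)  # 441 dipeptides, with Xaa
gly_leu = classify(9.088)                     # GlyLeu value from the table
cys_ala = classify(0.267)                     # CysAla value from the table
```

With 20 amino acids the baseline is 2.5‰ (0.25%); with Xaa included it drops to about 2.27‰. Under this rule GlyLeu (9.088‰) is overrepresented and CysAla (0.267‰) is underrepresented.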

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.079 ± 6.301
AlaCys: 0.535 ± 0.35
AlaAsp: 1.604 ± 0.329
AlaGlu: 2.673 ± 1.074
AlaPhe: 1.069 ± 0.762
AlaGly: 2.406 ± 0.608
AlaHis: 1.604 ± 0.297
AlaIle: 2.138 ± 1.152
AlaLys: 2.138 ± 0.312
AlaLeu: 3.475 ± 0.676
AlaMet: 1.069 ± 0.702
AlaAsn: 3.475 ± 2.309
AlaPro: 2.673 ± 1.972
AlaGln: 1.604 ± 0.438
AlaArg: 3.742 ± 2.322
AlaSer: 3.208 ± 1.253
AlaThr: 4.277 ± 1.012
AlaVal: 2.94 ± 1.53
AlaTrp: 1.069 ± 0.956
AlaTyr: 1.871 ± 0.892
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.267 ± 0.391
CysCys: 0.535 ± 0.637
CysAsp: 1.337 ± 0.468
CysGlu: 1.337 ± 0.468
CysPhe: 0.802 ± 0.441
CysGly: 0.535 ± 0.294
CysHis: 0.267 ± 0.147
CysIle: 0.802 ± 0.358
CysLys: 0.802 ± 0.758
CysLeu: 1.604 ± 0.586
CysMet: 0.267 ± 0.147
CysAsn: 0.267 ± 0.147
CysPro: 1.337 ± 0.923
CysGln: 0.267 ± 0.147
CysArg: 0.535 ± 0.536
CysSer: 1.337 ± 0.561
CysThr: 0.802 ± 0.402
CysVal: 0.802 ± 0.454
CysTrp: 0.802 ± 0.441
CysTyr: 0.535 ± 0.257
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.138 ± 0.591
AspCys: 0.802 ± 0.272
AspAsp: 1.871 ± 1.07
AspGlu: 2.673 ± 1.114
AspPhe: 1.871 ± 0.762
AspGly: 2.94 ± 1.118
AspHis: 1.871 ± 0.895
AspIle: 3.475 ± 0.916
AspLys: 4.277 ± 0.766
AspLeu: 5.881 ± 1.321
AspMet: 1.871 ± 0.598
AspAsn: 1.871 ± 0.469
AspPro: 5.346 ± 1.172
AspGln: 1.604 ± 0.438
AspArg: 4.01 ± 0.827
AspSer: 2.673 ± 0.651
AspThr: 3.742 ± 0.809
AspVal: 1.871 ± 0.895
AspTrp: 1.337 ± 0.629
AspTyr: 2.138 ± 0.71
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.475 ± 1.839
GluCys: 0.535 ± 0.294
GluAsp: 5.613 ± 0.428
GluGlu: 6.148 ± 0.854
GluPhe: 1.871 ± 0.285
GluGly: 5.346 ± 0.862
GluHis: 1.069 ± 0.41
GluIle: 4.277 ± 1.312
GluLys: 2.406 ± 0.324
GluLeu: 6.148 ± 0.742
GluMet: 1.604 ± 0.626
GluAsn: 2.406 ± 0.822
GluPro: 2.138 ± 0.584
GluGln: 2.406 ± 0.231
GluArg: 1.604 ± 0.882
GluSer: 4.277 ± 0.664
GluThr: 5.079 ± 1.404
GluVal: 3.208 ± 0.604
GluTrp: 0.802 ± 0.378
GluTyr: 3.208 ± 0.491
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.337 ± 0.485
PheCys: 0.267 ± 0.147
PheAsp: 0.802 ± 0.402
PheGlu: 2.406 ± 0.669
PhePhe: 2.94 ± 0.791
PheGly: 2.138 ± 0.461
PheHis: 0.535 ± 0.294
PheIle: 2.673 ± 0.936
PheLys: 2.673 ± 0.929
PheLeu: 3.742 ± 0.95
PheMet: 0.535 ± 0.294
PheAsn: 1.069 ± 0.421
PhePro: 2.406 ± 0.486
PheGln: 1.604 ± 0.566
PheArg: 2.138 ± 0.497
PheSer: 4.01 ± 0.509
PheThr: 2.94 ± 0.756
PheVal: 1.604 ± 0.297
PheTrp: 0.267 ± 0.391
PheTyr: 0.535 ± 0.257
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.94 ± 0.414
GlyCys: 1.069 ± 0.752
GlyAsp: 4.812 ± 0.892
GlyGlu: 2.673 ± 1.169
GlyPhe: 2.138 ± 0.506
GlyGly: 3.208 ± 0.276
GlyHis: 0.267 ± 0.147
GlyIle: 2.673 ± 0.649
GlyLys: 2.138 ± 1.063
GlyLeu: 9.088 ± 0.786
GlyMet: 1.871 ± 1.249
GlyAsn: 1.337 ± 0.378
GlyPro: 1.871 ± 0.613
GlyGln: 3.742 ± 0.698
GlyArg: 3.208 ± 0.985
GlySer: 3.475 ± 0.851
GlyThr: 2.673 ± 1.042
GlyVal: 3.475 ± 0.649
GlyTrp: 1.069 ± 0.65
GlyTyr: 1.871 ± 0.631
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.337 ± 0.549
HisCys: 0.802 ± 0.758
HisAsp: 1.069 ± 0.353
HisGlu: 1.337 ± 0.264
HisPhe: 0.802 ± 0.272
HisGly: 0.267 ± 0.147
HisHis: 0.802 ± 0.272
HisIle: 3.475 ± 0.791
HisLys: 1.337 ± 0.937
HisLeu: 2.673 ± 0.651
HisMet: 0.0 ± 0.0
HisAsn: 1.069 ± 0.653
HisPro: 2.138 ± 0.479
HisGln: 1.871 ± 0.475
HisArg: 1.337 ± 0.735
HisSer: 1.871 ± 0.579
HisThr: 1.069 ± 0.553
HisVal: 0.535 ± 0.637
HisTrp: 1.069 ± 0.453
HisTyr: 0.535 ± 0.294
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.94 ± 0.349
IleCys: 0.535 ± 0.352
IleAsp: 3.208 ± 0.761
IleGlu: 5.346 ± 1.223
IlePhe: 2.94 ± 1.249
IleGly: 2.94 ± 0.513
IleHis: 1.871 ± 0.985
IleIle: 5.079 ± 0.782
IleLys: 6.148 ± 2.1
IleLeu: 5.346 ± 0.902
IleMet: 2.138 ± 0.312
IleAsn: 3.475 ± 0.909
IlePro: 4.544 ± 0.739
IleGln: 2.673 ± 0.951
IleArg: 4.01 ± 0.966
IleSer: 3.742 ± 0.8
IleThr: 6.148 ± 0.582
IleVal: 4.544 ± 1.252
IleTrp: 1.871 ± 0.559
IleTyr: 4.277 ± 1.629
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.406 ± 0.822
LysCys: 1.069 ± 0.514
LysAsp: 4.277 ± 0.751
LysGlu: 5.079 ± 1.037
LysPhe: 1.337 ± 0.81
LysGly: 6.415 ± 0.784
LysHis: 1.604 ± 0.471
LysIle: 8.554 ± 1.262
LysLys: 5.346 ± 2.118
LysLeu: 5.346 ± 1.62
LysMet: 1.337 ± 1.278
LysAsn: 2.673 ± 0.793
LysPro: 0.802 ± 0.489
LysGln: 0.802 ± 0.758
LysArg: 2.138 ± 0.589
LysSer: 5.079 ± 1.563
LysThr: 4.277 ± 0.859
LysVal: 3.475 ± 0.954
LysTrp: 0.802 ± 0.378
LysTyr: 3.208 ± 0.445
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.613 ± 1.46
LeuCys: 1.337 ± 0.335
LeuAsp: 4.812 ± 0.935
LeuGlu: 8.821 ± 2.016
LeuPhe: 3.208 ± 0.422
LeuGly: 4.812 ± 1.022
LeuHis: 1.871 ± 0.692
LeuIle: 8.554 ± 1.152
LeuLys: 7.217 ± 1.277
LeuLeu: 7.485 ± 1.723
LeuMet: 2.406 ± 0.816
LeuAsn: 5.613 ± 1.366
LeuPro: 2.94 ± 0.572
LeuGln: 1.604 ± 0.391
LeuArg: 5.881 ± 0.734
LeuSer: 8.019 ± 1.678
LeuThr: 7.752 ± 1.585
LeuVal: 2.673 ± 0.867
LeuTrp: 1.337 ± 0.648
LeuTyr: 4.01 ± 0.722
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.138 ± 0.613
MetCys: 0.267 ± 0.147
MetAsp: 1.871 ± 0.513
MetGlu: 1.604 ± 1.273
MetPhe: 0.802 ± 0.402
MetGly: 1.604 ± 0.438
MetHis: 0.267 ± 0.147
MetIle: 1.604 ± 0.737
MetLys: 1.337 ± 0.514
MetLeu: 2.138 ± 0.421
MetMet: 0.0 ± 0.0
MetAsn: 1.069 ± 0.346
MetPro: 1.337 ± 0.735
MetGln: 0.0 ± 0.0
MetArg: 1.604 ± 0.4
MetSer: 1.871 ± 0.236
MetThr: 1.604 ± 0.469
MetVal: 1.604 ± 0.956
MetTrp: 0.535 ± 0.294
MetTyr: 0.267 ± 0.147
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.337 ± 0.264
AsnCys: 0.535 ± 0.294
AsnAsp: 2.406 ± 0.231
AsnGlu: 2.406 ± 0.698
AsnPhe: 1.871 ± 0.895
AsnGly: 1.871 ± 1.421
AsnHis: 2.138 ± 0.706
AsnIle: 4.01 ± 0.804
AsnLys: 2.673 ± 0.865
AsnLeu: 5.346 ± 0.744
AsnMet: 0.802 ± 0.298
AsnAsn: 2.138 ± 1.176
AsnPro: 3.475 ± 1.065
AsnGln: 2.406 ± 1.23
AsnArg: 1.604 ± 0.469
AsnSer: 4.812 ± 1.642
AsnThr: 2.138 ± 0.506
AsnVal: 1.604 ± 0.651
AsnTrp: 0.535 ± 0.294
AsnTyr: 2.94 ± 1.054
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.742 ± 1.583
ProCys: 0.535 ± 0.294
ProAsp: 2.406 ± 0.933
ProGlu: 1.604 ± 0.31
ProPhe: 0.802 ± 0.272
ProGly: 3.208 ± 0.856
ProHis: 1.337 ± 0.561
ProIle: 3.475 ± 0.866
ProLys: 2.94 ± 1.091
ProLeu: 5.079 ± 0.808
ProMet: 0.0 ± 0.0
ProAsn: 2.673 ± 0.527
ProPro: 4.01 ± 2.978
ProGln: 1.871 ± 0.444
ProArg: 2.138 ± 0.608
ProSer: 5.346 ± 0.728
ProThr: 4.01 ± 1.018
ProVal: 2.673 ± 1.295
ProTrp: 0.802 ± 0.272
ProTyr: 2.94 ± 0.64
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.604 ± 1.708
GlnCys: 0.0 ± 0.0
GlnAsp: 1.337 ± 0.444
GlnGlu: 1.871 ± 0.918
GlnPhe: 1.604 ± 0.696
GlnGly: 2.673 ± 0.441
GlnHis: 2.138 ± 1.045
GlnIle: 2.94 ± 0.855
GlnLys: 2.673 ± 0.572
GlnLeu: 2.138 ± 0.243
GlnMet: 1.069 ± 0.778
GlnAsn: 1.871 ± 0.613
GlnPro: 1.069 ± 0.41
GlnGln: 1.871 ± 0.996
GlnArg: 2.138 ± 0.863
GlnSer: 2.138 ± 0.897
GlnThr: 1.337 ± 0.8
GlnVal: 1.604 ± 0.563
GlnTrp: 1.069 ± 0.368
GlnTyr: 1.069 ± 0.41
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.138 ± 0.863
ArgCys: 1.871 ± 0.757
ArgAsp: 1.604 ± 0.908
ArgGlu: 4.01 ± 1.482
ArgPhe: 2.406 ± 0.966
ArgGly: 2.406 ± 0.813
ArgHis: 1.604 ± 0.77
ArgIle: 2.94 ± 0.719
ArgLys: 3.208 ± 0.8
ArgLeu: 5.346 ± 0.958
ArgMet: 1.069 ± 0.553
ArgAsn: 1.871 ± 0.469
ArgPro: 2.94 ± 0.702
ArgGln: 1.871 ± 0.892
ArgArg: 3.475 ± 1.065
ArgSer: 2.94 ± 0.447
ArgThr: 3.208 ± 0.816
ArgVal: 3.742 ± 0.481
ArgTrp: 0.802 ± 0.441
ArgTyr: 1.604 ± 0.438
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.01 ± 1.212
SerCys: 1.069 ± 0.65
SerAsp: 4.812 ± 1.992
SerGlu: 4.812 ± 1.474
SerPhe: 2.406 ± 0.966
SerGly: 2.673 ± 0.299
SerHis: 0.535 ± 0.257
SerIle: 4.01 ± 0.684
SerLys: 5.079 ± 1.037
SerLeu: 7.485 ± 2.088
SerMet: 1.604 ± 0.391
SerAsn: 5.346 ± 0.68
SerPro: 3.475 ± 0.966
SerGln: 1.871 ± 0.609
SerArg: 3.475 ± 0.941
SerSer: 6.415 ± 0.924
SerThr: 5.079 ± 1.626
SerVal: 5.079 ± 0.864
SerTrp: 1.871 ± 1.029
SerTyr: 2.94 ± 0.513
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.871 ± 0.935
ThrCys: 1.337 ± 0.629
ThrAsp: 5.079 ± 1.211
ThrGlu: 4.544 ± 1.363
ThrPhe: 1.604 ± 0.471
ThrGly: 3.742 ± 1.199
ThrHis: 2.406 ± 0.815
ThrIle: 5.079 ± 0.701
ThrLys: 5.079 ± 1.161
ThrLeu: 8.287 ± 1.386
ThrMet: 2.138 ± 0.421
ThrAsn: 3.208 ± 0.599
ThrPro: 3.475 ± 1.821
ThrGln: 2.406 ± 0.686
ThrArg: 2.673 ± 0.441
ThrSer: 5.613 ± 1.849
ThrThr: 7.217 ± 0.742
ThrVal: 4.544 ± 2.055
ThrTrp: 1.069 ± 0.353
ThrTyr: 2.94 ± 0.572
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.138 ± 2.164
ValCys: 1.337 ± 0.735
ValAsp: 1.604 ± 1.117
ValGlu: 1.871 ± 1.337
ValPhe: 2.94 ± 0.962
ValGly: 2.94 ± 1.82
ValHis: 1.604 ± 0.909
ValIle: 5.079 ± 1.272
ValLys: 2.406 ± 0.816
ValLeu: 3.475 ± 0.956
ValMet: 0.802 ± 0.441
ValAsn: 1.604 ± 0.648
ValPro: 2.94 ± 1.219
ValGln: 1.337 ± 0.264
ValArg: 3.208 ± 0.502
ValSer: 3.475 ± 0.829
ValThr: 7.217 ± 1.941
ValVal: 3.475 ± 0.547
ValTrp: 0.802 ± 0.402
ValTyr: 1.069 ± 0.349
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.337 ± 1.005
TrpCys: 0.267 ± 0.147
TrpAsp: 1.069 ± 0.453
TrpGlu: 1.069 ± 0.588
TrpPhe: 1.337 ± 0.444
TrpGly: 1.069 ± 0.353
TrpHis: 0.535 ± 0.294
TrpIle: 1.337 ± 0.81
TrpLys: 2.406 ± 1.079
TrpLeu: 1.337 ± 0.903
TrpMet: 1.337 ± 0.629
TrpAsn: 1.604 ± 0.471
TrpPro: 0.535 ± 0.315
TrpGln: 0.267 ± 0.318
TrpArg: 0.0 ± 0.0
TrpSer: 0.802 ± 0.272
TrpThr: 1.069 ± 0.368
TrpVal: 0.535 ± 0.257
TrpTrp: 0.267 ± 0.147
TrpTyr: 0.535 ± 0.294
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.802 ± 0.441
TyrCys: 0.535 ± 0.515
TyrAsp: 2.673 ± 0.962
TyrGlu: 1.337 ± 0.81
TyrPhe: 1.871 ± 0.758
TyrGly: 2.406 ± 0.433
TyrHis: 1.069 ± 0.306
TyrIle: 1.604 ± 0.882
TyrLys: 3.742 ± 1.402
TyrLeu: 4.544 ± 0.753
TyrMet: 1.337 ± 0.264
TyrAsn: 2.406 ± 0.966
TyrPro: 2.138 ± 0.584
TyrGln: 2.406 ± 0.608
TyrArg: 2.138 ± 0.827
TyrSer: 2.94 ± 0.686
TyrThr: 2.673 ± 0.441
TyrVal: 1.337 ± 0.416
TyrTrp: 0.535 ± 0.294
TyrTyr: 0.535 ± 0.294
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3742 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
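Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement, and the statistic is recomputed on each resample. The sketch below is a minimal illustration under stated assumptions: the function names, the fixed seed, the toy sequences, and the use of the standard deviation across replicates as the reported error are all hypothetical, since the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Overlapping dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of one dipeptide frequency over protein-level resamples.

    Returns the sample standard deviation (an assumed error measure).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not the tick rhabdovirus proteins).
toy = ["MKLLAGL", "MKAGLS", "GLGLTT", "MSSKLE", "AAGLKK"]
error_gl = bootstrap_error(toy, "GL")
```

With only 5 proteins, each resample can omit or duplicate an entire protein, which is why some errors in the table are as large as the values themselves (for example AlaAla: 5.079 ± 6.301).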

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski