Amino acid dipepetide frequency for Pararge aegeria rhabdovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.178AlaAla: 1.178 ± 0.742
0.471AlaCys: 0.471 ± 0.336
2.356AlaAsp: 2.356 ± 0.29
1.413AlaGlu: 1.413 ± 0.397
1.178AlaPhe: 1.178 ± 0.262
4.476AlaGly: 4.476 ± 1.72
0.471AlaHis: 0.471 ± 0.285
2.591AlaIle: 2.591 ± 0.554
2.12AlaLys: 2.12 ± 1.346
5.183AlaLeu: 5.183 ± 1.201
1.649AlaMet: 1.649 ± 0.713
1.885AlaAsn: 1.885 ± 0.485
0.942AlaPro: 0.942 ± 0.268
1.649AlaGln: 1.649 ± 0.766
1.413AlaArg: 1.413 ± 0.714
3.062AlaSer: 3.062 ± 0.458
3.062AlaThr: 3.062 ± 0.884
0.942AlaVal: 0.942 ± 0.446
1.178AlaTrp: 1.178 ± 0.382
1.885AlaTyr: 1.885 ± 0.483
0.0AlaXaa: 0.0 ± 0.0
Cys
1.178CysAla: 1.178 ± 0.572
0.236CysCys: 0.236 ± 0.227
1.178CysAsp: 1.178 ± 0.757
0.707CysGlu: 0.707 ± 0.623
0.942CysPhe: 0.942 ± 0.57
0.942CysGly: 0.942 ± 0.538
0.236CysHis: 0.236 ± 0.227
1.649CysIle: 1.649 ± 0.256
1.413CysLys: 1.413 ± 0.736
1.178CysLeu: 1.178 ± 0.496
0.471CysMet: 0.471 ± 0.231
1.178CysAsn: 1.178 ± 0.48
0.942CysPro: 0.942 ± 0.396
0.707CysGln: 0.707 ± 0.341
1.178CysArg: 1.178 ± 0.714
1.885CysSer: 1.885 ± 0.507
1.413CysThr: 1.413 ± 0.51
1.178CysVal: 1.178 ± 0.338
0.942CysTrp: 0.942 ± 0.352
0.471CysTyr: 0.471 ± 0.303
0.0CysXaa: 0.0 ± 0.0
Asp
1.413AspAla: 1.413 ± 0.218
1.885AspCys: 1.885 ± 0.676
3.062AspAsp: 3.062 ± 0.543
3.534AspGlu: 3.534 ± 0.564
1.178AspPhe: 1.178 ± 0.354
2.356AspGly: 2.356 ± 0.772
1.413AspHis: 1.413 ± 0.465
2.356AspIle: 2.356 ± 0.681
2.591AspLys: 2.591 ± 0.888
8.245AspLeu: 8.245 ± 1.047
0.707AspMet: 0.707 ± 0.297
2.12AspAsn: 2.12 ± 0.798
2.591AspPro: 2.591 ± 1.061
2.12AspGln: 2.12 ± 0.728
3.062AspArg: 3.062 ± 1.024
3.062AspSer: 3.062 ± 0.555
3.298AspThr: 3.298 ± 0.811
4.947AspVal: 4.947 ± 0.568
1.178AspTrp: 1.178 ± 0.929
3.062AspTyr: 3.062 ± 0.934
0.0AspXaa: 0.0 ± 0.0
Glu
1.413GluAla: 1.413 ± 0.931
0.236GluCys: 0.236 ± 0.34
2.827GluAsp: 2.827 ± 1.289
1.413GluGlu: 1.413 ± 0.338
1.885GluPhe: 1.885 ± 0.876
6.832GluGly: 6.832 ± 1.022
0.236GluHis: 0.236 ± 0.363
3.769GluIle: 3.769 ± 1.323
1.885GluLys: 1.885 ± 0.876
6.596GluLeu: 6.596 ± 1.208
1.885GluMet: 1.885 ± 0.568
2.827GluAsn: 2.827 ± 0.325
1.649GluPro: 1.649 ± 0.514
2.591GluGln: 2.591 ± 0.535
2.591GluArg: 2.591 ± 0.666
3.534GluSer: 3.534 ± 0.809
3.769GluThr: 3.769 ± 0.909
3.298GluVal: 3.298 ± 0.838
0.471GluTrp: 0.471 ± 0.416
4.005GluTyr: 4.005 ± 0.712
0.0GluXaa: 0.0 ± 0.0
Phe
1.413PheAla: 1.413 ± 0.42
0.236PheCys: 0.236 ± 0.143
1.413PheAsp: 1.413 ± 0.594
2.356PheGlu: 2.356 ± 0.664
1.178PhePhe: 1.178 ± 0.496
1.413PheGly: 1.413 ± 0.557
1.885PheHis: 1.885 ± 0.507
2.591PheIle: 2.591 ± 1.331
2.827PheLys: 2.827 ± 0.681
5.889PheLeu: 5.889 ± 1.668
0.471PheMet: 0.471 ± 0.416
1.649PheAsn: 1.649 ± 0.756
3.062PhePro: 3.062 ± 0.733
1.649PheGln: 1.649 ± 0.347
2.12PheArg: 2.12 ± 0.425
3.534PheSer: 3.534 ± 0.626
2.12PheThr: 2.12 ± 0.447
0.942PheVal: 0.942 ± 0.352
0.707PheTrp: 0.707 ± 0.296
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.12GlyAla: 2.12 ± 0.843
1.178GlyCys: 1.178 ± 0.303
2.827GlyAsp: 2.827 ± 0.645
4.005GlyGlu: 4.005 ± 0.977
3.534GlyPhe: 3.534 ± 1.081
4.947GlyGly: 4.947 ± 1.214
0.942GlyHis: 0.942 ± 0.524
5.183GlyIle: 5.183 ± 1.238
3.062GlyLys: 3.062 ± 0.597
9.423GlyLeu: 9.423 ± 1.398
1.649GlyMet: 1.649 ± 0.523
2.356GlyAsn: 2.356 ± 0.931
3.534GlyPro: 3.534 ± 0.758
1.885GlyGln: 1.885 ± 0.478
3.769GlyArg: 3.769 ± 0.334
7.067GlySer: 7.067 ± 0.923
3.534GlyThr: 3.534 ± 0.806
5.183GlyVal: 5.183 ± 1.026
1.178GlyTrp: 1.178 ± 0.262
3.298GlyTyr: 3.298 ± 1.111
0.0GlyXaa: 0.0 ± 0.0
His
1.178HisAla: 1.178 ± 0.478
0.707HisCys: 0.707 ± 0.428
0.707HisAsp: 0.707 ± 0.356
1.413HisGlu: 1.413 ± 0.493
1.413HisPhe: 1.413 ± 0.346
1.413HisGly: 1.413 ± 0.623
0.471HisHis: 0.471 ± 0.424
1.885HisIle: 1.885 ± 0.864
1.178HisLys: 1.178 ± 0.374
1.885HisLeu: 1.885 ± 0.472
0.471HisMet: 0.471 ± 0.231
1.178HisAsn: 1.178 ± 0.44
1.649HisPro: 1.649 ± 0.272
1.413HisGln: 1.413 ± 0.591
2.591HisArg: 2.591 ± 0.736
1.649HisSer: 1.649 ± 0.461
0.707HisThr: 0.707 ± 0.264
1.413HisVal: 1.413 ± 0.378
0.942HisTrp: 0.942 ± 0.467
1.649HisTyr: 1.649 ± 0.368
0.0HisXaa: 0.0 ± 0.0
Ile
1.885IleAla: 1.885 ± 0.423
1.413IleCys: 1.413 ± 0.411
4.711IleAsp: 4.711 ± 1.033
2.591IleGlu: 2.591 ± 0.662
1.413IlePhe: 1.413 ± 0.393
5.654IleGly: 5.654 ± 1.128
3.062IleHis: 3.062 ± 0.595
4.476IleIle: 4.476 ± 0.902
3.534IleLys: 3.534 ± 0.891
5.889IleLeu: 5.889 ± 0.915
0.942IleMet: 0.942 ± 0.696
3.534IleAsn: 3.534 ± 0.598
5.889IlePro: 5.889 ± 1.205
1.885IleGln: 1.885 ± 0.445
5.183IleArg: 5.183 ± 1.032
5.889IleSer: 5.889 ± 0.66
4.711IleThr: 4.711 ± 1.059
3.298IleVal: 3.298 ± 0.922
0.707IleTrp: 0.707 ± 0.296
3.534IleTyr: 3.534 ± 0.675
0.0IleXaa: 0.0 ± 0.0
Lys
3.534LysAla: 3.534 ± 0.694
1.413LysCys: 1.413 ± 0.279
2.12LysAsp: 2.12 ± 0.719
4.005LysGlu: 4.005 ± 1.465
1.885LysPhe: 1.885 ± 1.124
3.769LysGly: 3.769 ± 0.585
0.471LysHis: 0.471 ± 0.336
2.827LysIle: 2.827 ± 0.666
4.476LysLys: 4.476 ± 0.613
5.418LysLeu: 5.418 ± 1.456
1.885LysMet: 1.885 ± 0.605
3.062LysAsn: 3.062 ± 0.675
2.591LysPro: 2.591 ± 0.638
0.707LysGln: 0.707 ± 0.567
4.476LysArg: 4.476 ± 1.275
3.062LysSer: 3.062 ± 0.663
4.711LysThr: 4.711 ± 0.879
3.298LysVal: 3.298 ± 0.43
0.707LysTrp: 0.707 ± 0.26
1.413LysTyr: 1.413 ± 0.734
0.0LysXaa: 0.0 ± 0.0
Leu
4.947LeuAla: 4.947 ± 1.267
2.827LeuCys: 2.827 ± 0.842
6.125LeuAsp: 6.125 ± 1.161
6.832LeuGlu: 6.832 ± 1.281
4.24LeuPhe: 4.24 ± 0.964
7.303LeuGly: 7.303 ± 1.163
2.827LeuHis: 2.827 ± 0.658
10.13LeuIle: 10.13 ± 1.418
5.654LeuLys: 5.654 ± 0.821
11.779LeuLeu: 11.779 ± 1.864
2.356LeuMet: 2.356 ± 0.723
5.654LeuAsn: 5.654 ± 0.955
5.183LeuPro: 5.183 ± 1.184
3.534LeuGln: 3.534 ± 0.906
6.36LeuArg: 6.36 ± 2.134
8.952LeuSer: 8.952 ± 1.065
5.418LeuThr: 5.418 ± 0.859
5.418LeuVal: 5.418 ± 1.541
1.413LeuTrp: 1.413 ± 0.433
4.947LeuTyr: 4.947 ± 0.344
0.0LeuXaa: 0.0 ± 0.0
Met
0.707MetAla: 0.707 ± 0.266
0.471MetCys: 0.471 ± 0.285
1.413MetAsp: 1.413 ± 0.918
0.942MetGlu: 0.942 ± 0.561
0.942MetPhe: 0.942 ± 0.524
2.356MetGly: 2.356 ± 0.623
0.471MetHis: 0.471 ± 0.379
1.413MetIle: 1.413 ± 0.481
1.649MetLys: 1.649 ± 0.46
1.885MetLeu: 1.885 ± 0.762
1.413MetMet: 1.413 ± 0.397
1.178MetAsn: 1.178 ± 0.554
0.236MetPro: 0.236 ± 0.143
0.707MetGln: 0.707 ± 0.428
0.707MetArg: 0.707 ± 0.266
2.827MetSer: 2.827 ± 0.684
1.885MetThr: 1.885 ± 0.704
0.0MetVal: 0.0 ± 0.0
0.471MetTrp: 0.471 ± 0.4
0.471MetTyr: 0.471 ± 0.336
0.0MetXaa: 0.0 ± 0.0
Asn
1.885AsnAla: 1.885 ± 0.759
0.707AsnCys: 0.707 ± 0.264
1.413AsnAsp: 1.413 ± 0.437
1.649AsnGlu: 1.649 ± 0.401
1.885AsnPhe: 1.885 ± 0.254
4.24AsnGly: 4.24 ± 0.961
2.12AsnHis: 2.12 ± 0.719
2.356AsnIle: 2.356 ± 0.365
1.885AsnLys: 1.885 ± 0.498
5.183AsnLeu: 5.183 ± 1.384
1.413AsnMet: 1.413 ± 0.543
1.885AsnAsn: 1.885 ± 0.545
3.062AsnPro: 3.062 ± 0.585
3.534AsnGln: 3.534 ± 0.826
1.413AsnArg: 1.413 ± 0.346
3.534AsnSer: 3.534 ± 0.982
0.942AsnThr: 0.942 ± 0.37
1.649AsnVal: 1.649 ± 0.46
0.942AsnTrp: 0.942 ± 0.407
1.413AsnTyr: 1.413 ± 0.428
0.0AsnXaa: 0.0 ± 0.0
Pro
2.12ProAla: 2.12 ± 0.973
0.942ProCys: 0.942 ± 0.606
4.24ProAsp: 4.24 ± 1.059
2.591ProGlu: 2.591 ± 0.831
1.649ProPhe: 1.649 ± 0.469
2.12ProGly: 2.12 ± 0.884
1.413ProHis: 1.413 ± 0.417
2.591ProIle: 2.591 ± 0.812
3.062ProLys: 3.062 ± 0.469
6.36ProLeu: 6.36 ± 1.684
0.471ProMet: 0.471 ± 0.333
1.885ProAsn: 1.885 ± 0.488
3.534ProPro: 3.534 ± 0.818
2.356ProGln: 2.356 ± 0.665
3.298ProArg: 3.298 ± 0.961
5.183ProSer: 5.183 ± 0.687
2.356ProThr: 2.356 ± 0.735
3.298ProVal: 3.298 ± 1.081
0.707ProTrp: 0.707 ± 0.442
2.356ProTyr: 2.356 ± 0.319
0.0ProXaa: 0.0 ± 0.0
Gln
1.885GlnAla: 1.885 ± 0.973
1.178GlnCys: 1.178 ± 0.48
1.413GlnAsp: 1.413 ± 0.63
1.885GlnGlu: 1.885 ± 0.467
1.413GlnPhe: 1.413 ± 0.409
4.24GlnGly: 4.24 ± 0.936
0.942GlnHis: 0.942 ± 0.494
1.413GlnIle: 1.413 ± 0.803
2.356GlnLys: 2.356 ± 0.502
3.534GlnLeu: 3.534 ± 0.911
0.942GlnMet: 0.942 ± 0.467
1.178GlnAsn: 1.178 ± 0.496
1.885GlnPro: 1.885 ± 0.369
0.471GlnGln: 0.471 ± 0.238
2.12GlnArg: 2.12 ± 0.42
4.005GlnSer: 4.005 ± 0.889
2.12GlnThr: 2.12 ± 0.768
1.649GlnVal: 1.649 ± 0.64
0.0GlnTrp: 0.0 ± 0.0
1.178GlnTyr: 1.178 ± 0.766
0.0GlnXaa: 0.0 ± 0.0
Arg
2.356ArgAla: 2.356 ± 0.518
0.942ArgCys: 0.942 ± 0.462
2.591ArgAsp: 2.591 ± 0.551
3.298ArgGlu: 3.298 ± 0.884
3.534ArgPhe: 3.534 ± 0.455
3.769ArgGly: 3.769 ± 0.779
2.12ArgHis: 2.12 ± 0.625
3.769ArgIle: 3.769 ± 0.991
2.591ArgLys: 2.591 ± 1.334
6.36ArgLeu: 6.36 ± 0.886
0.471ArgMet: 0.471 ± 0.238
2.12ArgAsn: 2.12 ± 0.584
2.356ArgPro: 2.356 ± 0.666
1.885ArgGln: 1.885 ± 0.568
2.591ArgArg: 2.591 ± 0.625
4.476ArgSer: 4.476 ± 0.317
3.534ArgThr: 3.534 ± 1.07
4.711ArgVal: 4.711 ± 0.92
0.707ArgTrp: 0.707 ± 0.391
1.178ArgTyr: 1.178 ± 0.407
0.0ArgXaa: 0.0 ± 0.0
Ser
3.298SerAla: 3.298 ± 0.552
2.827SerCys: 2.827 ± 1.072
6.36SerAsp: 6.36 ± 1.504
4.947SerGlu: 4.947 ± 1.301
2.591SerPhe: 2.591 ± 0.455
5.654SerGly: 5.654 ± 1.131
2.356SerHis: 2.356 ± 0.606
7.303SerIle: 7.303 ± 1.296
5.183SerLys: 5.183 ± 1.015
8.009SerLeu: 8.009 ± 1.11
0.707SerMet: 0.707 ± 0.266
2.356SerAsn: 2.356 ± 0.664
5.183SerPro: 5.183 ± 1.033
3.534SerGln: 3.534 ± 0.842
4.476SerArg: 4.476 ± 1.062
8.245SerSer: 8.245 ± 3.798
4.24SerThr: 4.24 ± 0.716
2.591SerVal: 2.591 ± 0.533
1.649SerTrp: 1.649 ± 0.52
3.062SerTyr: 3.062 ± 1.054
0.0SerXaa: 0.0 ± 0.0
Thr
2.12ThrAla: 2.12 ± 0.925
0.471ThrCys: 0.471 ± 0.303
3.534ThrAsp: 3.534 ± 0.63
4.005ThrGlu: 4.005 ± 0.622
2.591ThrPhe: 2.591 ± 0.638
3.298ThrGly: 3.298 ± 1.405
1.649ThrHis: 1.649 ± 0.766
4.005ThrIle: 4.005 ± 0.934
3.769ThrLys: 3.769 ± 0.728
5.654ThrLeu: 5.654 ± 1.204
1.649ThrMet: 1.649 ± 0.401
2.827ThrAsn: 2.827 ± 0.62
2.591ThrPro: 2.591 ± 0.97
1.178ThrGln: 1.178 ± 0.548
3.062ThrArg: 3.062 ± 0.869
5.183ThrSer: 5.183 ± 1.146
3.534ThrThr: 3.534 ± 1.412
3.062ThrVal: 3.062 ± 0.836
2.591ThrTrp: 2.591 ± 0.597
2.12ThrTyr: 2.12 ± 0.768
0.0ThrXaa: 0.0 ± 0.0
Val
2.12ValAla: 2.12 ± 0.757
0.942ValCys: 0.942 ± 0.396
2.827ValAsp: 2.827 ± 0.65
2.12ValGlu: 2.12 ± 0.564
1.413ValPhe: 1.413 ± 0.666
2.591ValGly: 2.591 ± 0.841
1.413ValHis: 1.413 ± 0.47
5.183ValIle: 5.183 ± 0.575
2.827ValLys: 2.827 ± 0.828
5.183ValLeu: 5.183 ± 1.035
1.178ValMet: 1.178 ± 0.617
2.356ValAsn: 2.356 ± 0.646
2.12ValPro: 2.12 ± 0.555
2.356ValGln: 2.356 ± 0.983
2.12ValArg: 2.12 ± 0.669
4.711ValSer: 4.711 ± 1.07
4.711ValThr: 4.711 ± 1.298
2.591ValVal: 2.591 ± 0.737
0.471ValTrp: 0.471 ± 0.285
2.12ValTyr: 2.12 ± 0.636
0.0ValXaa: 0.0 ± 0.0
Trp
0.942TrpAla: 0.942 ± 0.464
0.0TrpCys: 0.0 ± 0.0
1.649TrpAsp: 1.649 ± 0.883
1.178TrpGlu: 1.178 ± 0.312
1.885TrpPhe: 1.885 ± 0.767
0.942TrpGly: 0.942 ± 0.37
0.236TrpHis: 0.236 ± 0.143
2.12TrpIle: 2.12 ± 0.894
1.649TrpLys: 1.649 ± 0.998
1.649TrpLeu: 1.649 ± 0.256
0.236TrpMet: 0.236 ± 0.143
0.942TrpAsn: 0.942 ± 0.37
0.942TrpPro: 0.942 ± 0.407
0.0TrpGln: 0.0 ± 0.0
0.471TrpArg: 0.471 ± 0.285
0.942TrpSer: 0.942 ± 0.34
0.942TrpThr: 0.942 ± 0.548
0.236TrpVal: 0.236 ± 0.34
0.471TrpTrp: 0.471 ± 0.554
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.413TyrAla: 1.413 ± 0.565
0.942TyrCys: 0.942 ± 0.757
1.413TyrAsp: 1.413 ± 0.378
2.827TyrGlu: 2.827 ± 0.813
0.707TyrPhe: 0.707 ± 0.264
2.356TyrGly: 2.356 ± 0.732
1.178TyrHis: 1.178 ± 0.725
2.827TyrIle: 2.827 ± 0.556
2.12TyrLys: 2.12 ± 0.431
6.596TyrLeu: 6.596 ± 0.765
0.942TyrMet: 0.942 ± 0.245
0.942TyrAsn: 0.942 ± 0.352
2.591TyrPro: 2.591 ± 0.563
1.649TyrGln: 1.649 ± 0.6
2.356TyrArg: 2.356 ± 0.691
3.769TyrSer: 3.769 ± 1.11
1.885TyrThr: 1.885 ± 0.454
1.649TyrVal: 1.649 ± 0.82
0.0TyrTrp: 0.0 ± 0.0
1.649TyrTyr: 1.649 ± 0.368
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4246 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski