Amino acid dipepetide frequency for Datura yellow vein nucleorhabdovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.372AlaAla: 4.372 ± 0.702
0.729AlaCys: 0.729 ± 0.435
2.915AlaAsp: 2.915 ± 0.814
4.858AlaGlu: 4.858 ± 1.744
2.915AlaPhe: 2.915 ± 0.816
2.672AlaGly: 2.672 ± 0.965
1.214AlaHis: 1.214 ± 0.593
4.858AlaIle: 4.858 ± 0.778
3.158AlaLys: 3.158 ± 1.158
7.044AlaLeu: 7.044 ± 1.876
0.729AlaMet: 0.729 ± 0.371
1.214AlaAsn: 1.214 ± 0.651
2.672AlaPro: 2.672 ± 0.743
1.457AlaGln: 1.457 ± 0.342
2.915AlaArg: 2.915 ± 0.556
3.158AlaSer: 3.158 ± 0.564
4.372AlaThr: 4.372 ± 0.987
2.672AlaVal: 2.672 ± 0.413
0.729AlaTrp: 0.729 ± 0.576
3.158AlaTyr: 3.158 ± 0.622
0.0AlaXaa: 0.0 ± 0.0
Cys
0.729CysAla: 0.729 ± 0.317
0.243CysCys: 0.243 ± 0.145
1.7CysAsp: 1.7 ± 0.498
0.486CysGlu: 0.486 ± 0.636
0.486CysPhe: 0.486 ± 0.29
2.186CysGly: 2.186 ± 0.346
0.486CysHis: 0.486 ± 0.463
0.972CysIle: 0.972 ± 0.322
2.915CysLys: 2.915 ± 0.541
2.186CysLeu: 2.186 ± 0.49
0.486CysMet: 0.486 ± 0.29
0.972CysAsn: 0.972 ± 0.58
0.972CysPro: 0.972 ± 0.318
0.0CysGln: 0.0 ± 0.0
0.486CysArg: 0.486 ± 0.274
2.186CysSer: 2.186 ± 0.772
0.486CysThr: 0.486 ± 0.463
1.214CysVal: 1.214 ± 0.417
0.0CysTrp: 0.0 ± 0.0
1.457CysTyr: 1.457 ± 0.55
0.0CysXaa: 0.0 ± 0.0
Asp
1.943AspAla: 1.943 ± 0.552
0.972AspCys: 0.972 ± 0.622
4.372AspAsp: 4.372 ± 1.501
5.101AspGlu: 5.101 ± 0.936
1.214AspPhe: 1.214 ± 0.563
2.672AspGly: 2.672 ± 0.807
0.729AspHis: 0.729 ± 0.576
5.829AspIle: 5.829 ± 0.703
4.372AspLys: 4.372 ± 0.647
4.372AspLeu: 4.372 ± 0.891
1.214AspMet: 1.214 ± 0.51
3.158AspAsn: 3.158 ± 1.056
3.643AspPro: 3.643 ± 1.126
0.729AspGln: 0.729 ± 0.435
2.186AspArg: 2.186 ± 1.305
4.129AspSer: 4.129 ± 1.319
5.587AspThr: 5.587 ± 1.798
2.186AspVal: 2.186 ± 0.625
0.243AspTrp: 0.243 ± 0.145
1.214AspTyr: 1.214 ± 0.559
0.0AspXaa: 0.0 ± 0.0
Glu
3.886GluAla: 3.886 ± 1.61
0.729GluCys: 0.729 ± 0.396
4.372GluAsp: 4.372 ± 0.565
7.773GluGlu: 7.773 ± 1.649
2.186GluPhe: 2.186 ± 0.719
3.158GluGly: 3.158 ± 0.603
2.915GluHis: 2.915 ± 1.326
4.129GluIle: 4.129 ± 1.141
2.186GluLys: 2.186 ± 0.321
4.615GluLeu: 4.615 ± 0.549
2.186GluMet: 2.186 ± 0.67
2.915GluAsn: 2.915 ± 0.805
2.186GluPro: 2.186 ± 1.175
1.457GluGln: 1.457 ± 0.492
4.129GluArg: 4.129 ± 0.807
4.372GluSer: 4.372 ± 0.952
3.401GluThr: 3.401 ± 0.497
3.401GluVal: 3.401 ± 0.811
0.243GluTrp: 0.243 ± 0.344
2.186GluTyr: 2.186 ± 0.38
0.0GluXaa: 0.0 ± 0.0
Phe
1.214PheAla: 1.214 ± 0.5
0.0PheCys: 0.0 ± 0.0
2.186PheAsp: 2.186 ± 1.273
0.972PheGlu: 0.972 ± 0.735
0.486PhePhe: 0.486 ± 0.29
1.7PheGly: 1.7 ± 0.475
1.457PheHis: 1.457 ± 0.396
0.972PheIle: 0.972 ± 0.318
1.943PheLys: 1.943 ± 0.533
2.429PheLeu: 2.429 ± 0.836
1.943PheMet: 1.943 ± 1.092
1.7PheAsn: 1.7 ± 1.037
0.972PhePro: 0.972 ± 0.4
2.186PheGln: 2.186 ± 0.461
1.943PheArg: 1.943 ± 0.908
4.129PheSer: 4.129 ± 1.062
1.457PheThr: 1.457 ± 0.492
1.7PheVal: 1.7 ± 0.573
0.486PheTrp: 0.486 ± 0.636
0.972PheTyr: 0.972 ± 0.37
0.0PheXaa: 0.0 ± 0.0
Gly
3.401GlyAla: 3.401 ± 1.331
0.729GlyCys: 0.729 ± 0.81
4.858GlyAsp: 4.858 ± 1.568
4.129GlyGlu: 4.129 ± 0.936
1.457GlyPhe: 1.457 ± 0.492
3.643GlyGly: 3.643 ± 1.375
1.457GlyHis: 1.457 ± 0.639
3.643GlyIle: 3.643 ± 0.59
4.372GlyLys: 4.372 ± 1.377
4.858GlyLeu: 4.858 ± 0.984
0.729GlyMet: 0.729 ± 0.572
2.186GlyAsn: 2.186 ± 1.097
1.7GlyPro: 1.7 ± 0.492
0.486GlyGln: 0.486 ± 0.262
3.158GlyArg: 3.158 ± 1.012
3.643GlySer: 3.643 ± 0.908
3.158GlyThr: 3.158 ± 0.958
4.372GlyVal: 4.372 ± 1.707
1.457GlyTrp: 1.457 ± 0.605
2.429GlyTyr: 2.429 ± 0.465
0.0GlyXaa: 0.0 ± 0.0
His
1.457HisAla: 1.457 ± 1.433
0.486HisCys: 0.486 ± 0.308
1.943HisAsp: 1.943 ± 0.775
1.943HisGlu: 1.943 ± 0.448
1.214HisPhe: 1.214 ± 0.439
2.672HisGly: 2.672 ± 0.579
0.729HisHis: 0.729 ± 0.435
0.729HisIle: 0.729 ± 0.279
1.7HisLys: 1.7 ± 0.805
1.457HisLeu: 1.457 ± 0.417
0.972HisMet: 0.972 ± 0.408
1.214HisAsn: 1.214 ± 0.559
1.943HisPro: 1.943 ± 0.408
1.457HisGln: 1.457 ± 0.663
0.972HisArg: 0.972 ± 0.37
1.7HisSer: 1.7 ± 0.574
0.972HisThr: 0.972 ± 0.328
0.972HisVal: 0.972 ± 0.375
0.486HisTrp: 0.486 ± 0.29
1.943HisTyr: 1.943 ± 0.699
0.0HisXaa: 0.0 ± 0.0
Ile
2.429IleAla: 2.429 ± 0.493
2.186IleCys: 2.186 ± 0.545
2.429IleAsp: 2.429 ± 0.968
2.429IleGlu: 2.429 ± 0.724
1.943IlePhe: 1.943 ± 0.676
4.372IleGly: 4.372 ± 0.641
0.972IleHis: 0.972 ± 0.322
6.315IleIle: 6.315 ± 2.059
5.829IleLys: 5.829 ± 1.289
4.372IleLeu: 4.372 ± 1.016
2.672IleMet: 2.672 ± 0.675
3.886IleAsn: 3.886 ± 1.231
2.672IlePro: 2.672 ± 0.608
1.457IleGln: 1.457 ± 0.411
4.129IleArg: 4.129 ± 1.015
8.016IleSer: 8.016 ± 1.111
4.129IleThr: 4.129 ± 1.456
4.858IleVal: 4.858 ± 0.965
1.214IleTrp: 1.214 ± 0.5
2.186IleTyr: 2.186 ± 0.744
0.0IleXaa: 0.0 ± 0.0
Lys
3.643LysAla: 3.643 ± 0.909
1.214LysCys: 1.214 ± 0.358
3.886LysAsp: 3.886 ± 1.024
4.372LysGlu: 4.372 ± 1.353
1.943LysPhe: 1.943 ± 0.501
3.643LysGly: 3.643 ± 0.736
2.429LysHis: 2.429 ± 0.239
5.587LysIle: 5.587 ± 0.963
5.587LysLys: 5.587 ± 2.07
4.129LysLeu: 4.129 ± 0.917
1.457LysMet: 1.457 ± 1.168
2.429LysAsn: 2.429 ± 0.374
0.729LysPro: 0.729 ± 0.603
2.915LysGln: 2.915 ± 1.096
4.858LysArg: 4.858 ± 0.713
5.344LysSer: 5.344 ± 1.355
3.886LysThr: 3.886 ± 1.345
4.858LysVal: 4.858 ± 0.635
0.729LysTrp: 0.729 ± 0.435
2.429LysTyr: 2.429 ± 0.478
0.0LysXaa: 0.0 ± 0.0
Leu
5.587LeuAla: 5.587 ± 0.853
2.915LeuCys: 2.915 ± 0.972
3.401LeuAsp: 3.401 ± 0.653
3.643LeuGlu: 3.643 ± 0.847
3.643LeuPhe: 3.643 ± 0.353
4.372LeuGly: 4.372 ± 1.519
1.7LeuHis: 1.7 ± 0.52
4.372LeuIle: 4.372 ± 1.534
4.615LeuLys: 4.615 ± 1.344
7.773LeuLeu: 7.773 ± 1.594
4.129LeuMet: 4.129 ± 1.047
4.129LeuAsn: 4.129 ± 0.808
4.372LeuPro: 4.372 ± 0.598
2.429LeuGln: 2.429 ± 1.02
4.858LeuArg: 4.858 ± 1.177
9.473LeuSer: 9.473 ± 1.724
5.101LeuThr: 5.101 ± 0.683
6.072LeuVal: 6.072 ± 0.49
1.457LeuTrp: 1.457 ± 1.153
3.886LeuTyr: 3.886 ± 1.459
0.0LeuXaa: 0.0 ± 0.0
Met
1.943MetAla: 1.943 ± 0.702
0.486MetCys: 0.486 ± 0.274
1.7MetAsp: 1.7 ± 0.467
1.7MetGlu: 1.7 ± 0.468
0.243MetPhe: 0.243 ± 0.281
0.972MetGly: 0.972 ± 0.37
0.486MetHis: 0.486 ± 0.489
1.943MetIle: 1.943 ± 0.387
2.672MetLys: 2.672 ± 1.114
2.915MetLeu: 2.915 ± 0.736
1.457MetMet: 1.457 ± 0.68
2.672MetAsn: 2.672 ± 1.117
0.0MetPro: 0.0 ± 0.0
0.486MetGln: 0.486 ± 0.308
1.214MetArg: 1.214 ± 0.34
3.643MetSer: 3.643 ± 0.971
1.943MetThr: 1.943 ± 0.501
1.7MetVal: 1.7 ± 0.65
0.729MetTrp: 0.729 ± 0.367
2.915MetTyr: 2.915 ± 0.924
0.0MetXaa: 0.0 ± 0.0
Asn
3.158AsnAla: 3.158 ± 0.64
0.243AsnCys: 0.243 ± 0.145
0.729AsnAsp: 0.729 ± 0.279
1.7AsnGlu: 1.7 ± 0.76
0.972AsnPhe: 0.972 ± 0.4
1.457AsnGly: 1.457 ± 0.479
1.214AsnHis: 1.214 ± 0.417
4.129AsnIle: 4.129 ± 0.571
3.401AsnLys: 3.401 ± 1.083
4.615AsnLeu: 4.615 ± 0.264
2.915AsnMet: 2.915 ± 0.792
2.672AsnAsn: 2.672 ± 0.806
2.429AsnPro: 2.429 ± 1.287
2.672AsnGln: 2.672 ± 0.826
2.186AsnArg: 2.186 ± 0.291
2.186AsnSer: 2.186 ± 0.689
2.915AsnThr: 2.915 ± 0.568
3.886AsnVal: 3.886 ± 0.862
1.214AsnTrp: 1.214 ± 0.5
1.943AsnTyr: 1.943 ± 0.683
0.0AsnXaa: 0.0 ± 0.0
Pro
4.129ProAla: 4.129 ± 1.062
0.243ProCys: 0.243 ± 0.318
1.457ProAsp: 1.457 ± 0.399
2.915ProGlu: 2.915 ± 1.049
2.186ProPhe: 2.186 ± 1.003
1.457ProGly: 1.457 ± 0.362
1.457ProHis: 1.457 ± 0.642
2.672ProIle: 2.672 ± 0.954
2.186ProLys: 2.186 ± 0.702
3.401ProLeu: 3.401 ± 0.506
0.243ProMet: 0.243 ± 0.145
1.214ProAsn: 1.214 ± 0.546
2.915ProPro: 2.915 ± 1.629
1.214ProGln: 1.214 ± 1.151
1.7ProArg: 1.7 ± 0.919
4.615ProSer: 4.615 ± 0.746
1.943ProThr: 1.943 ± 0.533
2.672ProVal: 2.672 ± 0.667
0.486ProTrp: 0.486 ± 0.467
2.672ProTyr: 2.672 ± 0.579
0.0ProXaa: 0.0 ± 0.0
Gln
2.186GlnAla: 2.186 ± 0.607
0.243GlnCys: 0.243 ± 0.281
1.7GlnAsp: 1.7 ± 0.282
1.943GlnGlu: 1.943 ± 0.701
0.243GlnPhe: 0.243 ± 0.145
0.972GlnGly: 0.972 ± 0.399
0.729GlnHis: 0.729 ± 0.633
1.457GlnIle: 1.457 ± 0.413
2.672GlnLys: 2.672 ± 0.506
0.729GlnLeu: 0.729 ± 0.47
0.729GlnMet: 0.729 ± 0.326
2.672GlnAsn: 2.672 ± 0.981
1.943GlnPro: 1.943 ± 0.574
0.972GlnGln: 0.972 ± 0.583
0.972GlnArg: 0.972 ± 0.387
2.672GlnSer: 2.672 ± 0.509
3.158GlnThr: 3.158 ± 0.506
1.457GlnVal: 1.457 ± 0.817
0.0GlnTrp: 0.0 ± 0.0
0.972GlnTyr: 0.972 ± 0.538
0.0GlnXaa: 0.0 ± 0.0
Arg
2.186ArgAla: 2.186 ± 0.79
0.972ArgCys: 0.972 ± 0.37
3.643ArgAsp: 3.643 ± 1.283
2.672ArgGlu: 2.672 ± 0.96
1.7ArgPhe: 1.7 ± 0.282
4.615ArgGly: 4.615 ± 1.052
1.214ArgHis: 1.214 ± 0.398
3.401ArgIle: 3.401 ± 0.736
2.186ArgLys: 2.186 ± 0.301
6.801ArgLeu: 6.801 ± 1.872
1.214ArgMet: 1.214 ± 0.398
2.429ArgAsn: 2.429 ± 0.88
2.672ArgPro: 2.672 ± 0.63
1.214ArgGln: 1.214 ± 0.519
2.672ArgArg: 2.672 ± 0.678
2.186ArgSer: 2.186 ± 0.601
3.886ArgThr: 3.886 ± 0.951
3.643ArgVal: 3.643 ± 0.353
0.729ArgTrp: 0.729 ± 0.71
1.457ArgTyr: 1.457 ± 0.488
0.0ArgXaa: 0.0 ± 0.0
Ser
5.344SerAla: 5.344 ± 1.81
1.7SerCys: 1.7 ± 0.68
5.587SerAsp: 5.587 ± 1.537
5.587SerGlu: 5.587 ± 0.927
2.672SerPhe: 2.672 ± 0.506
3.158SerGly: 3.158 ± 0.679
2.429SerHis: 2.429 ± 0.9
4.858SerIle: 4.858 ± 0.676
5.587SerLys: 5.587 ± 0.532
6.315SerLeu: 6.315 ± 1.58
2.672SerMet: 2.672 ± 0.626
3.886SerAsn: 3.886 ± 0.651
4.858SerPro: 4.858 ± 0.76
2.672SerGln: 2.672 ± 0.615
3.643SerArg: 3.643 ± 0.878
7.773SerSer: 7.773 ± 1.131
5.344SerThr: 5.344 ± 0.897
6.072SerVal: 6.072 ± 1.828
1.457SerTrp: 1.457 ± 0.456
2.915SerTyr: 2.915 ± 0.744
0.0SerXaa: 0.0 ± 0.0
Thr
3.643ThrAla: 3.643 ± 0.998
2.186ThrCys: 2.186 ± 1.386
3.158ThrAsp: 3.158 ± 0.899
2.186ThrGlu: 2.186 ± 0.978
1.7ThrPhe: 1.7 ± 0.922
5.101ThrGly: 5.101 ± 0.989
1.457ThrHis: 1.457 ± 0.634
4.372ThrIle: 4.372 ± 0.427
4.372ThrLys: 4.372 ± 0.977
6.558ThrLeu: 6.558 ± 0.989
2.672ThrMet: 2.672 ± 1.11
2.429ThrAsn: 2.429 ± 0.979
2.429ThrPro: 2.429 ± 0.332
1.457ThrGln: 1.457 ± 1.214
4.129ThrArg: 4.129 ± 0.866
6.072ThrSer: 6.072 ± 1.203
5.101ThrThr: 5.101 ± 1.138
3.401ThrVal: 3.401 ± 1.207
1.214ThrTrp: 1.214 ± 0.438
2.186ThrTyr: 2.186 ± 0.908
0.0ThrXaa: 0.0 ± 0.0
Val
4.615ValAla: 4.615 ± 0.681
2.429ValCys: 2.429 ± 0.9
2.672ValAsp: 2.672 ± 0.556
5.344ValGlu: 5.344 ± 1.495
1.943ValPhe: 1.943 ± 0.8
4.372ValGly: 4.372 ± 1.178
1.7ValHis: 1.7 ± 0.366
4.372ValIle: 4.372 ± 1.375
2.672ValLys: 2.672 ± 0.786
7.287ValLeu: 7.287 ± 1.228
1.214ValMet: 1.214 ± 0.505
2.186ValAsn: 2.186 ± 0.361
0.972ValPro: 0.972 ± 0.409
1.214ValGln: 1.214 ± 0.538
2.672ValArg: 2.672 ± 0.422
6.072ValSer: 6.072 ± 1.286
4.858ValThr: 4.858 ± 0.635
3.886ValVal: 3.886 ± 0.59
0.243ValTrp: 0.243 ± 0.354
1.457ValTyr: 1.457 ± 0.823
0.0ValXaa: 0.0 ± 0.0
Trp
0.243TrpAla: 0.243 ± 0.145
0.486TrpCys: 0.486 ± 0.709
0.972TrpAsp: 0.972 ± 0.622
1.214TrpGlu: 1.214 ± 0.306
0.486TrpPhe: 0.486 ± 0.636
0.972TrpGly: 0.972 ± 0.387
0.486TrpHis: 0.486 ± 0.636
0.729TrpIle: 0.729 ± 0.303
0.972TrpLys: 0.972 ± 0.251
1.457TrpLeu: 1.457 ± 0.634
0.486TrpMet: 0.486 ± 0.308
0.729TrpAsn: 0.729 ± 0.318
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.214TrpArg: 1.214 ± 0.546
0.729TrpSer: 0.729 ± 0.648
1.457TrpThr: 1.457 ± 0.627
0.729TrpVal: 0.729 ± 0.279
0.243TrpTrp: 0.243 ± 0.318
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.7TyrAla: 1.7 ± 0.922
1.457TyrCys: 1.457 ± 0.823
2.186TyrAsp: 2.186 ± 0.649
1.457TyrGlu: 1.457 ± 0.634
1.214TyrPhe: 1.214 ± 0.438
1.7TyrGly: 1.7 ± 0.793
1.7TyrHis: 1.7 ± 1.015
3.158TyrIle: 3.158 ± 1.159
2.672TyrLys: 2.672 ± 0.343
4.615TyrLeu: 4.615 ± 1.344
1.457TyrMet: 1.457 ± 0.634
2.186TyrAsn: 2.186 ± 0.63
1.7TyrPro: 1.7 ± 0.641
1.943TyrGln: 1.943 ± 0.5
1.457TyrArg: 1.457 ± 0.485
2.429TyrSer: 2.429 ± 0.647
2.672TyrThr: 2.672 ± 0.655
2.429TyrVal: 2.429 ± 0.563
0.243TyrTrp: 0.243 ± 0.145
2.429TyrTyr: 2.429 ± 0.747
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4118 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski