Amino acid dipepetide frequency for Dugbe virus (isolate ArD44313) (DUGV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.307AlaAla: 2.307 ± 0.486
1.318AlaCys: 1.318 ± 0.357
0.989AlaAsp: 0.989 ± 0.233
3.131AlaGlu: 3.131 ± 0.699
0.824AlaPhe: 0.824 ± 0.218
2.472AlaGly: 2.472 ± 0.77
0.33AlaHis: 0.33 ± 0.18
2.966AlaIle: 2.966 ± 1.085
2.142AlaLys: 2.142 ± 0.185
6.261AlaLeu: 6.261 ± 1.738
0.989AlaMet: 0.989 ± 0.303
1.977AlaAsn: 1.977 ± 0.291
1.318AlaPro: 1.318 ± 0.16
1.318AlaGln: 1.318 ± 1.447
2.142AlaArg: 2.142 ± 0.096
3.46AlaSer: 3.46 ± 1.092
2.472AlaThr: 2.472 ± 1.282
4.778AlaVal: 4.778 ± 0.295
0.494AlaTrp: 0.494 ± 0.751
0.989AlaTyr: 0.989 ± 0.678
0.0AlaXaa: 0.0 ± 0.0
Cys
0.989CysAla: 0.989 ± 1.074
1.483CysCys: 1.483 ± 0.317
0.989CysAsp: 0.989 ± 0.305
1.648CysGlu: 1.648 ± 0.338
1.483CysPhe: 1.483 ± 0.267
0.824CysGly: 0.824 ± 0.626
0.494CysHis: 0.494 ± 0.106
1.977CysIle: 1.977 ± 0.423
1.318CysLys: 1.318 ± 0.778
2.801CysLeu: 2.801 ± 0.575
0.494CysMet: 0.494 ± 0.325
1.648CysAsn: 1.648 ± 1.171
1.812CysPro: 1.812 ± 1.189
0.989CysGln: 0.989 ± 0.211
2.636CysArg: 2.636 ± 0.825
2.307CysSer: 2.307 ± 0.383
2.142CysThr: 2.142 ± 1.496
1.648CysVal: 1.648 ± 0.442
0.494CysTrp: 0.494 ± 0.106
1.483CysTyr: 1.483 ± 1.242
0.0CysXaa: 0.0 ± 0.0
Asp
1.977AspAla: 1.977 ± 0.917
2.307AspCys: 2.307 ± 0.476
3.131AspAsp: 3.131 ± 0.855
3.79AspGlu: 3.79 ± 0.51
1.812AspPhe: 1.812 ± 0.482
3.295AspGly: 3.295 ± 0.626
0.989AspHis: 0.989 ± 0.299
4.614AspIle: 4.614 ± 0.491
3.625AspLys: 3.625 ± 0.702
5.767AspLeu: 5.767 ± 1.304
1.153AspMet: 1.153 ± 0.243
2.966AspAsn: 2.966 ± 0.811
0.989AspPro: 0.989 ± 0.211
0.824AspGln: 0.824 ± 0.221
2.966AspArg: 2.966 ± 0.543
5.273AspSer: 5.273 ± 0.715
1.977AspThr: 1.977 ± 0.466
4.119AspVal: 4.119 ± 0.385
0.659AspTrp: 0.659 ± 0.146
2.472AspTyr: 2.472 ± 0.857
0.0AspXaa: 0.0 ± 0.0
Glu
3.625GluAla: 3.625 ± 0.902
2.801GluCys: 2.801 ± 0.705
5.108GluAsp: 5.108 ± 1.317
4.943GluGlu: 4.943 ± 1.057
2.636GluPhe: 2.636 ± 0.363
4.614GluGly: 4.614 ± 1.019
1.648GluHis: 1.648 ± 0.596
3.625GluIle: 3.625 ± 0.178
5.108GluLys: 5.108 ± 0.305
8.568GluLeu: 8.568 ± 0.384
1.812GluMet: 1.812 ± 1.367
2.636GluAsn: 2.636 ± 0.566
1.648GluPro: 1.648 ± 0.413
2.472GluGln: 2.472 ± 0.546
3.625GluArg: 3.625 ± 0.761
3.79GluSer: 3.79 ± 0.738
3.955GluThr: 3.955 ± 0.827
6.097GluVal: 6.097 ± 0.927
0.659GluTrp: 0.659 ± 0.306
1.812GluTyr: 1.812 ± 0.214
0.0GluXaa: 0.0 ± 0.0
Phe
1.483PheAla: 1.483 ± 0.805
1.648PheCys: 1.648 ± 0.443
1.812PheAsp: 1.812 ± 0.923
2.801PheGlu: 2.801 ± 0.211
2.636PhePhe: 2.636 ± 0.243
1.812PheGly: 1.812 ± 0.214
0.0PheHis: 0.0 ± 0.0
2.636PheIle: 2.636 ± 0.243
3.131PheLys: 3.131 ± 0.266
5.437PheLeu: 5.437 ± 0.817
0.989PheMet: 0.989 ± 0.299
2.307PheAsn: 2.307 ± 0.661
1.318PhePro: 1.318 ± 0.472
0.659PheGln: 0.659 ± 0.306
0.989PheArg: 0.989 ± 0.299
4.449PheSer: 4.449 ± 0.383
3.295PheThr: 3.295 ± 0.614
0.824PheVal: 0.824 ± 0.257
0.165PheTrp: 0.165 ± 0.09
1.483PheTyr: 1.483 ± 0.808
0.0PheXaa: 0.0 ± 0.0
Gly
1.812GlyAla: 1.812 ± 0.482
1.977GlyCys: 1.977 ± 1.038
2.966GlyAsp: 2.966 ± 1.062
3.131GlyGlu: 3.131 ± 0.237
0.824GlyPhe: 0.824 ± 0.257
2.472GlyGly: 2.472 ± 0.382
0.989GlyHis: 0.989 ± 0.536
3.79GlyIle: 3.79 ± 0.622
5.932GlyLys: 5.932 ± 1.783
6.261GlyLeu: 6.261 ± 1.364
1.483GlyMet: 1.483 ± 0.317
3.295GlyAsn: 3.295 ± 0.889
2.142GlyPro: 2.142 ± 0.185
1.318GlyGln: 1.318 ± 0.283
2.801GlyArg: 2.801 ± 0.882
3.79GlySer: 3.79 ± 0.094
2.966GlyThr: 2.966 ± 0.761
2.801GlyVal: 2.801 ± 1.698
0.494GlyTrp: 0.494 ± 0.269
1.153GlyTyr: 1.153 ± 0.226
0.0GlyXaa: 0.0 ± 0.0
His
0.659HisAla: 0.659 ± 0.359
1.318HisCys: 1.318 ± 0.32
0.824HisAsp: 0.824 ± 0.221
0.33HisGlu: 0.33 ± 0.18
0.824HisPhe: 0.824 ± 0.449
0.659HisGly: 0.659 ± 0.263
0.33HisHis: 0.33 ± 0.131
0.989HisIle: 0.989 ± 0.299
0.659HisLys: 0.659 ± 0.263
2.801HisLeu: 2.801 ± 0.276
0.659HisMet: 0.659 ± 0.146
0.494HisAsn: 0.494 ± 0.106
0.824HisPro: 0.824 ± 0.692
0.824HisGln: 0.824 ± 0.388
1.318HisArg: 1.318 ± 0.32
1.812HisSer: 1.812 ± 0.214
1.318HisThr: 1.318 ± 0.526
1.483HisVal: 1.483 ± 0.267
0.494HisTrp: 0.494 ± 0.106
0.494HisTyr: 0.494 ± 0.325
0.0HisXaa: 0.0 ± 0.0
Ile
3.46IleAla: 3.46 ± 0.456
0.989IleCys: 0.989 ± 0.649
2.966IleAsp: 2.966 ± 0.793
3.46IleGlu: 3.46 ± 0.714
2.966IlePhe: 2.966 ± 0.296
2.636IleGly: 2.636 ± 0.825
1.153IleHis: 1.153 ± 0.238
2.636IleIle: 2.636 ± 0.243
5.602IleLys: 5.602 ± 0.717
4.778IleLeu: 4.778 ± 0.419
1.648IleMet: 1.648 ± 0.606
3.131IleAsn: 3.131 ± 0.686
1.812IlePro: 1.812 ± 0.909
1.812IleGln: 1.812 ± 0.489
4.119IleArg: 4.119 ± 0.385
6.261IleSer: 6.261 ± 0.995
3.295IleThr: 3.295 ± 0.736
3.79IleVal: 3.79 ± 0.364
0.659IleTrp: 0.659 ± 0.146
1.318IleTyr: 1.318 ± 0.283
0.0IleXaa: 0.0 ± 0.0
Lys
2.801LysAla: 2.801 ± 0.88
1.812LysCys: 1.812 ± 0.355
5.767LysAsp: 5.767 ± 0.822
6.92LysGlu: 6.92 ± 0.553
2.966LysPhe: 2.966 ± 0.19
3.625LysGly: 3.625 ± 1.385
1.812LysHis: 1.812 ± 0.214
3.295LysIle: 3.295 ± 0.278
8.074LysLys: 8.074 ± 1.542
11.04LysLeu: 11.04 ± 1.062
1.318LysMet: 1.318 ± 0.587
3.295LysAsn: 3.295 ± 0.177
2.801LysPro: 2.801 ± 0.393
2.966LysGln: 2.966 ± 0.618
4.778LysArg: 4.778 ± 0.609
3.295LysSer: 3.295 ± 1.077
3.955LysThr: 3.955 ± 0.813
5.273LysVal: 5.273 ± 0.609
0.824LysTrp: 0.824 ± 0.701
1.483LysTyr: 1.483 ± 0.342
0.0LysXaa: 0.0 ± 0.0
Leu
4.614LeuAla: 4.614 ± 1.688
2.307LeuCys: 2.307 ± 0.112
5.932LeuAsp: 5.932 ± 0.828
8.733LeuGlu: 8.733 ± 1.303
4.284LeuPhe: 4.284 ± 0.37
5.108LeuGly: 5.108 ± 0.548
3.131LeuHis: 3.131 ± 0.22
7.415LeuIle: 7.415 ± 1.216
8.733LeuLys: 8.733 ± 1.059
13.182LeuLeu: 13.182 ± 1.446
2.142LeuMet: 2.142 ± 0.441
6.097LeuAsn: 6.097 ± 1.711
3.295LeuPro: 3.295 ± 0.871
4.119LeuGln: 4.119 ± 0.916
5.437LeuArg: 5.437 ± 0.643
10.216LeuSer: 10.216 ± 1.542
8.074LeuThr: 8.074 ± 0.689
6.097LeuVal: 6.097 ± 0.441
0.989LeuTrp: 0.989 ± 0.233
3.131LeuTyr: 3.131 ± 0.491
0.0LeuXaa: 0.0 ± 0.0
Met
0.824MetAla: 0.824 ± 0.257
0.33MetCys: 0.33 ± 0.131
1.483MetAsp: 1.483 ± 0.531
1.153MetGlu: 1.153 ± 0.226
0.824MetPhe: 0.824 ± 0.449
1.318MetGly: 1.318 ± 0.686
0.659MetHis: 0.659 ± 0.306
1.318MetIle: 1.318 ± 0.587
1.977MetLys: 1.977 ± 0.57
3.625MetLeu: 3.625 ± 0.902
0.659MetMet: 0.659 ± 0.359
0.659MetAsn: 0.659 ± 0.146
0.165MetPro: 0.165 ± 0.09
1.153MetGln: 1.153 ± 0.629
0.33MetArg: 0.33 ± 0.18
2.307MetSer: 2.307 ± 0.661
0.824MetThr: 0.824 ± 0.388
0.989MetVal: 0.989 ± 0.305
0.165MetTrp: 0.165 ± 0.199
0.33MetTyr: 0.33 ± 0.18
0.0MetXaa: 0.0 ± 0.0
Asn
1.977AsnAla: 1.977 ± 1.927
1.483AsnCys: 1.483 ± 0.714
2.307AsnAsp: 2.307 ± 1.005
2.142AsnGlu: 2.142 ± 0.096
2.307AsnPhe: 2.307 ± 0.46
1.318AsnGly: 1.318 ± 0.62
0.824AsnHis: 0.824 ± 0.221
3.625AsnIle: 3.625 ± 0.173
3.46AsnLys: 3.46 ± 1.122
4.943AsnLeu: 4.943 ± 0.818
0.494AsnMet: 0.494 ± 0.269
1.648AsnAsn: 1.648 ± 0.413
1.812AsnPro: 1.812 ± 0.482
1.318AsnGln: 1.318 ± 0.611
3.131AsnArg: 3.131 ± 0.9
5.602AsnSer: 5.602 ± 0.733
2.307AsnThr: 2.307 ± 0.576
4.778AsnVal: 4.778 ± 0.176
0.824AsnTrp: 0.824 ± 0.449
0.824AsnTyr: 0.824 ± 0.449
0.0AsnXaa: 0.0 ± 0.0
Pro
1.318ProAla: 1.318 ± 0.283
0.659ProCys: 0.659 ± 0.474
1.648ProAsp: 1.648 ± 0.443
3.295ProGlu: 3.295 ± 1.144
1.977ProPhe: 1.977 ± 0.826
2.142ProGly: 2.142 ± 0.65
0.33ProHis: 0.33 ± 0.131
1.977ProIle: 1.977 ± 0.467
2.307ProLys: 2.307 ± 1.168
2.307ProLeu: 2.307 ± 0.697
0.33ProMet: 0.33 ± 0.18
0.989ProAsn: 0.989 ± 0.211
0.659ProPro: 0.659 ± 0.343
0.989ProGln: 0.989 ± 0.211
1.483ProArg: 1.483 ± 0.56
2.801ProSer: 2.801 ± 0.815
2.966ProThr: 2.966 ± 1.178
1.483ProVal: 1.483 ± 0.67
0.494ProTrp: 0.494 ± 0.37
1.318ProTyr: 1.318 ± 0.32
0.0ProXaa: 0.0 ± 0.0
Gln
2.307GlnAla: 2.307 ± 0.914
1.153GlnCys: 1.153 ± 0.348
1.977GlnAsp: 1.977 ± 0.599
2.307GlnGlu: 2.307 ± 0.576
1.483GlnPhe: 1.483 ± 0.572
1.483GlnGly: 1.483 ± 0.128
1.318GlnHis: 1.318 ± 0.526
1.483GlnIle: 1.483 ± 0.557
2.472GlnLys: 2.472 ± 0.806
4.284GlnLeu: 4.284 ± 0.718
1.153GlnMet: 1.153 ± 0.629
1.977GlnAsn: 1.977 ± 0.826
0.659GlnPro: 0.659 ± 0.359
1.977GlnGln: 1.977 ± 0.023
0.989GlnArg: 0.989 ± 0.211
3.131GlnSer: 3.131 ± 0.237
2.307GlnThr: 2.307 ± 0.112
1.483GlnVal: 1.483 ± 0.342
0.0GlnTrp: 0.0 ± 0.0
0.659GlnTyr: 0.659 ± 0.263
0.0GlnXaa: 0.0 ± 0.0
Arg
1.318ArgAla: 1.318 ± 0.16
1.648ArgCys: 1.648 ± 0.178
2.307ArgAsp: 2.307 ± 0.77
3.79ArgGlu: 3.79 ± 0.9
3.131ArgPhe: 3.131 ± 0.326
1.153ArgGly: 1.153 ± 0.673
1.483ArgHis: 1.483 ± 0.359
2.307ArgIle: 2.307 ± 0.112
3.46ArgLys: 3.46 ± 0.957
7.085ArgLeu: 7.085 ± 1.09
1.648ArgMet: 1.648 ± 0.178
2.142ArgAsn: 2.142 ± 0.653
1.483ArgPro: 1.483 ± 0.477
2.801ArgGln: 2.801 ± 0.453
2.801ArgArg: 2.801 ± 0.276
4.614ArgSer: 4.614 ± 0.642
3.625ArgThr: 3.625 ± 0.862
3.79ArgVal: 3.79 ± 0.738
0.33ArgTrp: 0.33 ± 0.358
1.153ArgTyr: 1.153 ± 0.629
0.0ArgXaa: 0.0 ± 0.0
Ser
3.79SerAla: 3.79 ± 0.927
1.812SerCys: 1.812 ± 0.845
5.602SerAsp: 5.602 ± 1.723
7.25SerGlu: 7.25 ± 1.804
3.295SerPhe: 3.295 ± 0.278
6.426SerGly: 6.426 ± 0.833
1.318SerHis: 1.318 ± 0.291
4.778SerIle: 4.778 ± 0.609
6.756SerLys: 6.756 ± 0.738
7.744SerLeu: 7.744 ± 1.085
1.483SerMet: 1.483 ± 0.808
3.79SerAsn: 3.79 ± 0.698
2.142SerPro: 2.142 ± 0.464
3.131SerGln: 3.131 ± 0.383
4.284SerArg: 4.284 ± 0.504
9.722SerSer: 9.722 ± 2.715
6.426SerThr: 6.426 ± 0.8
5.273SerVal: 5.273 ± 1.063
1.648SerTrp: 1.648 ± 0.777
1.977SerTyr: 1.977 ± 0.57
0.0SerXaa: 0.0 ± 0.0
Thr
3.46ThrAla: 3.46 ± 0.696
1.812ThrCys: 1.812 ± 1.189
3.625ThrAsp: 3.625 ± 0.648
6.097ThrGlu: 6.097 ± 0.838
1.483ThrPhe: 1.483 ± 0.611
4.614ThrGly: 4.614 ± 1.28
1.153ThrHis: 1.153 ± 0.226
3.625ThrIle: 3.625 ± 1.155
3.625ThrLys: 3.625 ± 0.648
4.778ThrLeu: 4.778 ± 0.81
0.659ThrMet: 0.659 ± 0.306
2.801ThrAsn: 2.801 ± 0.391
2.801ThrPro: 2.801 ± 0.822
2.142ThrGln: 2.142 ± 1.021
2.801ThrArg: 2.801 ± 0.393
5.767ThrSer: 5.767 ± 1.068
6.097ThrThr: 6.097 ± 2.551
4.449ThrVal: 4.449 ± 0.914
0.659ThrTrp: 0.659 ± 0.522
1.153ThrTyr: 1.153 ± 0.238
0.0ThrXaa: 0.0 ± 0.0
Val
2.307ValAla: 2.307 ± 0.897
1.483ValCys: 1.483 ± 0.714
3.625ValAsp: 3.625 ± 0.419
4.284ValGlu: 4.284 ± 0.399
2.307ValPhe: 2.307 ± 0.661
3.625ValGly: 3.625 ± 0.435
0.494ValHis: 0.494 ± 0.106
2.472ValIle: 2.472 ± 0.308
6.591ValLys: 6.591 ± 0.205
6.92ValLeu: 6.92 ± 0.716
0.989ValMet: 0.989 ± 0.211
3.625ValAsn: 3.625 ± 0.319
2.966ValPro: 2.966 ± 0.255
2.801ValGln: 2.801 ± 1.127
3.46ValArg: 3.46 ± 0.35
6.426ValSer: 6.426 ± 0.565
4.119ValThr: 4.119 ± 0.949
5.437ValVal: 5.437 ± 1.855
0.33ValTrp: 0.33 ± 0.131
1.318ValTyr: 1.318 ± 0.32
0.0ValXaa: 0.0 ± 0.0
Trp
0.165TrpAla: 0.165 ± 0.397
0.165TrpCys: 0.165 ± 0.09
0.33TrpAsp: 0.33 ± 0.358
0.824TrpGlu: 0.824 ± 0.221
0.659TrpPhe: 0.659 ± 0.716
1.318TrpGly: 1.318 ± 1.007
0.0TrpHis: 0.0 ± 0.0
0.33TrpIle: 0.33 ± 0.131
1.483TrpLys: 1.483 ± 0.359
1.318TrpLeu: 1.318 ± 0.291
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.494TrpPro: 0.494 ± 0.596
0.0TrpGln: 0.0 ± 0.0
0.824TrpArg: 0.824 ± 0.257
1.812TrpSer: 1.812 ± 0.355
0.494TrpThr: 0.494 ± 0.37
0.494TrpVal: 0.494 ± 0.339
0.165TrpTrp: 0.165 ± 0.09
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.483TyrAla: 1.483 ± 0.808
1.153TyrCys: 1.153 ± 0.847
1.318TyrAsp: 1.318 ± 0.719
1.153TyrGlu: 1.153 ± 0.676
1.153TyrPhe: 1.153 ± 0.348
1.483TyrGly: 1.483 ± 0.56
0.659TyrHis: 0.659 ± 0.146
2.801TyrIle: 2.801 ± 0.575
1.648TyrLys: 1.648 ± 0.572
3.131TyrLeu: 3.131 ± 0.65
0.824TyrMet: 0.824 ± 0.692
1.648TyrAsn: 1.648 ± 0.338
0.33TyrPro: 0.33 ± 0.18
0.989TyrGln: 0.989 ± 0.233
1.153TyrArg: 1.153 ± 0.243
1.812TyrSer: 1.812 ± 0.737
0.989TyrThr: 0.989 ± 0.394
0.494TyrVal: 0.494 ± 0.269
0.33TyrTrp: 0.33 ± 0.358
1.153TyrTyr: 1.153 ± 0.243
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (6070 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski