Amino acid dipeptide frequency for Scophthalmus maximus rhabdovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred in proteins with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
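The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names are hypothetical, sequences are given in one-letter code rather than the three-letter code used in the table, and counting overlapping pairs that never span two proteins is an assumption about how the counts are obtained.

```python
from collections import Counter

# Uniform expectation from the text: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all proteins."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; a pair never spans two proteins
        # (an assumption of this sketch).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"    # would be marked red
    if freq < expected:
        return "under"   # would be marked blue
    return "expected"


# Toy input, not taken from the proteome described here.
toy = ["MKLLA", "LLSA"]
freqs = dipeptide_permille(toy)
labels = {pair: classify(f) for pair, f in freqs.items()}
```

With such a tiny toy input every observed pair is far above 2.5‰; on a real proteome the same comparison separates the over- and underrepresented dipeptides.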

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 5.466‰ corresponds to a fraction of 0.005466 (about 0.55%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.466 ± 2.249
AlaCys: 1.562 ± 0.348
AlaAsp: 4.425 ± 0.798
AlaGlu: 2.863 ± 0.426
AlaPhe: 1.822 ± 0.925
AlaGly: 3.384 ± 0.945
AlaHis: 1.562 ± 0.893
AlaIle: 3.123 ± 0.78
AlaLys: 2.603 ± 0.541
AlaLeu: 5.986 ± 0.846
AlaMet: 2.082 ± 0.723
AlaAsn: 2.603 ± 1.081
AlaPro: 4.164 ± 1.13
AlaGln: 2.343 ± 0.401
AlaArg: 2.863 ± 1.274
AlaSer: 5.206 ± 0.663
AlaThr: 5.206 ± 1.171
AlaVal: 4.164 ± 1.483
AlaTrp: 1.301 ± 0.554
AlaTyr: 2.863 ± 0.766
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.562 ± 0.858
CysCys: 1.041 ± 0.7
CysAsp: 0.0 ± 0.0
CysGlu: 0.781 ± 0.609
CysPhe: 0.26 ± 0.14
CysGly: 0.26 ± 0.343
CysHis: 0.26 ± 0.343
CysIle: 1.562 ± 0.348
CysLys: 1.562 ± 0.929
CysLeu: 2.343 ± 0.634
CysMet: 0.0 ± 0.0
CysAsn: 0.521 ± 0.26
CysPro: 1.822 ± 0.976
CysGln: 0.26 ± 0.14
CysArg: 0.0 ± 0.0
CysSer: 1.041 ± 0.557
CysThr: 1.041 ± 0.561
CysVal: 1.041 ± 0.561
CysTrp: 1.041 ± 0.366
CysTyr: 0.781 ± 0.352
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.863 ± 1.615
AspCys: 0.781 ± 0.381
AspAsp: 2.343 ± 0.674
AspGlu: 4.685 ± 0.474
AspPhe: 1.301 ± 0.36
AspGly: 2.603 ± 0.472
AspHis: 1.562 ± 0.348
AspIle: 1.822 ± 0.518
AspLys: 3.384 ± 0.423
AspLeu: 5.466 ± 1.058
AspMet: 2.082 ± 0.673
AspAsn: 3.384 ± 0.723
AspPro: 2.863 ± 0.546
AspGln: 2.343 ± 1.056
AspArg: 2.863 ± 0.698
AspSer: 5.206 ± 1.262
AspThr: 2.082 ± 0.409
AspVal: 3.384 ± 0.695
AspTrp: 1.041 ± 0.474
AspTyr: 2.603 ± 0.776
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.863 ± 0.874
GluCys: 0.521 ± 0.686
GluAsp: 2.603 ± 0.986
GluGlu: 5.206 ± 1.5
GluPhe: 2.343 ± 0.758
GluGly: 4.164 ± 0.754
GluHis: 1.301 ± 0.371
GluIle: 3.123 ± 0.906
GluLys: 2.863 ± 0.87
GluLeu: 3.644 ± 0.793
GluMet: 2.082 ± 0.881
GluAsn: 2.082 ± 0.566
GluPro: 2.863 ± 0.599
GluGln: 2.863 ± 1.323
GluArg: 4.425 ± 1.152
GluSer: 5.466 ± 1.023
GluThr: 3.384 ± 0.763
GluVal: 5.986 ± 1.093
GluTrp: 1.041 ± 0.343
GluTyr: 3.123 ± 0.612
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.603 ± 0.527
PheCys: 0.521 ± 0.686
PheAsp: 1.562 ± 0.38
PheGlu: 1.822 ± 0.563
PhePhe: 1.822 ± 0.448
PheGly: 1.562 ± 0.689
PheHis: 0.521 ± 0.281
PheIle: 0.781 ± 0.421
PheLys: 3.384 ± 1.046
PheLeu: 3.384 ± 0.522
PheMet: 2.343 ± 0.71
PheAsn: 0.781 ± 0.421
PhePro: 1.822 ± 0.213
PheGln: 1.041 ± 0.34
PheArg: 2.343 ± 0.758
PheSer: 2.343 ± 0.729
PheThr: 1.301 ± 0.44
PheVal: 2.863 ± 1.076
PheTrp: 1.041 ± 0.432
PheTyr: 0.521 ± 0.26
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.863 ± 0.988
GlyCys: 0.521 ± 0.281
GlyAsp: 3.904 ± 0.714
GlyGlu: 1.822 ± 0.635
GlyPhe: 1.822 ± 0.561
GlyGly: 5.206 ± 0.54
GlyHis: 1.041 ± 0.34
GlyIle: 3.904 ± 0.71
GlyLys: 1.822 ± 0.807
GlyLeu: 6.507 ± 1.329
GlyMet: 1.301 ± 0.44
GlyAsn: 2.082 ± 0.772
GlyPro: 2.863 ± 0.489
GlyGln: 2.082 ± 0.565
GlyArg: 3.904 ± 0.866
GlySer: 5.726 ± 1.097
GlyThr: 5.466 ± 0.803
GlyVal: 2.863 ± 1.224
GlyTrp: 2.863 ± 0.458
GlyTyr: 2.082 ± 0.498
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.781 ± 0.609
HisCys: 0.26 ± 0.293
HisAsp: 1.041 ± 0.366
HisGlu: 0.521 ± 0.279
HisPhe: 2.082 ± 0.276
HisGly: 1.562 ± 0.472
HisHis: 0.521 ± 0.443
HisIle: 2.082 ± 0.732
HisLys: 1.562 ± 0.558
HisLeu: 2.343 ± 0.492
HisMet: 1.562 ± 0.615
HisAsn: 0.781 ± 0.421
HisPro: 2.082 ± 0.756
HisGln: 1.562 ± 0.705
HisArg: 0.521 ± 0.281
HisSer: 1.301 ± 0.5
HisThr: 1.041 ± 0.34
HisVal: 1.562 ± 0.38
HisTrp: 0.521 ± 0.281
HisTyr: 0.521 ± 0.279
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.384 ± 0.253
IleCys: 0.521 ± 0.279
IleAsp: 2.863 ± 0.755
IleGlu: 2.603 ± 0.626
IlePhe: 2.082 ± 0.68
IleGly: 3.644 ± 0.691
IleHis: 1.041 ± 0.557
IleIle: 2.343 ± 0.359
IleLys: 4.425 ± 0.918
IleLeu: 4.945 ± 0.532
IleMet: 1.301 ± 0.701
IleAsn: 1.822 ± 0.404
IlePro: 3.904 ± 0.345
IleGln: 1.301 ± 0.36
IleArg: 5.206 ± 0.985
IleSer: 4.164 ± 1.544
IleThr: 4.425 ± 0.837
IleVal: 2.343 ± 0.585
IleTrp: 1.301 ± 0.499
IleTyr: 1.562 ± 0.627
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.822 ± 0.716
LysCys: 1.562 ± 0.624
LysAsp: 3.904 ± 0.617
LysGlu: 4.164 ± 1.102
LysPhe: 2.082 ± 0.78
LysGly: 4.425 ± 0.866
LysHis: 1.562 ± 0.348
LysIle: 3.123 ± 0.838
LysLys: 4.685 ± 0.511
LysLeu: 6.247 ± 1.541
LysMet: 1.562 ± 0.593
LysAsn: 0.781 ± 0.297
LysPro: 2.082 ± 0.716
LysGln: 2.343 ± 0.492
LysArg: 3.904 ± 0.808
LysSer: 3.644 ± 1.288
LysThr: 2.863 ± 0.864
LysVal: 2.603 ± 1.356
LysTrp: 1.822 ± 0.469
LysTyr: 1.562 ± 0.38
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.069 ± 1.542
LeuCys: 0.781 ± 0.421
LeuAsp: 5.206 ± 0.728
LeuGlu: 6.247 ± 1.061
LeuPhe: 2.863 ± 0.637
LeuGly: 7.288 ± 1.013
LeuHis: 2.082 ± 0.634
LeuIle: 7.808 ± 2.36
LeuLys: 4.945 ± 1.233
LeuLeu: 6.507 ± 1.641
LeuMet: 3.904 ± 0.442
LeuAsn: 5.986 ± 1.033
LeuPro: 3.904 ± 1.155
LeuGln: 2.603 ± 0.454
LeuArg: 5.726 ± 0.823
LeuSer: 8.329 ± 2.589
LeuThr: 6.507 ± 1.124
LeuVal: 4.685 ± 1.197
LeuTrp: 0.781 ± 0.297
LeuTyr: 1.562 ± 0.63
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.164 ± 1.048
MetCys: 0.781 ± 0.381
MetAsp: 1.301 ± 0.831
MetGlu: 2.343 ± 0.534
MetPhe: 1.822 ± 0.213
MetGly: 2.082 ± 0.498
MetHis: 0.26 ± 0.32
MetIle: 1.562 ± 0.558
MetLys: 2.343 ± 1.183
MetLeu: 2.343 ± 0.877
MetMet: 1.041 ± 0.283
MetAsn: 0.781 ± 0.447
MetPro: 0.781 ± 0.747
MetGln: 0.781 ± 0.34
MetArg: 2.082 ± 0.602
MetSer: 1.822 ± 0.645
MetThr: 1.822 ± 0.807
MetVal: 2.863 ± 0.855
MetTrp: 0.26 ± 0.343
MetTyr: 1.041 ± 0.7
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.123 ± 0.86
AsnCys: 0.781 ± 0.609
AsnAsp: 1.301 ± 0.701
AsnGlu: 1.562 ± 0.307
AsnPhe: 1.041 ± 0.34
AsnGly: 2.603 ± 1.374
AsnHis: 1.301 ± 0.44
AsnIle: 2.343 ± 0.509
AsnLys: 1.562 ± 0.432
AsnLeu: 6.507 ± 1.286
AsnMet: 0.781 ± 0.426
AsnAsn: 1.562 ± 0.555
AsnPro: 2.863 ± 0.945
AsnGln: 1.301 ± 0.472
AsnArg: 2.863 ± 1.654
AsnSer: 1.562 ± 0.473
AsnThr: 2.343 ± 0.55
AsnVal: 1.301 ± 0.371
AsnTrp: 1.301 ± 0.472
AsnTyr: 2.603 ± 0.685
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.384 ± 0.522
ProCys: 0.26 ± 0.372
ProAsp: 4.164 ± 0.545
ProGlu: 3.384 ± 1.381
ProPhe: 0.521 ± 0.328
ProGly: 2.082 ± 0.822
ProHis: 1.041 ± 0.34
ProIle: 3.644 ± 0.945
ProLys: 1.562 ± 0.595
ProLeu: 6.247 ± 0.98
ProMet: 0.521 ± 0.281
ProAsn: 1.562 ± 0.292
ProPro: 2.603 ± 1.319
ProGln: 0.781 ± 0.6
ProArg: 2.863 ± 0.246
ProSer: 6.507 ± 0.919
ProThr: 2.343 ± 0.987
ProVal: 2.863 ± 0.89
ProTrp: 1.301 ± 0.538
ProTyr: 2.082 ± 0.644
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.343 ± 0.715
GlnCys: 1.041 ± 0.561
GlnAsp: 1.822 ± 0.871
GlnGlu: 1.301 ± 0.543
GlnPhe: 2.082 ± 0.458
GlnGly: 1.822 ± 0.448
GlnHis: 0.521 ± 0.279
GlnIle: 1.562 ± 0.498
GlnLys: 0.781 ± 0.297
GlnLeu: 2.863 ± 0.815
GlnMet: 1.822 ± 1.119
GlnAsn: 2.343 ± 0.55
GlnPro: 1.301 ± 0.44
GlnGln: 0.521 ± 0.271
GlnArg: 2.863 ± 0.367
GlnSer: 3.123 ± 0.85
GlnThr: 2.343 ± 0.601
GlnVal: 3.123 ± 0.609
GlnTrp: 0.781 ± 0.421
GlnTyr: 0.521 ± 0.26
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.644 ± 1.46
ArgCys: 0.781 ± 0.421
ArgAsp: 3.644 ± 1.009
ArgGlu: 3.644 ± 1.524
ArgPhe: 3.384 ± 0.725
ArgGly: 3.644 ± 0.797
ArgHis: 2.082 ± 0.815
ArgIle: 3.904 ± 0.87
ArgLys: 3.644 ± 0.854
ArgLeu: 5.466 ± 0.845
ArgMet: 1.562 ± 0.83
ArgAsn: 2.603 ± 0.509
ArgPro: 2.863 ± 0.246
ArgGln: 2.343 ± 0.67
ArgArg: 3.904 ± 1.824
ArgSer: 4.425 ± 0.551
ArgThr: 4.945 ± 1.534
ArgVal: 1.822 ± 0.524
ArgTrp: 1.301 ± 0.548
ArgTyr: 1.301 ± 0.692
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.028 ± 1.041
SerCys: 1.822 ± 0.448
SerAsp: 3.644 ± 0.532
SerGlu: 7.548 ± 0.612
SerPhe: 2.082 ± 0.547
SerGly: 4.945 ± 0.576
SerHis: 2.343 ± 0.647
SerIle: 3.644 ± 0.846
SerLys: 4.425 ± 0.701
SerLeu: 7.808 ± 1.422
SerMet: 1.822 ± 0.605
SerAsn: 2.863 ± 0.402
SerPro: 3.384 ± 0.843
SerGln: 3.644 ± 1.231
SerArg: 5.986 ± 0.698
SerSer: 7.548 ± 1.506
SerThr: 2.863 ± 0.591
SerVal: 3.904 ± 0.419
SerTrp: 2.082 ± 0.851
SerTyr: 2.343 ± 0.788
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.384 ± 1.566
ThrCys: 2.082 ± 0.357
ThrAsp: 4.425 ± 0.837
ThrGlu: 3.384 ± 0.577
ThrPhe: 3.123 ± 0.771
ThrGly: 3.123 ± 0.705
ThrHis: 1.301 ± 0.548
ThrIle: 3.644 ± 1.009
ThrLys: 3.123 ± 0.415
ThrLeu: 4.685 ± 1.102
ThrMet: 0.781 ± 0.29
ThrAsn: 3.384 ± 1.177
ThrPro: 2.603 ± 0.527
ThrGln: 2.343 ± 0.837
ThrArg: 3.123 ± 0.969
ThrSer: 4.945 ± 1.231
ThrThr: 4.685 ± 1.454
ThrVal: 4.945 ± 0.965
ThrTrp: 0.781 ± 0.421
ThrTyr: 1.822 ± 0.788
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.644 ± 1.155
ValCys: 1.301 ± 0.942
ValAsp: 4.425 ± 0.91
ValGlu: 4.425 ± 1.631
ValPhe: 0.521 ± 0.328
ValGly: 2.863 ± 1.543
ValHis: 2.343 ± 1.056
ValIle: 2.343 ± 0.599
ValLys: 4.425 ± 1.57
ValLeu: 4.945 ± 1.028
ValMet: 3.123 ± 0.646
ValAsn: 2.863 ± 1.301
ValPro: 3.123 ± 1.116
ValGln: 2.603 ± 0.506
ValArg: 3.384 ± 0.957
ValSer: 3.384 ± 0.704
ValThr: 4.685 ± 0.693
ValVal: 2.603 ± 1.062
ValTrp: 0.521 ± 0.279
ValTyr: 1.301 ± 0.44
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.521 ± 0.443
TrpCys: 0.0 ± 0.0
TrpAsp: 0.781 ± 0.421
TrpGlu: 1.562 ± 0.623
TrpPhe: 0.26 ± 0.14
TrpGly: 1.822 ± 0.574
TrpHis: 0.26 ± 0.14
TrpIle: 2.082 ± 0.276
TrpLys: 1.301 ± 0.36
TrpLeu: 2.343 ± 1.415
TrpMet: 1.562 ± 0.466
TrpAsn: 0.521 ± 0.281
TrpPro: 0.521 ± 0.281
TrpGln: 1.041 ± 0.404
TrpArg: 1.041 ± 0.366
TrpSer: 2.863 ± 0.995
TrpThr: 1.041 ± 0.514
TrpVal: 1.822 ± 0.807
TrpTrp: 0.521 ± 0.507
TrpTyr: 0.26 ± 0.343
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.082 ± 0.699
TyrCys: 0.26 ± 0.14
TyrAsp: 1.562 ± 0.437
TyrGlu: 2.082 ± 1.85
TyrPhe: 1.041 ± 0.432
TyrGly: 1.562 ± 0.558
TyrHis: 1.301 ± 0.456
TyrIle: 0.781 ± 0.344
TyrLys: 2.343 ± 0.211
TyrLeu: 5.206 ± 1.214
TyrMet: 0.781 ± 0.277
TyrAsn: 1.301 ± 0.615
TyrPro: 1.301 ± 0.319
TyrGln: 0.781 ± 0.646
TyrArg: 1.041 ± 0.64
TyrSer: 3.123 ± 0.54
TyrThr: 1.301 ± 0.472
TyrVal: 2.082 ± 0.552
TyrTrp: 0.521 ± 0.507
TyrTyr: 0.521 ± 0.281
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3843 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level, i.e. by resampling whole proteins rather than individual residues.
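A protein-level bootstrap of this kind might look roughly like the sketch below. It is an assumption-laden illustration, not the database's actual procedure: the function names are hypothetical, sequences are in one-letter code, and using the sample standard deviation of the resampled estimates as the reported error is an assumption, since the note does not say which spread measure is used.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over a set of proteins."""
    count = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency under protein-level resampling."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy proteins, not taken from the proteome described here.
toy_proteins = ["MKLLA", "LLSA", "AAAA"]
err_ll = bootstrap_error(toy_proteins, "LL")
err_ww = bootstrap_error(toy_proteins, "WW")  # pair absent from every protein
```

A dipeptide that never occurs gets a frequency of 0 in every resample, so its estimated error is 0 as well, which is consistent with the "0.0 ± 0.0" entries in the table.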

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski