Amino acid dipeptide frequency for Drosophila ananassae sigmavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility dipeptides that are overrepresented are marked in red and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
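The calculation described above can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_per_mille`, the one-letter alphabet, the choice to count overlapping pairs within each protein only, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is folded into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of protein sequences.

    Assumption: overlapping pairs are counted within each protein and never
    across protein boundaries; the denominator is the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those that never occur.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Uniform expectation over the 400 standard dipeptides: 2.5 per mille = 0.25 %.
UNIFORM_EXPECTATION = 1000.0 / 400

freqs = dipeptide_per_mille(["MKLLA", "AKLL"])  # toy sequences, not real data
print(freqs["LL"], freqs["LL"] > UNIFORM_EXPECTATION)
```

Multiplying any returned value by 10^-3 gives the plain fraction, and comparing it with `UNIFORM_EXPECTATION` mirrors the over/under-representation idea described in the text.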

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Dipeptide: frequency (‰) ± error
Ala
AlaAla: 1.487 ± 0.876
AlaCys: 0.248 ± 0.305
AlaAsp: 2.727 ± 0.946
AlaGlu: 1.983 ± 0.387
AlaPhe: 0.496 ± 0.304
AlaGly: 1.487 ± 0.614
AlaHis: 0.992 ± 0.594
AlaIle: 1.983 ± 1.071
AlaLys: 2.479 ± 0.584
AlaLeu: 4.71 ± 1.709
AlaMet: 1.983 ± 0.517
AlaAsn: 2.727 ± 0.884
AlaPro: 1.487 ± 0.656
AlaGln: 1.735 ± 0.556
AlaArg: 2.231 ± 0.503
AlaSer: 1.735 ± 0.573
AlaThr: 2.727 ± 1.109
AlaVal: 1.487 ± 0.545
AlaTrp: 1.487 ± 1.175
AlaTyr: 0.992 ± 0.36
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.992 ± 0.424
CysCys: 0.248 ± 0.152
CysAsp: 0.992 ± 0.543
CysGlu: 1.239 ± 0.845
CysPhe: 0.496 ± 0.271
CysGly: 0.496 ± 0.271
CysHis: 0.248 ± 0.305
CysIle: 1.983 ± 0.971
CysLys: 1.239 ± 0.331
CysLeu: 1.735 ± 0.777
CysMet: 0.248 ± 0.152
CysAsn: 1.487 ± 0.635
CysPro: 0.744 ± 0.35
CysGln: 0.496 ± 0.498
CysArg: 0.496 ± 0.485
CysSer: 0.992 ± 0.543
CysThr: 0.496 ± 0.572
CysVal: 0.992 ± 0.447
CysTrp: 0.248 ± 0.152
CysTyr: 0.248 ± 0.152
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.496 ± 0.278
AspCys: 1.239 ± 0.69
AspAsp: 2.975 ± 0.645
AspGlu: 3.718 ± 0.954
AspPhe: 2.479 ± 0.578
AspGly: 3.223 ± 0.854
AspHis: 1.735 ± 0.563
AspIle: 3.966 ± 0.905
AspLys: 4.462 ± 0.891
AspLeu: 5.949 ± 0.915
AspMet: 1.239 ± 0.548
AspAsn: 3.223 ± 1.121
AspPro: 4.462 ± 1.157
AspGln: 1.487 ± 0.45
AspArg: 3.223 ± 0.869
AspSer: 6.197 ± 1.143
AspThr: 1.487 ± 0.684
AspVal: 1.983 ± 0.462
AspTrp: 1.487 ± 0.699
AspTyr: 2.727 ± 0.747
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.223 ± 1.201
GluCys: 1.239 ± 0.621
GluAsp: 3.966 ± 0.951
GluGlu: 5.949 ± 2.893
GluPhe: 2.479 ± 0.553
GluGly: 4.71 ± 0.954
GluHis: 0.744 ± 0.578
GluIle: 6.197 ± 1.456
GluLys: 2.479 ± 0.848
GluLeu: 5.206 ± 1.342
GluMet: 3.223 ± 0.655
GluAsn: 2.231 ± 0.758
GluPro: 2.231 ± 0.719
GluGln: 1.487 ± 0.426
GluArg: 3.471 ± 1.386
GluSer: 4.462 ± 1.612
GluThr: 2.479 ± 0.608
GluVal: 3.471 ± 0.608
GluTrp: 0.248 ± 0.345
GluTyr: 1.487 ± 0.483
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.487 ± 0.454
PheCys: 0.0 ± 0.0
PheAsp: 2.231 ± 0.813
PheGlu: 1.735 ± 0.551
PhePhe: 1.487 ± 0.65
PheGly: 2.231 ± 0.792
PheHis: 1.487 ± 0.912
PheIle: 2.231 ± 0.609
PheLys: 3.966 ± 0.808
PheLeu: 3.471 ± 0.949
PheMet: 0.744 ± 0.491
PheAsn: 3.223 ± 0.901
PhePro: 2.231 ± 0.688
PheGln: 2.231 ± 1.116
PheArg: 2.479 ± 0.637
PheSer: 2.727 ± 0.43
PheThr: 1.735 ± 0.544
PheVal: 1.983 ± 0.742
PheTrp: 0.496 ± 0.304
PheTyr: 0.744 ± 0.567
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.239 ± 0.545
GlyCys: 0.496 ± 0.304
GlyAsp: 3.471 ± 0.882
GlyGlu: 2.975 ± 1.855
GlyPhe: 1.983 ± 0.833
GlyGly: 2.479 ± 0.663
GlyHis: 1.983 ± 1.073
GlyIle: 2.975 ± 0.833
GlyLys: 3.471 ± 1.534
GlyLeu: 7.933 ± 1.523
GlyMet: 1.239 ± 0.558
GlyAsn: 3.471 ± 1.027
GlyPro: 1.983 ± 0.41
GlyGln: 1.983 ± 0.613
GlyArg: 2.479 ± 0.458
GlySer: 5.454 ± 0.627
GlyThr: 4.958 ± 0.96
GlyVal: 3.471 ± 0.876
GlyTrp: 0.992 ± 0.417
GlyTyr: 1.735 ± 0.542
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.496 ± 0.278
HisCys: 0.248 ± 0.152
HisAsp: 0.992 ± 0.589
HisGlu: 1.487 ± 0.699
HisPhe: 1.735 ± 0.751
HisGly: 0.248 ± 0.305
HisHis: 0.744 ± 0.307
HisIle: 2.479 ± 0.578
HisLys: 1.983 ± 0.922
HisLeu: 2.975 ± 0.764
HisMet: 0.496 ± 0.289
HisAsn: 0.992 ± 0.417
HisPro: 1.983 ± 0.609
HisGln: 2.231 ± 0.731
HisArg: 0.744 ± 0.348
HisSer: 2.231 ± 1.078
HisThr: 1.239 ± 0.38
HisVal: 1.487 ± 0.984
HisTrp: 0.248 ± 0.152
HisTyr: 0.496 ± 0.289
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.223 ± 1.572
IleCys: 1.735 ± 0.544
IleAsp: 4.958 ± 1.034
IleGlu: 4.462 ± 0.474
IlePhe: 2.479 ± 0.608
IleGly: 5.949 ± 0.847
IleHis: 2.975 ± 0.791
IleIle: 3.471 ± 0.87
IleLys: 5.454 ± 0.81
IleLeu: 6.693 ± 0.973
IleMet: 2.727 ± 0.888
IleAsn: 3.223 ± 0.68
IlePro: 4.71 ± 0.978
IleGln: 1.983 ± 0.858
IleArg: 3.471 ± 1.028
IleSer: 5.206 ± 1.363
IleThr: 4.214 ± 0.799
IleVal: 2.727 ± 1.182
IleTrp: 0.992 ± 0.765
IleTyr: 3.471 ± 1.019
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.735 ± 0.927
LysCys: 1.239 ± 0.484
LysAsp: 3.471 ± 1.06
LysGlu: 4.214 ± 1.304
LysPhe: 3.223 ± 0.894
LysGly: 2.479 ± 0.401
LysHis: 0.992 ± 0.424
LysIle: 4.214 ± 1.524
LysLys: 4.214 ± 1.783
LysLeu: 6.445 ± 0.53
LysMet: 2.975 ± 0.595
LysAsn: 1.239 ± 0.504
LysPro: 3.223 ± 1.059
LysGln: 0.992 ± 0.464
LysArg: 3.471 ± 0.688
LysSer: 5.454 ± 2.193
LysThr: 4.958 ± 1.311
LysVal: 5.949 ± 1.079
LysTrp: 0.992 ± 0.608
LysTyr: 3.966 ± 0.977
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.206 ± 1.329
LeuCys: 1.983 ± 0.833
LeuAsp: 5.454 ± 1.343
LeuGlu: 3.471 ± 1.31
LeuPhe: 3.966 ± 1.074
LeuGly: 6.197 ± 1.032
LeuHis: 1.983 ± 0.638
LeuIle: 8.428 ± 1.132
LeuLys: 5.949 ± 0.976
LeuLeu: 7.685 ± 1.911
LeuMet: 4.958 ± 0.856
LeuAsn: 5.206 ± 0.797
LeuPro: 2.727 ± 0.932
LeuGln: 0.992 ± 0.36
LeuArg: 7.189 ± 1.192
LeuSer: 6.941 ± 1.833
LeuThr: 8.676 ± 2.297
LeuVal: 4.462 ± 0.978
LeuTrp: 1.239 ± 0.374
LeuTyr: 1.983 ± 0.402
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.744 ± 0.456
MetCys: 0.0 ± 0.0
MetAsp: 2.231 ± 0.735
MetGlu: 3.471 ± 0.681
MetPhe: 0.992 ± 0.763
MetGly: 1.735 ± 0.542
MetHis: 0.248 ± 0.152
MetIle: 2.231 ± 0.56
MetLys: 3.966 ± 0.813
MetLeu: 2.479 ± 0.458
MetMet: 1.487 ± 0.434
MetAsn: 1.983 ± 0.58
MetPro: 0.496 ± 0.572
MetGln: 1.487 ± 0.54
MetArg: 1.239 ± 0.548
MetSer: 4.71 ± 0.699
MetThr: 0.992 ± 0.776
MetVal: 0.992 ± 0.415
MetTrp: 0.496 ± 0.271
MetTyr: 0.992 ± 0.995
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.727 ± 0.628
AsnCys: 0.744 ± 0.456
AsnAsp: 2.727 ± 0.863
AsnGlu: 2.727 ± 0.942
AsnPhe: 1.239 ± 0.592
AsnGly: 2.231 ± 1.144
AsnHis: 1.735 ± 0.926
AsnIle: 3.471 ± 0.99
AsnLys: 1.735 ± 0.573
AsnLeu: 9.172 ± 2.804
AsnMet: 0.992 ± 0.388
AsnAsn: 2.479 ± 1.264
AsnPro: 2.727 ± 0.64
AsnGln: 1.983 ± 0.825
AsnArg: 1.735 ± 0.451
AsnSer: 4.71 ± 1.168
AsnThr: 2.479 ± 0.691
AsnVal: 2.231 ± 0.688
AsnTrp: 0.992 ± 0.424
AsnTyr: 2.231 ± 0.885
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.487 ± 0.268
ProCys: 0.496 ± 0.304
ProAsp: 2.727 ± 1.518
ProGlu: 3.471 ± 1.15
ProPhe: 1.487 ± 0.382
ProGly: 3.223 ± 1.585
ProHis: 0.992 ± 0.656
ProIle: 5.206 ± 0.915
ProLys: 2.231 ± 1.698
ProLeu: 3.223 ± 0.972
ProMet: 1.735 ± 0.293
ProAsn: 1.983 ± 0.841
ProPro: 1.735 ± 0.921
ProGln: 1.487 ± 0.34
ProArg: 3.223 ± 0.679
ProSer: 4.958 ± 1.022
ProThr: 2.479 ± 0.634
ProVal: 3.223 ± 1.059
ProTrp: 0.496 ± 0.304
ProTyr: 1.487 ± 0.601
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.744 ± 0.372
GlnCys: 0.496 ± 0.278
GlnAsp: 1.735 ± 1.271
GlnGlu: 1.487 ± 0.45
GlnPhe: 2.479 ± 0.799
GlnGly: 1.983 ± 0.552
GlnHis: 0.0 ± 0.0
GlnIle: 1.487 ± 1.06
GlnLys: 1.983 ± 0.826
GlnLeu: 1.239 ± 0.572
GlnMet: 1.735 ± 0.92
GlnAsn: 2.479 ± 0.607
GlnPro: 1.239 ± 1.133
GlnGln: 0.992 ± 0.653
GlnArg: 1.239 ± 0.423
GlnSer: 4.462 ± 1.739
GlnThr: 1.239 ± 0.701
GlnVal: 2.975 ± 0.698
GlnTrp: 0.496 ± 0.271
GlnTyr: 1.239 ± 0.374
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.223 ± 1.071
ArgCys: 1.239 ± 0.423
ArgAsp: 3.718 ± 1.194
ArgGlu: 3.471 ± 0.783
ArgPhe: 3.223 ± 1.047
ArgGly: 3.718 ± 1.22
ArgHis: 1.983 ± 0.643
ArgIle: 2.975 ± 0.639
ArgLys: 2.727 ± 1.137
ArgLeu: 3.471 ± 0.561
ArgMet: 1.487 ± 0.941
ArgAsn: 3.966 ± 0.834
ArgPro: 1.487 ± 1.34
ArgGln: 2.727 ± 1.105
ArgArg: 3.471 ± 1.225
ArgSer: 2.479 ± 0.531
ArgThr: 5.206 ± 1.081
ArgVal: 1.983 ± 0.969
ArgTrp: 1.239 ± 0.519
ArgTyr: 0.992 ± 0.36
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.223 ± 0.82
SerCys: 1.487 ± 0.763
SerAsp: 5.454 ± 1.309
SerGlu: 7.437 ± 2.144
SerPhe: 2.727 ± 0.963
SerGly: 4.71 ± 0.999
SerHis: 1.487 ± 0.635
SerIle: 7.437 ± 1.617
SerLys: 5.702 ± 1.23
SerLeu: 8.924 ± 1.365
SerMet: 0.992 ± 0.38
SerAsn: 3.471 ± 0.633
SerPro: 4.71 ± 1.762
SerGln: 2.479 ± 0.401
SerArg: 5.949 ± 0.936
SerSer: 9.42 ± 1.655
SerThr: 4.462 ± 0.983
SerVal: 5.206 ± 1.485
SerTrp: 0.992 ± 0.483
SerTyr: 2.231 ± 0.517
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.487 ± 1.005
ThrCys: 1.487 ± 0.981
ThrAsp: 2.479 ± 0.57
ThrGlu: 3.718 ± 0.999
ThrPhe: 1.487 ± 0.677
ThrGly: 2.727 ± 0.625
ThrHis: 1.239 ± 0.453
ThrIle: 5.454 ± 1.436
ThrLys: 5.206 ± 2.354
ThrLeu: 5.702 ± 1.34
ThrMet: 2.479 ± 0.697
ThrAsn: 3.223 ± 1.095
ThrPro: 2.231 ± 0.746
ThrGln: 1.983 ± 1.383
ThrArg: 2.727 ± 0.819
ThrSer: 6.445 ± 1.367
ThrThr: 2.727 ± 0.644
ThrVal: 3.966 ± 1.005
ThrTrp: 1.487 ± 0.634
ThrTyr: 2.727 ± 0.902
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.239 ± 0.568
ValCys: 0.496 ± 0.304
ValAsp: 3.223 ± 0.73
ValGlu: 1.983 ± 0.911
ValPhe: 2.231 ± 0.507
ValGly: 2.975 ± 0.904
ValHis: 1.487 ± 0.588
ValIle: 4.214 ± 0.947
ValLys: 2.727 ± 0.863
ValLeu: 3.471 ± 0.67
ValMet: 1.239 ± 0.598
ValAsn: 2.727 ± 0.477
ValPro: 4.462 ± 0.745
ValGln: 1.735 ± 0.561
ValArg: 3.471 ± 1.033
ValSer: 4.214 ± 1.279
ValThr: 4.958 ± 1.257
ValVal: 2.231 ± 0.695
ValTrp: 0.992 ± 0.417
ValTyr: 4.214 ± 1.692
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.744 ± 0.675
TrpCys: 0.248 ± 0.152
TrpAsp: 0.248 ± 0.152
TrpGlu: 1.239 ± 0.541
TrpPhe: 0.744 ± 0.567
TrpGly: 1.487 ± 0.912
TrpHis: 0.248 ± 0.152
TrpIle: 1.735 ± 0.586
TrpLys: 0.496 ± 0.278
TrpLeu: 0.248 ± 0.396
TrpMet: 0.248 ± 0.305
TrpAsn: 0.496 ± 0.429
TrpPro: 0.496 ± 0.304
TrpGln: 0.248 ± 0.152
TrpArg: 0.992 ± 0.417
TrpSer: 2.727 ± 0.781
TrpThr: 1.735 ± 0.745
TrpVal: 1.239 ± 1.085
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.496 ± 0.304
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.983 ± 0.801
TyrCys: 0.744 ± 0.815
TyrAsp: 1.983 ± 0.459
TyrGlu: 0.744 ± 0.317
TyrPhe: 1.487 ± 0.563
TyrGly: 2.479 ± 0.415
TyrHis: 2.231 ± 0.393
TyrIle: 1.983 ± 0.584
TyrLys: 2.975 ± 0.766
TyrLeu: 3.966 ± 1.072
TyrMet: 0.496 ± 0.278
TyrAsn: 1.239 ± 0.57
TyrPro: 1.983 ± 0.786
TyrGln: 0.992 ± 0.483
TyrArg: 1.487 ± 0.601
TyrSer: 3.223 ± 1.163
TyrThr: 1.735 ± 0.928
TyrVal: 2.231 ± 0.485
TyrTrp: 0.496 ± 0.485
TyrTyr: 0.248 ± 0.152
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4035 amino acids)
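Each cell of the table carries an entry of the form `AlaAla: 1.487 ± 0.876`. The sketch below shows one way such entries could be parsed and compared with the uniform expectation of 2.5‰ (0.25%). The names `parse_cell` and `classify` are hypothetical, and using the uniform value as the cut-off for red/blue marking is an assumption drawn from the introductory text, not a documented rule of the database.

```python
import re

# Matches e.g. "AlaAla: 1.487 ± 0.876"; any text before the dipeptide name is ignored.
CELL = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)")


def parse_cell(line):
    """Return (first, second, value, error) or None if the line is not a table cell."""
    match = CELL.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


def classify(value, expected=2.5):
    """Label a per-mille frequency relative to an assumed expectation (default 2.5)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Two entries copied from the table above.
for line in ["GlyLeu: 7.933 ± 1.523", "AlaCys: 0.248 ± 0.305"]:
    first, second, value, error = parse_cell(line)
    print(first + second, value, error, classify(value))
```

Lines that only hold a row label (for example `Ala`) contain no colon-separated value, so `parse_cell` returns `None` for them and they can be skipped.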

Note: The error has been estimated by bootstrapping (×100) at the protein level.
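A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under assumptions: the helper names are hypothetical, the reported error is taken to be the sample standard deviation across replicates, and the toy sequences are not real data. The source only states that 100 resamples were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def dipeptide_frequency(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter code) over a set of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread of the estimate.

    Assumption: the error is the sample standard deviation over the replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


toy_proteome = ["MKLLA", "AKLL", "GGSLL"]  # toy sequences, not real data
print(bootstrap_error(toy_proteome, "LL"))
```

Because whole proteins are resampled rather than individual residues, a proteome with few proteins (6 here) can yield comparatively large error estimates.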

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski