Amino acid dipepetide frequency for Snakehead retrovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.295AlaAla: 4.295 ± 1.798
0.716AlaCys: 0.716 ± 0.213
5.727AlaAsp: 5.727 ± 2.047
5.965AlaGlu: 5.965 ± 2.068
1.67AlaPhe: 1.67 ± 0.513
3.34AlaGly: 3.34 ± 0.887
0.716AlaHis: 0.716 ± 0.213
2.863AlaIle: 2.863 ± 0.733
3.579AlaLys: 3.579 ± 1.306
6.442AlaLeu: 6.442 ± 0.701
2.147AlaMet: 2.147 ± 0.344
0.477AlaAsn: 0.477 ± 0.151
3.34AlaPro: 3.34 ± 0.934
1.193AlaGln: 1.193 ± 0.336
3.34AlaArg: 3.34 ± 0.887
4.295AlaSer: 4.295 ± 0.923
4.056AlaThr: 4.056 ± 0.428
3.579AlaVal: 3.579 ± 0.595
2.863AlaTrp: 2.863 ± 0.83
1.193AlaTyr: 1.193 ± 0.23
0.0AlaXaa: 0.0 ± 0.0
Cys
1.432CysAla: 1.432 ± 0.324
0.0CysCys: 0.0 ± 0.0
0.477CysAsp: 0.477 ± 0.381
0.239CysGlu: 0.239 ± 0.153
0.477CysPhe: 0.477 ± 0.435
1.67CysGly: 1.67 ± 0.816
0.239CysHis: 0.239 ± 0.218
0.239CysIle: 0.239 ± 0.153
2.147CysLys: 2.147 ± 0.518
1.432CysLeu: 1.432 ± 0.556
0.716CysMet: 0.716 ± 0.342
0.954CysAsn: 0.954 ± 0.339
0.954CysPro: 0.954 ± 0.303
2.147CysGln: 2.147 ± 0.363
1.67CysArg: 1.67 ± 0.795
1.67CysSer: 1.67 ± 0.628
1.432CysThr: 1.432 ± 0.454
0.477CysVal: 0.477 ± 0.151
0.954CysTrp: 0.954 ± 0.443
0.239CysTyr: 0.239 ± 0.218
0.0CysXaa: 0.0 ± 0.0
Asp
1.67AspAla: 1.67 ± 0.513
1.909AspCys: 1.909 ± 0.82
1.193AspAsp: 1.193 ± 0.545
3.579AspGlu: 3.579 ± 0.73
1.193AspPhe: 1.193 ± 0.639
2.863AspGly: 2.863 ± 0.476
2.147AspHis: 2.147 ± 0.662
2.386AspIle: 2.386 ± 0.67
3.818AspLys: 3.818 ± 0.809
3.579AspLeu: 3.579 ± 0.761
1.432AspMet: 1.432 ± 0.426
3.34AspAsn: 3.34 ± 1.256
3.102AspPro: 3.102 ± 0.721
2.625AspGln: 2.625 ± 0.79
1.909AspArg: 1.909 ± 0.304
3.102AspSer: 3.102 ± 0.721
2.863AspThr: 2.863 ± 0.615
1.67AspVal: 1.67 ± 0.399
1.193AspTrp: 1.193 ± 0.56
2.147AspTyr: 2.147 ± 1.015
0.0AspXaa: 0.0 ± 0.0
Glu
5.249GluAla: 5.249 ± 0.663
0.954GluCys: 0.954 ± 0.339
3.102GluAsp: 3.102 ± 0.571
5.965GluGlu: 5.965 ± 1.252
0.954GluPhe: 0.954 ± 0.134
5.965GluGly: 5.965 ± 1.633
1.909GluHis: 1.909 ± 0.824
4.056GluIle: 4.056 ± 1.033
6.442GluLys: 6.442 ± 2.379
6.681GluLeu: 6.681 ± 2.07
0.716GluMet: 0.716 ± 0.173
1.432GluAsn: 1.432 ± 0.18
3.34GluPro: 3.34 ± 1.116
4.056GluGln: 4.056 ± 0.988
1.67GluArg: 1.67 ± 0.523
2.625GluSer: 2.625 ± 1.029
6.204GluThr: 6.204 ± 0.628
2.863GluVal: 2.863 ± 0.649
2.147GluTrp: 2.147 ± 0.734
1.193GluTyr: 1.193 ± 0.524
0.0GluXaa: 0.0 ± 0.0
Phe
2.147PheAla: 2.147 ± 0.304
1.432PheCys: 1.432 ± 0.324
0.477PheAsp: 0.477 ± 0.306
0.954PheGlu: 0.954 ± 0.553
0.0PhePhe: 0.0 ± 0.0
0.954PheGly: 0.954 ± 0.134
1.193PheHis: 1.193 ± 0.23
1.67PheIle: 1.67 ± 0.742
0.716PheLys: 0.716 ± 0.388
2.147PheLeu: 2.147 ± 0.571
0.0PheMet: 0.0 ± 0.0
0.954PheAsn: 0.954 ± 0.553
0.239PhePro: 0.239 ± 0.153
1.193PheGln: 1.193 ± 0.529
1.432PheArg: 1.432 ± 0.409
1.67PheSer: 1.67 ± 0.46
2.147PheThr: 2.147 ± 0.354
3.102PheVal: 3.102 ± 0.689
0.477PheTrp: 0.477 ± 0.306
0.239PheTyr: 0.239 ± 0.153
0.0PheXaa: 0.0 ± 0.0
Gly
3.818GlyAla: 3.818 ± 0.83
1.193GlyCys: 1.193 ± 0.336
3.818GlyAsp: 3.818 ± 0.837
3.34GlyGlu: 3.34 ± 1.657
1.909GlyPhe: 1.909 ± 0.606
7.874GlyGly: 7.874 ± 1.428
3.818GlyHis: 3.818 ± 0.809
2.863GlyIle: 2.863 ± 0.968
4.534GlyLys: 4.534 ± 1.389
8.113GlyLeu: 8.113 ± 0.732
2.147GlyMet: 2.147 ± 0.628
3.818GlyAsn: 3.818 ± 0.568
7.874GlyPro: 7.874 ± 1.402
3.818GlyGln: 3.818 ± 0.508
4.295GlyArg: 4.295 ± 1.351
2.625GlySer: 2.625 ± 0.396
5.249GlyThr: 5.249 ± 1.154
4.772GlyVal: 4.772 ± 1.37
3.102GlyTrp: 3.102 ± 0.245
2.147GlyTyr: 2.147 ± 0.661
0.0GlyXaa: 0.0 ± 0.0
His
0.716HisAla: 0.716 ± 0.213
0.716HisCys: 0.716 ± 0.459
2.625HisAsp: 2.625 ± 0.64
0.954HisGlu: 0.954 ± 0.33
0.477HisPhe: 0.477 ± 0.306
2.147HisGly: 2.147 ± 1.143
1.193HisHis: 1.193 ± 0.336
0.716HisIle: 0.716 ± 0.213
0.954HisLys: 0.954 ± 0.443
2.386HisLeu: 2.386 ± 0.474
0.716HisMet: 0.716 ± 0.173
0.716HisAsn: 0.716 ± 0.653
2.147HisPro: 2.147 ± 0.764
1.432HisGln: 1.432 ± 0.454
2.625HisArg: 2.625 ± 0.567
1.67HisSer: 1.67 ± 0.349
1.193HisThr: 1.193 ± 0.668
2.863HisVal: 2.863 ± 0.902
0.716HisTrp: 0.716 ± 0.389
0.716HisTyr: 0.716 ± 0.389
0.0HisXaa: 0.0 ± 0.0
Ile
3.34IleAla: 3.34 ± 0.481
0.716IleCys: 0.716 ± 0.213
2.386IleAsp: 2.386 ± 0.655
2.386IleGlu: 2.386 ± 0.67
1.432IlePhe: 1.432 ± 0.57
2.147IleGly: 2.147 ± 0.462
1.193IleHis: 1.193 ± 0.768
0.239IleIle: 0.239 ± 0.153
3.579IleLys: 3.579 ± 0.315
4.534IleLeu: 4.534 ± 0.765
0.954IleMet: 0.954 ± 0.303
0.954IleAsn: 0.954 ± 0.339
1.909IlePro: 1.909 ± 0.893
1.67IleGln: 1.67 ± 0.628
4.295IleArg: 4.295 ± 1.485
2.386IleSer: 2.386 ± 1.065
1.432IleThr: 1.432 ± 0.627
3.818IleVal: 3.818 ± 0.61
1.909IleTrp: 1.909 ± 0.416
0.954IleTyr: 0.954 ± 0.303
0.0IleXaa: 0.0 ± 0.0
Lys
4.772LysAla: 4.772 ± 1.746
0.954LysCys: 0.954 ± 0.303
4.056LysAsp: 4.056 ± 0.33
5.727LysGlu: 5.727 ± 1.84
1.909LysPhe: 1.909 ± 0.245
5.965LysGly: 5.965 ± 0.705
0.716LysHis: 0.716 ± 0.389
4.295LysIle: 4.295 ± 0.749
6.681LysLys: 6.681 ± 2.517
4.772LysLeu: 4.772 ± 0.535
0.477LysMet: 0.477 ± 0.306
2.625LysAsn: 2.625 ± 0.389
4.056LysPro: 4.056 ± 0.621
3.34LysGln: 3.34 ± 0.588
4.056LysArg: 4.056 ± 0.444
3.102LysSer: 3.102 ± 0.472
3.579LysThr: 3.579 ± 0.281
5.011LysVal: 5.011 ± 0.974
1.909LysTrp: 1.909 ± 0.383
2.147LysTyr: 2.147 ± 0.544
0.0LysXaa: 0.0 ± 0.0
Leu
5.249LeuAla: 5.249 ± 1.489
1.432LeuCys: 1.432 ± 0.454
2.147LeuAsp: 2.147 ± 0.282
8.828LeuGlu: 8.828 ± 2.713
3.34LeuPhe: 3.34 ± 0.456
9.306LeuGly: 9.306 ± 1.184
1.432LeuHis: 1.432 ± 0.18
2.863LeuIle: 2.863 ± 1.109
6.92LeuLys: 6.92 ± 1.082
6.204LeuLeu: 6.204 ± 0.86
2.147LeuMet: 2.147 ± 0.346
2.147LeuAsn: 2.147 ± 0.837
4.772LeuPro: 4.772 ± 1.035
5.011LeuGln: 5.011 ± 0.782
5.488LeuArg: 5.488 ± 1.111
2.386LeuSer: 2.386 ± 1.506
6.204LeuThr: 6.204 ± 0.518
4.295LeuVal: 4.295 ± 0.658
2.863LeuTrp: 2.863 ± 0.402
2.625LeuTyr: 2.625 ± 0.396
0.0LeuXaa: 0.0 ± 0.0
Met
2.386MetAla: 2.386 ± 0.549
0.477MetCys: 0.477 ± 0.49
1.432MetAsp: 1.432 ± 0.484
1.193MetGlu: 1.193 ± 0.603
0.477MetPhe: 0.477 ± 0.435
1.909MetGly: 1.909 ± 0.62
0.239MetHis: 0.239 ± 0.153
0.239MetIle: 0.239 ± 0.153
1.67MetLys: 1.67 ± 0.654
1.67MetLeu: 1.67 ± 0.893
0.0MetMet: 0.0 ± 0.0
0.239MetAsn: 0.239 ± 0.153
0.954MetPro: 0.954 ± 0.41
1.432MetGln: 1.432 ± 0.367
0.239MetArg: 0.239 ± 0.218
2.147MetSer: 2.147 ± 0.628
1.67MetThr: 1.67 ± 0.695
1.432MetVal: 1.432 ± 0.556
1.67MetTrp: 1.67 ± 1.075
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.193AsnAla: 1.193 ± 0.768
1.193AsnCys: 1.193 ± 0.335
1.432AsnAsp: 1.432 ± 0.405
1.67AsnGlu: 1.67 ± 0.948
0.477AsnPhe: 0.477 ± 0.151
2.863AsnGly: 2.863 ± 0.464
0.477AsnHis: 0.477 ± 0.435
1.432AsnIle: 1.432 ± 0.454
3.102AsnLys: 3.102 ± 0.724
2.625AsnLeu: 2.625 ± 0.752
1.67AsnMet: 1.67 ± 0.302
0.954AsnAsn: 0.954 ± 0.303
4.056AsnPro: 4.056 ± 1.126
4.295AsnGln: 4.295 ± 1.178
2.386AsnArg: 2.386 ± 0.861
2.863AsnSer: 2.863 ± 0.793
2.863AsnThr: 2.863 ± 0.818
0.954AsnVal: 0.954 ± 0.339
0.954AsnTrp: 0.954 ± 0.303
0.477AsnTyr: 0.477 ± 0.223
0.0AsnXaa: 0.0 ± 0.0
Pro
2.386ProAla: 2.386 ± 0.549
1.193ProCys: 1.193 ± 0.365
3.818ProAsp: 3.818 ± 0.752
5.011ProGlu: 5.011 ± 0.877
1.909ProPhe: 1.909 ± 0.431
5.011ProGly: 5.011 ± 0.494
1.67ProHis: 1.67 ± 0.513
2.386ProIle: 2.386 ± 1.455
4.056ProLys: 4.056 ± 0.505
4.534ProLeu: 4.534 ± 0.773
0.716ProMet: 0.716 ± 0.405
3.102ProAsn: 3.102 ± 0.529
4.772ProPro: 4.772 ± 1.956
3.34ProGln: 3.34 ± 0.854
3.818ProArg: 3.818 ± 1.353
4.056ProSer: 4.056 ± 0.608
5.727ProThr: 5.727 ± 0.901
3.818ProVal: 3.818 ± 0.8
0.716ProTrp: 0.716 ± 0.173
2.147ProTyr: 2.147 ± 0.952
0.239ProXaa: 0.239 ± 0.153
Gln
4.772GlnAla: 4.772 ± 1.231
0.716GlnCys: 0.716 ± 0.213
1.67GlnAsp: 1.67 ± 0.893
3.34GlnGlu: 3.34 ± 0.852
0.954GlnPhe: 0.954 ± 0.134
6.681GlnGly: 6.681 ± 1.405
1.67GlnHis: 1.67 ± 0.476
2.625GlnIle: 2.625 ± 1.173
3.579GlnLys: 3.579 ± 0.925
5.249GlnLeu: 5.249 ± 0.889
1.432GlnMet: 1.432 ± 0.897
3.102GlnAsn: 3.102 ± 0.387
4.534GlnPro: 4.534 ± 1.4
3.579GlnGln: 3.579 ± 0.612
3.34GlnArg: 3.34 ± 0.575
1.193GlnSer: 1.193 ± 0.48
2.625GlnThr: 2.625 ± 0.638
4.534GlnVal: 4.534 ± 0.776
0.954GlnTrp: 0.954 ± 0.324
2.625GlnTyr: 2.625 ± 0.776
0.0GlnXaa: 0.0 ± 0.0
Arg
1.909ArgAla: 1.909 ± 0.927
0.477ArgCys: 0.477 ± 0.49
4.056ArgAsp: 4.056 ± 0.842
4.772ArgGlu: 4.772 ± 1.536
1.193ArgPhe: 1.193 ± 0.336
4.056ArgGly: 4.056 ± 0.659
1.432ArgHis: 1.432 ± 0.772
3.102ArgIle: 3.102 ± 0.38
3.579ArgLys: 3.579 ± 0.737
3.34ArgLeu: 3.34 ± 0.703
0.0ArgMet: 0.0 ± 0.0
3.818ArgAsn: 3.818 ± 0.842
6.681ArgPro: 6.681 ± 2.15
4.056ArgGln: 4.056 ± 0.858
3.34ArgArg: 3.34 ± 1.966
4.295ArgSer: 4.295 ± 0.789
1.909ArgThr: 1.909 ± 0.806
2.147ArgVal: 2.147 ± 1.103
1.909ArgTrp: 1.909 ± 0.723
0.716ArgTyr: 0.716 ± 0.213
0.0ArgXaa: 0.0 ± 0.0
Ser
3.818SerAla: 3.818 ± 1.297
0.716SerCys: 0.716 ± 0.213
2.625SerAsp: 2.625 ± 1.029
1.432SerGlu: 1.432 ± 0.405
1.193SerPhe: 1.193 ± 0.368
3.579SerGly: 3.579 ± 1.509
1.193SerHis: 1.193 ± 0.765
2.386SerIle: 2.386 ± 0.754
3.34SerLys: 3.34 ± 0.777
6.92SerLeu: 6.92 ± 0.437
0.716SerMet: 0.716 ± 0.342
3.579SerAsn: 3.579 ± 0.565
3.579SerPro: 3.579 ± 1.361
5.011SerGln: 5.011 ± 1.277
1.909SerArg: 1.909 ± 0.615
1.193SerSer: 1.193 ± 0.482
3.579SerThr: 3.579 ± 1.219
2.147SerVal: 2.147 ± 0.911
2.863SerTrp: 2.863 ± 1.064
0.716SerTyr: 0.716 ± 0.653
0.0SerXaa: 0.0 ± 0.0
Thr
5.249ThrAla: 5.249 ± 0.813
1.67ThrCys: 1.67 ± 0.742
1.193ThrAsp: 1.193 ± 0.336
4.295ThrGlu: 4.295 ± 0.592
1.193ThrPhe: 1.193 ± 0.768
6.92ThrGly: 6.92 ± 1.304
2.625ThrHis: 2.625 ± 0.89
1.909ThrIle: 1.909 ± 0.416
4.534ThrLys: 4.534 ± 0.797
4.534ThrLeu: 4.534 ± 1.32
2.147ThrMet: 2.147 ± 0.604
2.147ThrAsn: 2.147 ± 1.101
2.863ThrPro: 2.863 ± 0.357
3.579ThrGln: 3.579 ± 0.909
3.579ThrArg: 3.579 ± 0.468
4.534ThrSer: 4.534 ± 0.807
6.442ThrThr: 6.442 ± 1.166
4.772ThrVal: 4.772 ± 1.076
2.147ThrTrp: 2.147 ± 0.304
1.67ThrTyr: 1.67 ± 0.628
0.0ThrXaa: 0.0 ± 0.0
Val
3.579ValAla: 3.579 ± 0.465
1.432ValCys: 1.432 ± 0.686
2.625ValAsp: 2.625 ± 0.917
3.579ValGlu: 3.579 ± 0.465
0.954ValPhe: 0.954 ± 0.134
5.011ValGly: 5.011 ± 1.004
1.67ValHis: 1.67 ± 1.152
3.34ValIle: 3.34 ± 0.702
3.34ValLys: 3.34 ± 0.476
5.488ValLeu: 5.488 ± 0.808
0.954ValMet: 0.954 ± 0.461
1.432ValAsn: 1.432 ± 0.696
3.818ValPro: 3.818 ± 0.411
4.295ValGln: 4.295 ± 0.448
2.625ValArg: 2.625 ± 0.363
3.818ValSer: 3.818 ± 1.025
5.727ValThr: 5.727 ± 1.085
1.432ValVal: 1.432 ± 0.335
0.716ValTrp: 0.716 ± 0.388
1.432ValTyr: 1.432 ± 0.367
0.0ValXaa: 0.0 ± 0.0
Trp
2.386TrpAla: 2.386 ± 0.763
1.193TrpCys: 1.193 ± 1.101
2.147TrpAsp: 2.147 ± 0.95
2.386TrpGlu: 2.386 ± 0.965
0.954TrpPhe: 0.954 ± 0.41
1.909TrpGly: 1.909 ± 0.836
0.954TrpHis: 0.954 ± 0.33
1.909TrpIle: 1.909 ± 0.414
2.863TrpLys: 2.863 ± 0.622
2.147TrpLeu: 2.147 ± 0.757
1.909TrpMet: 1.909 ± 1.227
1.193TrpAsn: 1.193 ± 0.213
0.239TrpPro: 0.239 ± 0.153
2.147TrpGln: 2.147 ± 0.478
2.863TrpArg: 2.863 ± 0.403
0.716TrpSer: 0.716 ± 0.389
1.432TrpThr: 1.432 ± 0.454
1.432TrpVal: 1.432 ± 0.411
0.716TrpTrp: 0.716 ± 0.342
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.432TyrAla: 1.432 ± 0.688
0.477TyrCys: 0.477 ± 0.49
0.716TyrAsp: 0.716 ± 0.342
1.909TyrGlu: 1.909 ± 0.269
0.477TyrPhe: 0.477 ± 0.151
0.716TyrGly: 0.716 ± 0.213
1.193TyrHis: 1.193 ± 0.48
0.716TyrIle: 0.716 ± 0.342
0.716TyrLys: 0.716 ± 0.292
2.863TyrLeu: 2.863 ± 0.908
0.239TyrMet: 0.239 ± 0.218
0.954TyrAsn: 0.954 ± 0.33
1.193TyrPro: 1.193 ± 0.336
1.193TyrGln: 1.193 ± 0.213
1.909TyrArg: 1.909 ± 0.957
2.147TyrSer: 2.147 ± 0.354
1.67TyrThr: 1.67 ± 0.466
2.147TyrVal: 2.147 ± 0.419
0.716TyrTrp: 0.716 ± 0.292
0.716TyrTyr: 0.716 ± 0.624
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.239XaaGly: 0.239 ± 0.153
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4192 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski