Amino acid dipepetide frequency for Ceratitis capitata sigmavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.256AlaAla: 1.256 ± 0.607
0.503AlaCys: 0.503 ± 0.461
2.764AlaAsp: 2.764 ± 0.879
1.508AlaGlu: 1.508 ± 0.547
1.508AlaPhe: 1.508 ± 0.505
1.005AlaGly: 1.005 ± 0.437
0.251AlaHis: 0.251 ± 0.152
0.754AlaIle: 0.754 ± 0.456
2.261AlaLys: 2.261 ± 0.874
5.276AlaLeu: 5.276 ± 1.954
0.503AlaMet: 0.503 ± 0.461
2.764AlaAsn: 2.764 ± 1.267
1.005AlaPro: 1.005 ± 0.652
2.01AlaGln: 2.01 ± 0.654
1.508AlaArg: 1.508 ± 0.641
2.764AlaSer: 2.764 ± 1.017
3.015AlaThr: 3.015 ± 0.839
1.508AlaVal: 1.508 ± 0.641
0.503AlaTrp: 0.503 ± 0.604
1.256AlaTyr: 1.256 ± 0.374
0.0AlaXaa: 0.0 ± 0.0
Cys
0.503CysAla: 0.503 ± 0.304
0.503CysCys: 0.503 ± 0.304
1.005CysAsp: 1.005 ± 0.969
0.754CysGlu: 0.754 ± 0.439
0.503CysPhe: 0.503 ± 0.385
1.005CysGly: 1.005 ± 0.608
0.251CysHis: 0.251 ± 0.152
1.256CysIle: 1.256 ± 0.579
1.005CysLys: 1.005 ± 0.436
1.759CysLeu: 1.759 ± 0.556
0.0CysMet: 0.0 ± 0.0
1.759CysAsn: 1.759 ± 0.829
1.005CysPro: 1.005 ± 0.805
0.0CysGln: 0.0 ± 0.0
0.754CysArg: 0.754 ± 0.613
1.759CysSer: 1.759 ± 0.375
0.754CysThr: 0.754 ± 0.456
1.005CysVal: 1.005 ± 0.322
0.503CysTrp: 0.503 ± 0.304
0.503CysTyr: 0.503 ± 0.388
0.0CysXaa: 0.0 ± 0.0
Asp
2.01AspAla: 2.01 ± 1.798
2.01AspCys: 2.01 ± 0.566
2.513AspAsp: 2.513 ± 0.751
2.764AspGlu: 2.764 ± 0.798
2.261AspPhe: 2.261 ± 0.717
2.01AspGly: 2.01 ± 0.478
0.754AspHis: 0.754 ± 0.403
4.271AspIle: 4.271 ± 0.863
3.518AspLys: 3.518 ± 1.769
6.533AspLeu: 6.533 ± 1.485
1.508AspMet: 1.508 ± 0.505
3.015AspAsn: 3.015 ± 0.93
2.764AspPro: 2.764 ± 0.304
3.015AspGln: 3.015 ± 0.816
2.01AspArg: 2.01 ± 0.402
4.774AspSer: 4.774 ± 1.129
2.513AspThr: 2.513 ± 0.331
3.266AspVal: 3.266 ± 1.2
1.508AspTrp: 1.508 ± 1.257
2.261AspTyr: 2.261 ± 0.838
0.0AspXaa: 0.0 ± 0.0
Glu
2.261GluAla: 2.261 ± 1.11
0.503GluCys: 0.503 ± 0.461
4.774GluAsp: 4.774 ± 1.613
1.508GluGlu: 1.508 ± 0.645
1.759GluPhe: 1.759 ± 0.769
2.01GluGly: 2.01 ± 0.708
1.508GluHis: 1.508 ± 0.646
5.779GluIle: 5.779 ± 1.728
2.261GluLys: 2.261 ± 0.26
4.523GluLeu: 4.523 ± 1.337
0.754GluMet: 0.754 ± 0.444
1.759GluAsn: 1.759 ± 0.522
1.508GluPro: 1.508 ± 0.428
1.508GluGln: 1.508 ± 1.152
2.01GluArg: 2.01 ± 0.606
5.276GluSer: 5.276 ± 1.477
3.518GluThr: 3.518 ± 0.727
2.513GluVal: 2.513 ± 0.743
1.256GluTrp: 1.256 ± 0.424
3.015GluTyr: 3.015 ± 0.681
0.0GluXaa: 0.0 ± 0.0
Phe
1.256PheAla: 1.256 ± 0.527
0.251PheCys: 0.251 ± 0.152
2.261PheAsp: 2.261 ± 1.086
2.764PheGlu: 2.764 ± 0.741
1.005PhePhe: 1.005 ± 0.436
2.513PheGly: 2.513 ± 0.904
0.754PheHis: 0.754 ± 0.593
1.759PheIle: 1.759 ± 0.522
4.271PheLys: 4.271 ± 0.84
5.276PheLeu: 5.276 ± 1.601
0.754PheMet: 0.754 ± 0.382
3.266PheAsn: 3.266 ± 0.22
3.769PhePro: 3.769 ± 0.996
1.759PheGln: 1.759 ± 0.522
2.764PheArg: 2.764 ± 0.604
2.513PheSer: 2.513 ± 1.127
1.759PheThr: 1.759 ± 0.832
2.261PheVal: 2.261 ± 1.1
0.251PheTrp: 0.251 ± 0.152
1.759PheTyr: 1.759 ± 0.615
0.0PheXaa: 0.0 ± 0.0
Gly
1.005GlyAla: 1.005 ± 0.506
0.503GlyCys: 0.503 ± 0.3
2.513GlyAsp: 2.513 ± 1.08
1.005GlyGlu: 1.005 ± 0.681
2.513GlyPhe: 2.513 ± 1.034
2.513GlyGly: 2.513 ± 1.08
1.759GlyHis: 1.759 ± 0.775
3.769GlyIle: 3.769 ± 1.31
1.759GlyLys: 1.759 ± 0.873
6.281GlyLeu: 6.281 ± 0.595
0.503GlyMet: 0.503 ± 0.385
2.764GlyAsn: 2.764 ± 0.788
0.754GlyPro: 0.754 ± 0.444
3.266GlyGln: 3.266 ± 0.729
1.005GlyArg: 1.005 ± 0.421
3.769GlySer: 3.769 ± 1.68
2.764GlyThr: 2.764 ± 1.504
1.759GlyVal: 1.759 ± 0.832
0.754GlyTrp: 0.754 ± 0.456
3.518GlyTyr: 3.518 ± 1.194
0.0GlyXaa: 0.0 ± 0.0
His
1.256HisAla: 1.256 ± 0.426
0.754HisCys: 0.754 ± 0.816
0.754HisAsp: 0.754 ± 0.439
0.503HisGlu: 0.503 ± 0.304
1.256HisPhe: 1.256 ± 0.424
1.759HisGly: 1.759 ± 0.534
1.508HisHis: 1.508 ± 1.045
3.769HisIle: 3.769 ± 0.885
1.256HisLys: 1.256 ± 0.979
3.518HisLeu: 3.518 ± 0.563
0.754HisMet: 0.754 ± 0.586
1.508HisAsn: 1.508 ± 0.647
2.01HisPro: 2.01 ± 0.482
1.005HisGln: 1.005 ± 0.436
1.508HisArg: 1.508 ± 0.789
2.261HisSer: 2.261 ± 0.956
1.759HisThr: 1.759 ± 0.537
0.754HisVal: 0.754 ± 0.439
0.251HisTrp: 0.251 ± 0.152
1.005HisTyr: 1.005 ± 0.561
0.0HisXaa: 0.0 ± 0.0
Ile
4.271IleAla: 4.271 ± 1.609
1.256IleCys: 1.256 ± 0.76
4.271IleAsp: 4.271 ± 0.948
4.271IleGlu: 4.271 ± 1.442
2.513IlePhe: 2.513 ± 0.841
4.523IleGly: 4.523 ± 0.912
2.764IleHis: 2.764 ± 1.518
6.784IleIle: 6.784 ± 1.395
7.035IleLys: 7.035 ± 1.353
7.286IleLeu: 7.286 ± 0.779
0.251IleMet: 0.251 ± 0.443
8.794IleAsn: 8.794 ± 1.584
5.276IlePro: 5.276 ± 1.174
3.015IleGln: 3.015 ± 0.749
3.769IleArg: 3.769 ± 1.124
5.025IleSer: 5.025 ± 0.651
5.025IleThr: 5.025 ± 1.165
4.02IleVal: 4.02 ± 0.687
1.005IleTrp: 1.005 ± 0.421
3.015IleTyr: 3.015 ± 1.454
0.0IleXaa: 0.0 ± 0.0
Lys
1.759LysAla: 1.759 ± 0.921
1.256LysCys: 1.256 ± 0.53
2.01LysAsp: 2.01 ± 0.422
4.523LysGlu: 4.523 ± 1.878
4.02LysPhe: 4.02 ± 0.844
2.513LysGly: 2.513 ± 0.942
1.759LysHis: 1.759 ± 0.811
3.769LysIle: 3.769 ± 0.419
4.774LysLys: 4.774 ± 1.395
5.528LysLeu: 5.528 ± 1.209
2.01LysMet: 2.01 ± 0.29
2.764LysAsn: 2.764 ± 0.311
3.769LysPro: 3.769 ± 1.62
2.764LysGln: 2.764 ± 1.348
2.764LysArg: 2.764 ± 0.685
4.523LysSer: 4.523 ± 0.973
4.523LysThr: 4.523 ± 1.654
4.271LysVal: 4.271 ± 1.418
1.508LysTrp: 1.508 ± 0.7
2.513LysTyr: 2.513 ± 1.343
0.0LysXaa: 0.0 ± 0.0
Leu
4.523LeuAla: 4.523 ± 0.95
1.759LeuCys: 1.759 ± 0.589
6.03LeuAsp: 6.03 ± 2.588
5.276LeuGlu: 5.276 ± 1.244
5.528LeuPhe: 5.528 ± 1.804
4.774LeuGly: 4.774 ± 1.015
2.261LeuHis: 2.261 ± 0.608
10.804LeuIle: 10.804 ± 1.293
8.291LeuLys: 8.291 ± 1.461
9.548LeuLeu: 9.548 ± 1.722
3.769LeuMet: 3.769 ± 0.592
7.286LeuAsn: 7.286 ± 1.473
2.764LeuPro: 2.764 ± 1.468
2.261LeuGln: 2.261 ± 0.565
5.276LeuArg: 5.276 ± 1.013
10.553LeuSer: 10.553 ± 2.081
9.296LeuThr: 9.296 ± 1.77
6.281LeuVal: 6.281 ± 1.574
0.754LeuTrp: 0.754 ± 0.613
2.513LeuTyr: 2.513 ± 0.591
0.0LeuXaa: 0.0 ± 0.0
Met
0.754MetAla: 0.754 ± 0.456
0.251MetCys: 0.251 ± 0.152
0.503MetAsp: 0.503 ± 0.837
1.256MetGlu: 1.256 ± 0.621
0.754MetPhe: 0.754 ± 0.326
1.005MetGly: 1.005 ± 0.411
0.251MetHis: 0.251 ± 0.443
2.261MetIle: 2.261 ± 0.814
1.256MetLys: 1.256 ± 0.424
2.261MetLeu: 2.261 ± 1.093
1.256MetMet: 1.256 ± 0.832
1.508MetAsn: 1.508 ± 1.233
0.251MetPro: 0.251 ± 0.152
0.251MetGln: 0.251 ± 0.335
0.503MetArg: 0.503 ± 0.304
2.261MetSer: 2.261 ± 0.26
1.508MetThr: 1.508 ± 0.671
0.754MetVal: 0.754 ± 0.514
0.251MetTrp: 0.251 ± 0.425
1.256MetTyr: 1.256 ± 0.54
0.0MetXaa: 0.0 ± 0.0
Asn
1.759AsnAla: 1.759 ± 0.9
1.005AsnCys: 1.005 ± 0.608
2.513AsnAsp: 2.513 ± 0.502
2.261AsnGlu: 2.261 ± 0.436
2.513AsnPhe: 2.513 ± 0.89
1.759AsnGly: 1.759 ± 1.08
2.764AsnHis: 2.764 ± 1.021
6.281AsnIle: 6.281 ± 0.846
5.528AsnLys: 5.528 ± 0.707
6.281AsnLeu: 6.281 ± 1.015
1.256AsnMet: 1.256 ± 0.395
4.523AsnAsn: 4.523 ± 0.521
4.774AsnPro: 4.774 ± 0.823
4.02AsnGln: 4.02 ± 1.395
2.261AsnArg: 2.261 ± 1.092
5.528AsnSer: 5.528 ± 1.617
4.02AsnThr: 4.02 ± 1.223
2.513AsnVal: 2.513 ± 1.08
1.005AsnTrp: 1.005 ± 0.322
3.015AsnTyr: 3.015 ± 0.855
0.0AsnXaa: 0.0 ± 0.0
Pro
1.508ProAla: 1.508 ± 0.652
1.005ProCys: 1.005 ± 0.485
3.266ProAsp: 3.266 ± 0.789
1.759ProGlu: 1.759 ± 0.532
1.256ProPhe: 1.256 ± 0.527
2.01ProGly: 2.01 ± 1.435
1.005ProHis: 1.005 ± 0.436
3.518ProIle: 3.518 ± 1.227
2.764ProLys: 2.764 ± 1.19
6.03ProLeu: 6.03 ± 1.297
1.256ProMet: 1.256 ± 0.689
2.261ProAsn: 2.261 ± 0.978
0.503ProPro: 0.503 ± 0.304
1.759ProGln: 1.759 ± 0.678
2.513ProArg: 2.513 ± 0.875
4.523ProSer: 4.523 ± 1.41
3.518ProThr: 3.518 ± 1.617
2.513ProVal: 2.513 ± 0.653
0.251ProTrp: 0.251 ± 0.152
1.759ProTyr: 1.759 ± 0.752
0.0ProXaa: 0.0 ± 0.0
Gln
1.256GlnAla: 1.256 ± 0.997
0.251GlnCys: 0.251 ± 0.425
1.256GlnAsp: 1.256 ± 0.675
2.261GlnGlu: 2.261 ± 0.53
1.759GlnPhe: 1.759 ± 0.806
2.513GlnGly: 2.513 ± 0.889
0.754GlnHis: 0.754 ± 0.331
3.518GlnIle: 3.518 ± 1.01
0.754GlnLys: 0.754 ± 0.539
5.276GlnLeu: 5.276 ± 1.695
0.0GlnMet: 0.0 ± 0.0
3.266GlnAsn: 3.266 ± 0.944
0.251GlnPro: 0.251 ± 0.346
1.256GlnGln: 1.256 ± 1.321
1.508GlnArg: 1.508 ± 0.4
2.261GlnSer: 2.261 ± 0.767
2.764GlnThr: 2.764 ± 1.085
2.764GlnVal: 2.764 ± 0.825
0.754GlnTrp: 0.754 ± 0.326
1.005GlnTyr: 1.005 ± 0.599
0.0GlnXaa: 0.0 ± 0.0
Arg
1.256ArgAla: 1.256 ± 0.53
0.754ArgCys: 0.754 ± 0.613
4.271ArgAsp: 4.271 ± 1.826
2.261ArgGlu: 2.261 ± 0.52
2.513ArgPhe: 2.513 ± 0.828
1.508ArgGly: 1.508 ± 0.912
1.005ArgHis: 1.005 ± 0.379
2.764ArgIle: 2.764 ± 0.458
2.01ArgLys: 2.01 ± 0.592
6.784ArgLeu: 6.784 ± 1.357
0.754ArgMet: 0.754 ± 0.326
2.513ArgAsn: 2.513 ± 0.828
1.005ArgPro: 1.005 ± 0.477
1.508ArgGln: 1.508 ± 0.685
3.015ArgArg: 3.015 ± 1.015
4.271ArgSer: 4.271 ± 0.697
2.01ArgThr: 2.01 ± 0.592
2.01ArgVal: 2.01 ± 0.665
1.005ArgTrp: 1.005 ± 0.592
2.01ArgTyr: 2.01 ± 0.402
0.0ArgXaa: 0.0 ± 0.0
Ser
3.015SerAla: 3.015 ± 0.801
1.759SerCys: 1.759 ± 0.879
4.271SerAsp: 4.271 ± 1.052
5.528SerGlu: 5.528 ± 0.947
3.266SerPhe: 3.266 ± 1.139
3.266SerGly: 3.266 ± 1.018
3.015SerHis: 3.015 ± 0.978
7.286SerIle: 7.286 ± 1.633
5.025SerLys: 5.025 ± 0.758
8.794SerLeu: 8.794 ± 1.035
1.256SerMet: 1.256 ± 0.891
4.523SerAsn: 4.523 ± 1.155
4.271SerPro: 4.271 ± 1.023
2.513SerGln: 2.513 ± 0.817
2.513SerArg: 2.513 ± 0.568
7.538SerSer: 7.538 ± 1.445
6.281SerThr: 6.281 ± 1.557
2.513SerVal: 2.513 ± 0.955
2.01SerTrp: 2.01 ± 0.955
4.271SerTyr: 4.271 ± 1.64
0.0SerXaa: 0.0 ± 0.0
Thr
1.005ThrAla: 1.005 ± 0.322
1.256ThrCys: 1.256 ± 0.609
3.769ThrAsp: 3.769 ± 0.542
3.518ThrGlu: 3.518 ± 0.969
2.764ThrPhe: 2.764 ± 0.577
3.015ThrGly: 3.015 ± 0.996
2.764ThrHis: 2.764 ± 0.907
7.789ThrIle: 7.789 ± 1.005
3.769ThrLys: 3.769 ± 0.782
6.784ThrLeu: 6.784 ± 1.308
1.256ThrMet: 1.256 ± 0.374
3.266ThrAsn: 3.266 ± 0.722
4.774ThrPro: 4.774 ± 1.968
1.256ThrGln: 1.256 ± 1.065
3.518ThrArg: 3.518 ± 0.911
4.774ThrSer: 4.774 ± 0.835
4.774ThrThr: 4.774 ± 2.476
5.025ThrVal: 5.025 ± 0.849
0.503ThrTrp: 0.503 ± 0.304
3.266ThrTyr: 3.266 ± 1.064
0.0ThrXaa: 0.0 ± 0.0
Val
1.508ValAla: 1.508 ± 0.597
0.251ValCys: 0.251 ± 0.152
3.015ValAsp: 3.015 ± 0.954
3.518ValGlu: 3.518 ± 0.976
2.261ValPhe: 2.261 ± 0.91
1.508ValGly: 1.508 ± 1.43
1.508ValHis: 1.508 ± 0.54
4.774ValIle: 4.774 ± 1.281
2.513ValLys: 2.513 ± 0.574
5.025ValLeu: 5.025 ± 1.449
1.005ValMet: 1.005 ± 0.411
3.769ValAsn: 3.769 ± 1.298
1.759ValPro: 1.759 ± 0.467
0.754ValGln: 0.754 ± 0.326
2.764ValArg: 2.764 ± 0.753
4.271ValSer: 4.271 ± 1.526
5.276ValThr: 5.276 ± 0.988
1.508ValVal: 1.508 ± 0.548
0.754ValTrp: 0.754 ± 0.331
2.01ValTyr: 2.01 ± 0.884
0.0ValXaa: 0.0 ± 0.0
Trp
0.251TrpAla: 0.251 ± 0.152
0.251TrpCys: 0.251 ± 0.152
1.256TrpAsp: 1.256 ± 0.579
0.754TrpGlu: 0.754 ± 0.456
1.005TrpPhe: 1.005 ± 0.599
1.759TrpGly: 1.759 ± 1.064
0.251TrpHis: 0.251 ± 0.152
1.508TrpIle: 1.508 ± 0.661
1.508TrpLys: 1.508 ± 0.547
0.251TrpLeu: 0.251 ± 0.152
0.251TrpMet: 0.251 ± 0.425
1.256TrpAsn: 1.256 ± 0.609
0.754TrpPro: 0.754 ± 0.456
0.503TrpGln: 0.503 ± 0.304
0.754TrpArg: 0.754 ± 0.444
1.256TrpSer: 1.256 ± 0.54
0.754TrpThr: 0.754 ± 0.742
0.754TrpVal: 0.754 ± 0.367
0.0TrpTrp: 0.0 ± 0.0
0.503TrpTyr: 0.503 ± 0.461
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.508TyrAla: 1.508 ± 1.377
0.503TyrCys: 0.503 ± 0.461
2.261TyrAsp: 2.261 ± 1.139
2.01TyrGlu: 2.01 ± 0.742
2.01TyrPhe: 2.01 ± 0.665
1.508TyrGly: 1.508 ± 0.49
2.261TyrHis: 2.261 ± 0.544
1.759TyrIle: 1.759 ± 0.522
1.759TyrLys: 1.759 ± 0.829
6.533TyrLeu: 6.533 ± 0.964
1.005TyrMet: 1.005 ± 1.068
3.015TyrAsn: 3.015 ± 0.99
2.01TyrPro: 2.01 ± 0.812
1.005TyrGln: 1.005 ± 0.592
2.513TyrArg: 2.513 ± 0.433
3.266TyrSer: 3.266 ± 1.069
3.015TyrThr: 3.015 ± 0.961
1.759TyrVal: 1.759 ± 0.889
0.754TyrTrp: 0.754 ± 0.407
1.508TyrTyr: 1.508 ± 0.505
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3981 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski