Amino acid dipeptide frequency for Solenopsis invicta virus 3 (SINV-3)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
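The page does not spell out how the pairs are counted, so the following is only a minimal sketch under stated assumptions: overlapping windows of two residues, pairs that never span two proteins, counts pooled over the whole proteome, and any non-standard letter mapped to X (Xaa). The function name `dipeptide_per_mille` is hypothetical. These assumptions are at least consistent with the table: 2 proteins with 5970 amino acids give 5968 overlapping pairs, and one occurrence in 5968 is about 0.168 ‰, the smallest non-zero value listed.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption (not stated by the source): overlapping pairs, counted
    within each protein and then pooled.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real SINV-3 sequences.
freqs = dipeptide_per_mille(["AAK", "KA"])
print(sorted(freqs))  # ['AA', 'AK', 'KA']
```

Dipeptides that never occur are simply absent from the returned dictionary; a caller that needs the full 400 (or 441) entries can fill the missing ones with 0.0.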

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5 ‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
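As a small illustration of the uniform baseline, the 0.25% figure is 1/400, or 2.5 ‰ in the units used by the table; with Xaa included it becomes 1/441, roughly 2.27 ‰. The sketch below computes that baseline and labels a value as over- or underrepresented. Which of the two baselines the page actually uses for colouring is not stated, and the helper names `expected_per_mille` and `classify` are hypothetical.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for one dipeptide: 1000 / n^2 per mille."""
    return 1000.0 / n_residue_types ** 2


def classify(freq_per_mille, expected):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # marked in red on the page
    if freq_per_mille < expected:
        return "under"   # marked in blue on the page
    return "expected"


baseline = expected_per_mille(20)   # 2.5 per mille, i.e. 0.25%
print(classify(10.89, baseline))    # LysPhe value from the table: over
print(classify(0.67, baseline))     # AlaVal value from the table: under
```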

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 3.351 ‰ = 0.003351).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.351AlaAla: 3.351 ± 0.152
1.005AlaCys: 1.005 ± 0.096
1.843AlaAsp: 1.843 ± 0.415
2.513AlaGlu: 2.513 ± 0.122
1.508AlaPhe: 1.508 ± 0.026
4.021AlaGly: 4.021 ± 0.088
0.67AlaHis: 0.67 ± 0.064
1.508AlaIle: 1.508 ± 0.21
2.681AlaLys: 2.681 ± 0.02
5.696AlaLeu: 5.696 ± 0.073
1.173AlaMet: 1.173 ± 0.242
1.508AlaAsn: 1.508 ± 0.026
1.675AlaPro: 1.675 ± 0.16
3.016AlaGln: 3.016 ± 0.421
2.513AlaArg: 2.513 ± 0.114
2.848AlaSer: 2.848 ± 0.082
3.853AlaThr: 3.853 ± 0.251
0.67AlaVal: 0.67 ± 0.064
0.335AlaTrp: 0.335 ± 0.032
1.508AlaTyr: 1.508 ± 0.447
0.0AlaXaa: 0.0 ± 0.0
Cys
2.178CysAla: 2.178 ± 0.09
0.0CysCys: 0.0 ± 0.0
1.34CysAsp: 1.34 ± 0.128
0.67CysGlu: 0.67 ± 0.064
0.503CysPhe: 0.503 ± 0.07
0.335CysGly: 0.335 ± 0.032
0.0CysHis: 0.0 ± 0.0
0.67CysIle: 0.67 ± 0.064
1.005CysLys: 1.005 ± 0.096
0.67CysLeu: 0.67 ± 0.064
0.0CysMet: 0.0 ± 0.0
2.513CysAsn: 2.513 ± 0.122
1.34CysPro: 1.34 ± 0.128
1.34CysGln: 1.34 ± 0.128
0.335CysArg: 0.335 ± 0.032
1.005CysSer: 1.005 ± 0.096
0.335CysThr: 0.335 ± 0.032
1.005CysVal: 1.005 ± 0.096
0.0CysTrp: 0.0 ± 0.0
1.34CysTyr: 1.34 ± 0.128
0.0CysXaa: 0.0 ± 0.0
Asp
1.508AspAla: 1.508 ± 0.026
3.016AspCys: 3.016 ± 0.289
5.026AspAsp: 5.026 ± 0.245
3.518AspGlu: 3.518 ± 0.018
3.016AspPhe: 3.016 ± 0.184
2.848AspGly: 2.848 ± 0.318
1.34AspHis: 1.34 ± 0.108
5.361AspIle: 5.361 ± 0.277
3.183AspLys: 3.183 ± 0.05
7.539AspLeu: 7.539 ± 0.106
1.508AspMet: 1.508 ± 0.026
2.848AspAsn: 2.848 ± 0.318
2.848AspPro: 2.848 ± 0.555
1.508AspGln: 1.508 ± 0.026
1.675AspArg: 1.675 ± 0.16
2.848AspSer: 2.848 ± 0.155
1.675AspThr: 1.675 ± 0.16
4.021AspVal: 4.021 ± 0.149
1.173AspTrp: 1.173 ± 0.242
2.681AspTyr: 2.681 ± 0.02
0.0AspXaa: 0.0 ± 0.0
Glu
2.345GluAla: 2.345 ± 0.012
0.335GluCys: 0.335 ± 0.032
2.681GluAsp: 2.681 ± 0.257
5.864GluGlu: 5.864 ± 0.443
4.021GluPhe: 4.021 ± 0.385
1.005GluGly: 1.005 ± 0.096
2.345GluHis: 2.345 ± 0.012
7.371GluIle: 7.371 ± 0.706
5.529GluLys: 5.529 ± 0.411
5.696GluLeu: 5.696 ± 0.073
1.005GluMet: 1.005 ± 0.096
4.523GluAsn: 4.523 ± 0.078
2.513GluPro: 2.513 ± 0.35
2.178GluGln: 2.178 ± 0.09
2.513GluArg: 2.513 ± 0.35
2.178GluSer: 2.178 ± 0.146
3.183GluThr: 3.183 ± 0.05
2.681GluVal: 2.681 ± 0.257
0.168GluTrp: 0.168 ± 0.102
1.843GluTyr: 1.843 ± 0.058
0.0GluXaa: 0.0 ± 0.0
Phe
2.345PheAla: 2.345 ± 0.012
0.67PheCys: 0.67 ± 0.064
5.361PheAsp: 5.361 ± 0.277
2.848PheGlu: 2.848 ± 0.155
4.356PhePhe: 4.356 ± 0.417
3.518PheGly: 3.518 ± 0.254
1.34PheHis: 1.34 ± 0.128
4.858PheIle: 4.858 ± 0.111
6.701PheLys: 6.701 ± 0.405
4.021PheLeu: 4.021 ± 0.149
0.335PheMet: 0.335 ± 0.032
6.366PheAsn: 6.366 ± 0.373
2.01PhePro: 2.01 ± 0.193
2.178PheGln: 2.178 ± 0.09
2.848PheArg: 2.848 ± 0.155
3.351PheSer: 3.351 ± 0.152
4.021PheThr: 4.021 ± 0.385
3.351PheVal: 3.351 ± 0.084
0.0PheTrp: 0.0 ± 0.0
4.021PheTyr: 4.021 ± 0.088
0.0PheXaa: 0.0 ± 0.0
Gly
3.518GlyAla: 3.518 ± 0.491
1.34GlyCys: 1.34 ± 0.128
2.848GlyAsp: 2.848 ± 0.082
1.675GlyGlu: 1.675 ± 0.16
1.675GlyPhe: 1.675 ± 0.312
2.848GlyGly: 2.848 ± 0.318
1.005GlyHis: 1.005 ± 0.096
3.351GlyIle: 3.351 ± 0.152
2.513GlyLys: 2.513 ± 0.114
3.351GlyLeu: 3.351 ± 0.152
2.513GlyMet: 2.513 ± 0.122
3.016GlyAsn: 3.016 ± 0.184
1.675GlyPro: 1.675 ± 0.076
2.848GlyGln: 2.848 ± 0.155
0.67GlyArg: 0.67 ± 0.172
2.178GlySer: 2.178 ± 0.146
2.513GlyThr: 2.513 ± 0.35
2.681GlyVal: 2.681 ± 0.216
1.675GlyTrp: 1.675 ± 0.16
1.843GlyTyr: 1.843 ± 0.058
0.0GlyXaa: 0.0 ± 0.0
His
0.168HisAla: 0.168 ± 0.102
0.168HisCys: 0.168 ± 0.102
0.335HisAsp: 0.335 ± 0.032
1.173HisGlu: 1.173 ± 0.006
2.345HisPhe: 2.345 ± 0.225
0.335HisGly: 0.335 ± 0.032
0.0HisHis: 0.0 ± 0.0
2.513HisIle: 2.513 ± 0.122
2.345HisLys: 2.345 ± 0.225
2.178HisLeu: 2.178 ± 0.09
0.335HisMet: 0.335 ± 0.032
1.005HisAsn: 1.005 ± 0.14
0.335HisPro: 0.335 ± 0.032
0.67HisGln: 0.67 ± 0.172
1.675HisArg: 1.675 ± 0.16
1.005HisSer: 1.005 ± 0.096
1.34HisThr: 1.34 ± 0.108
0.67HisVal: 0.67 ± 0.064
0.0HisTrp: 0.0 ± 0.0
0.67HisTyr: 0.67 ± 0.064
0.0HisXaa: 0.0 ± 0.0
Ile
4.356IleAla: 4.356 ± 0.056
2.178IleCys: 2.178 ± 0.09
3.183IleAsp: 3.183 ± 0.286
4.858IleGlu: 4.858 ± 0.111
4.188IlePhe: 4.188 ± 0.283
2.513IleGly: 2.513 ± 0.114
1.508IleHis: 1.508 ± 0.026
5.529IleIle: 5.529 ± 0.411
4.523IleLys: 4.523 ± 0.315
7.036IleLeu: 7.036 ± 0.201
1.34IleMet: 1.34 ± 0.128
5.696IleAsn: 5.696 ± 0.164
4.356IlePro: 4.356 ± 0.181
5.193IleGln: 5.193 ± 0.143
2.178IleArg: 2.178 ± 0.146
8.209IleSer: 8.209 ± 0.431
5.696IleThr: 5.696 ± 0.073
3.686IleVal: 3.686 ± 0.116
0.335IleTrp: 0.335 ± 0.032
3.016IleTyr: 3.016 ± 0.184
0.0IleXaa: 0.0 ± 0.0
Lys
2.345LysAla: 2.345 ± 0.012
1.34LysCys: 1.34 ± 0.128
5.529LysAsp: 5.529 ± 0.411
3.518LysGlu: 3.518 ± 0.219
10.89LysPhe: 10.89 ± 0.452
2.681LysGly: 2.681 ± 0.02
1.843LysHis: 1.843 ± 0.058
8.209LysIle: 8.209 ± 0.431
5.696LysLys: 5.696 ± 0.545
6.701LysLeu: 6.701 ± 0.405
2.681LysMet: 2.681 ± 0.216
5.529LysAsn: 5.529 ± 0.062
2.513LysPro: 2.513 ± 0.35
4.691LysGln: 4.691 ± 0.213
3.016LysArg: 3.016 ± 0.289
4.691LysSer: 4.691 ± 0.213
4.356LysThr: 4.356 ± 0.056
3.351LysVal: 3.351 ± 0.084
0.503LysTrp: 0.503 ± 0.07
3.183LysTyr: 3.183 ± 0.05
0.0LysXaa: 0.0 ± 0.0
Leu
3.518LeuAla: 3.518 ± 0.018
0.67LeuCys: 0.67 ± 0.064
5.529LeuAsp: 5.529 ± 0.298
3.016LeuGlu: 3.016 ± 0.289
4.523LeuPhe: 4.523 ± 0.078
4.356LeuGly: 4.356 ± 0.181
2.01LeuHis: 2.01 ± 0.193
4.021LeuIle: 4.021 ± 0.088
6.869LeuLys: 6.869 ± 0.067
5.361LeuLeu: 5.361 ± 0.04
1.675LeuMet: 1.675 ± 0.16
7.371LeuAsn: 7.371 ± 0.003
3.351LeuPro: 3.351 ± 0.084
2.01LeuGln: 2.01 ± 0.044
2.01LeuArg: 2.01 ± 0.193
8.879LeuSer: 8.879 ± 0.023
4.523LeuThr: 4.523 ± 0.158
5.361LeuVal: 5.361 ± 0.277
1.005LeuTrp: 1.005 ± 0.096
4.021LeuTyr: 4.021 ± 0.149
0.0LeuXaa: 0.0 ± 0.0
Met
0.67MetAla: 0.67 ± 0.064
0.503MetCys: 0.503 ± 0.07
0.335MetAsp: 0.335 ± 0.032
2.848MetGlu: 2.848 ± 0.082
0.503MetPhe: 0.503 ± 0.07
0.838MetGly: 0.838 ± 0.038
0.335MetHis: 0.335 ± 0.032
2.01MetIle: 2.01 ± 0.044
1.843MetLys: 1.843 ± 0.058
1.005MetLeu: 1.005 ± 0.096
0.838MetMet: 0.838 ± 0.038
2.848MetAsn: 2.848 ± 0.082
1.005MetPro: 1.005 ± 0.14
0.503MetGln: 0.503 ± 0.07
0.0MetArg: 0.0 ± 0.0
2.01MetSer: 2.01 ± 0.193
1.34MetThr: 1.34 ± 0.128
0.503MetVal: 0.503 ± 0.07
0.838MetTrp: 0.838 ± 0.038
1.173MetTyr: 1.173 ± 0.006
0.0MetXaa: 0.0 ± 0.0
Asn
1.508AsnAla: 1.508 ± 0.21
2.345AsnCys: 2.345 ± 0.225
4.356AsnAsp: 4.356 ± 0.181
5.696AsnGlu: 5.696 ± 0.545
5.696AsnPhe: 5.696 ± 0.164
3.351AsnGly: 3.351 ± 0.084
2.01AsnHis: 2.01 ± 0.044
5.529AsnIle: 5.529 ± 0.535
6.031AsnLys: 6.031 ± 0.105
8.544AsnLeu: 8.544 ± 0.009
2.178AsnMet: 2.178 ± 0.209
5.026AsnAsn: 5.026 ± 0.937
3.183AsnPro: 3.183 ± 0.286
1.005AsnGln: 1.005 ± 0.377
0.335AsnArg: 0.335 ± 0.032
6.701AsnSer: 6.701 ± 0.777
2.681AsnThr: 2.681 ± 0.216
4.356AsnVal: 4.356 ± 0.056
0.0AsnTrp: 0.0 ± 0.0
3.853AsnTyr: 3.853 ± 0.222
0.0AsnXaa: 0.0 ± 0.0
Pro
1.508ProAla: 1.508 ± 0.21
0.335ProCys: 0.335 ± 0.032
4.021ProAsp: 4.021 ± 0.324
2.513ProGlu: 2.513 ± 0.35
3.686ProPhe: 3.686 ± 0.116
1.843ProGly: 1.843 ± 0.178
0.335ProHis: 0.335 ± 0.032
2.345ProIle: 2.345 ± 0.248
3.351ProLys: 3.351 ± 0.321
2.513ProLeu: 2.513 ± 0.122
0.0ProMet: 0.0 ± 0.0
5.026ProAsn: 5.026 ± 0.464
0.67ProPro: 0.67 ± 0.064
1.508ProGln: 1.508 ± 0.026
1.005ProArg: 1.005 ± 0.096
3.351ProSer: 3.351 ± 0.388
2.178ProThr: 2.178 ± 0.146
1.843ProVal: 1.843 ± 0.058
0.335ProTrp: 0.335 ± 0.032
2.848ProTyr: 2.848 ± 0.082
0.0ProXaa: 0.0 ± 0.0
Gln
3.183GlnAla: 3.183 ± 0.286
0.0GlnCys: 0.0 ± 0.0
0.67GlnAsp: 0.67 ± 0.064
3.686GlnGlu: 3.686 ± 0.353
2.01GlnPhe: 2.01 ± 0.193
1.843GlnGly: 1.843 ± 0.178
0.67GlnHis: 0.67 ± 0.064
3.518GlnIle: 3.518 ± 0.254
2.681GlnLys: 2.681 ± 0.257
1.508GlnLeu: 1.508 ± 0.21
1.34GlnMet: 1.34 ± 0.345
1.34GlnAsn: 1.34 ± 0.345
3.183GlnPro: 3.183 ± 0.05
2.681GlnGln: 2.681 ± 0.925
3.518GlnArg: 3.518 ± 0.018
3.518GlnSer: 3.518 ± 0.018
3.853GlnThr: 3.853 ± 0.251
1.843GlnVal: 1.843 ± 0.058
0.335GlnTrp: 0.335 ± 0.204
2.178GlnTyr: 2.178 ± 0.146
0.0GlnXaa: 0.0 ± 0.0
Arg
1.173ArgAla: 1.173 ± 0.006
0.335ArgCys: 0.335 ± 0.032
1.508ArgAsp: 1.508 ± 0.026
2.178ArgGlu: 2.178 ± 0.146
1.675ArgPhe: 1.675 ± 0.16
0.838ArgGly: 0.838 ± 0.038
0.0ArgHis: 0.0 ± 0.0
2.345ArgIle: 2.345 ± 0.012
5.026ArgLys: 5.026 ± 0.008
3.686ArgLeu: 3.686 ± 0.116
0.168ArgMet: 0.168 ± 0.102
2.345ArgAsn: 2.345 ± 0.225
0.335ArgPro: 0.335 ± 0.032
3.016ArgGln: 3.016 ± 0.184
3.518ArgArg: 3.518 ± 0.219
1.34ArgSer: 1.34 ± 0.108
0.838ArgThr: 0.838 ± 0.038
2.681ArgVal: 2.681 ± 0.02
0.838ArgTrp: 0.838 ± 0.038
2.345ArgTyr: 2.345 ± 0.225
0.0ArgXaa: 0.0 ± 0.0
Ser
2.01SerAla: 2.01 ± 0.28
0.67SerCys: 0.67 ± 0.064
4.523SerAsp: 4.523 ± 0.078
3.351SerGlu: 3.351 ± 0.084
4.356SerPhe: 4.356 ± 0.417
4.356SerGly: 4.356 ± 0.529
1.34SerHis: 1.34 ± 0.128
5.696SerIle: 5.696 ± 0.073
11.392SerLys: 11.392 ± 0.091
4.523SerLeu: 4.523 ± 0.158
1.508SerMet: 1.508 ± 0.026
3.853SerAsn: 3.853 ± 0.459
2.848SerPro: 2.848 ± 0.155
1.005SerGln: 1.005 ± 0.14
2.178SerArg: 2.178 ± 0.146
4.523SerSer: 4.523 ± 0.631
2.681SerThr: 2.681 ± 0.216
5.193SerVal: 5.193 ± 0.094
0.335SerTrp: 0.335 ± 0.032
2.513SerTyr: 2.513 ± 0.114
0.0SerXaa: 0.0 ± 0.0
Thr
3.016ThrAla: 3.016 ± 0.289
0.0ThrCys: 0.0 ± 0.0
3.853ThrAsp: 3.853 ± 0.459
2.848ThrGlu: 2.848 ± 0.082
3.016ThrPhe: 3.016 ± 0.289
2.178ThrGly: 2.178 ± 0.09
1.675ThrHis: 1.675 ± 0.16
5.361ThrIle: 5.361 ± 0.513
2.848ThrLys: 2.848 ± 0.155
4.188ThrLeu: 4.188 ± 0.283
1.34ThrMet: 1.34 ± 0.108
4.021ThrAsn: 4.021 ± 0.324
2.681ThrPro: 2.681 ± 0.453
3.853ThrGln: 3.853 ± 0.222
1.005ThrArg: 1.005 ± 0.14
2.681ThrSer: 2.681 ± 0.216
2.345ThrThr: 2.345 ± 0.248
3.686ThrVal: 3.686 ± 0.116
0.335ThrTrp: 0.335 ± 0.032
1.675ThrTyr: 1.675 ± 0.312
0.0ThrXaa: 0.0 ± 0.0
Val
1.34ValAla: 1.34 ± 0.128
1.675ValCys: 1.675 ± 0.16
2.848ValAsp: 2.848 ± 0.318
3.686ValGlu: 3.686 ± 0.116
3.351ValPhe: 3.351 ± 0.084
2.681ValGly: 2.681 ± 0.216
0.67ValHis: 0.67 ± 0.172
3.351ValIle: 3.351 ± 0.084
3.518ValLys: 3.518 ± 0.018
3.518ValLeu: 3.518 ± 0.219
0.838ValMet: 0.838 ± 0.038
5.026ValAsn: 5.026 ± 0.481
2.178ValPro: 2.178 ± 0.146
3.016ValGln: 3.016 ± 0.289
2.01ValArg: 2.01 ± 0.193
4.356ValSer: 4.356 ± 0.056
2.513ValThr: 2.513 ± 0.122
2.178ValVal: 2.178 ± 0.146
1.173ValTrp: 1.173 ± 0.006
3.016ValTyr: 3.016 ± 0.052
0.0ValXaa: 0.0 ± 0.0
Trp
0.67TrpAla: 0.67 ± 0.064
0.0TrpCys: 0.0 ± 0.0
1.005TrpAsp: 1.005 ± 0.096
0.335TrpGlu: 0.335 ± 0.032
0.335TrpPhe: 0.335 ± 0.032
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.67TrpIle: 0.67 ± 0.064
1.005TrpLys: 1.005 ± 0.096
0.67TrpLeu: 0.67 ± 0.172
0.0TrpMet: 0.0 ± 0.0
1.34TrpAsn: 1.34 ± 0.128
1.173TrpPro: 1.173 ± 0.006
0.335TrpGln: 0.335 ± 0.204
0.503TrpArg: 0.503 ± 0.07
0.503TrpSer: 0.503 ± 0.307
0.503TrpThr: 0.503 ± 0.07
0.503TrpVal: 0.503 ± 0.07
0.0TrpTrp: 0.0 ± 0.0
0.335TrpTyr: 0.335 ± 0.032
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.183TyrAla: 3.183 ± 0.05
0.0TyrCys: 0.0 ± 0.0
2.345TyrAsp: 2.345 ± 0.012
3.183TyrGlu: 3.183 ± 0.05
2.848TyrPhe: 2.848 ± 0.155
3.351TyrGly: 3.351 ± 0.152
0.168TyrHis: 0.168 ± 0.102
5.361TyrIle: 5.361 ± 0.277
4.523TyrLys: 4.523 ± 0.158
1.675TyrLeu: 1.675 ± 0.16
1.005TyrMet: 1.005 ± 0.096
3.351TyrAsn: 3.351 ± 0.388
1.34TyrPro: 1.34 ± 0.108
1.005TyrGln: 1.005 ± 0.14
2.681TyrArg: 2.681 ± 0.02
2.681TyrSer: 2.681 ± 0.02
2.178TyrThr: 2.178 ± 0.383
2.681TyrVal: 2.681 ± 0.02
0.503TyrTrp: 0.503 ± 0.07
2.345TyrTyr: 2.345 ± 0.012
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (5970 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
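The exact procedure is not described here, so the following is only a sketch of one common way to bootstrap at the protein level: resample whole proteins with replacement, recompute the frequency each time, and report the spread of the estimates. It assumes that "×100" means 100 resampling rounds and that the reported error is a standard deviation; both are assumptions, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real SINV-3 sequences.
print(bootstrap_error(["AAKLAA", "KLKLAA"], "AA"))
```

With only 2 proteins, as in this proteome, each resample can only be one of a few combinations, so the resulting error bars are coarse.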

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski