Duy Hiếu's Blog

Database Indexing - Strategies for good indexes and high performance

Hiếu Phạm Duy — Sat, 06 Jan 2024 14:02:36 GMT

In the last post, we walked through the basic concept, data structures, and supported query types of database indexes. Today, I will continue to share strategies for good indexes and high query performance.

Please bear in mind that you must practice these strategies to master them. Let's create your database and try with sample data such as Sakila,...

I. Choose The Right Column

Firstly, please list all queries for that table before deciding which indexes to create.

Secondly, don't create the index for columns with less distinct values (low cardinality).

E.g. status, action, is_active,...

You can check the cardinality figures by SHOW INDEX FROM + your tables

However, if the distribution of your status column is unbalanced => Index still works if you only search by a value with lower distribution.

II. Create Prefix Index

If you need to create an index for a string/varchar column, I believe that you should create a prefix index:

Long enough to work efficiently
Short enough to reduce the index's storage

Example

Let's see the customer table with an index of the last_name column here and calculate the difference between the full-length index and prefix index of the first 6 chars:

Full length: key length = 45 * 4 + 1 (store length) = 181 bytes
Prefix index: key length = 6 * 4 + 1 (store length) = 25 bytes

Yes, the key length is 7 times larger.

How to find a good length

A good way to calculate a good prefix length is by computing the full columns selectivity and trying to make the prefixs selectivity close to that value.

E.g.: I have a city_demo table and I want to create an index for the city column.

Step 1: we target a selectivity near 0.0312 as full-length selectivity.

Step 2: evaluate many different lengths in one query, which is useful on very large tables:

Step 3: This query shows that seven characters are good enough for my prefix index.

III. Composite Index

Sometimes, you need to query by 2 columns in the same query. Please consider creating a composite index instead of an index for each column.

Individual indexes on lots of columns wont help our database improve performance for most queries.

For example, MySQL can cope a little with such poorly indexed tables when it employs a strategy known as index merge, which permits a query to make limited use of multiple indexes from a single table to locate desired rows. It can use both indexes, scanning them simultaneously and merging the results.

However, the algorithm's buffering, sorting, and merging operations use lots of CPU and memory resources. So a composite index should be your first choice.

Another point is When you create a composite index for multiple columns, you can use that index in a query with the first column of the index.

E.g: The index of (username, created_at) can be used for both queries:

Select * from users where username=hieupd and create_at > 2022-01-01
Select * from users where username=hieupd

\==> No need an index for the username column only.

IV. Choose The Order for Composite Index

How?

Did you decide to create a composite index? OK good!

Now how about the order of columns?

I asked the question to at least 30 interviewees, and only one of them could answer.

The answer is:

Choose the column with higher selectivity as the first column (in most cases)

We can check the selectivity of columns by the query count distinct / count *.

Pay Attention!!!

When creating an index (A, B), no need to create an index only for A
Should put your range condition to the right of queries
In B-tree, if the range condition comes first in WHERE, the second column will not be used in that index
E.g.: In the above image, I have a composite B-tree index of (date_of_birth, subsidiary_id)
If I query by:
Select * from uses when date_of_birth between 01-Jan-71 and 09-Jan-71 and subsidiary_id = 27;
\=> Our database only uses the index for the first range condition, and then scans on left nodes for the second condition (subsidiary_id = 27) without index supports.

V. Clustered Index

Definition

Clustered indexes arent a separate type of index. Rather, theyre an approach to data storage.

When a table has a clustered index, its rows are actually stored in the indexs leaf pages.

The leaf pages contain full rows, but the node pages contain only the indexed columns.
You can have only one clustered index per table because you cant store the rows in two places at once.
Clustered indexes normally faster than non-clustered indexes

For instance, in MySQL InnoDB, the clustered index is our primary key. If you dont define a primary key, InnoDB will try to use a unique non-nullable index instead.

Pay Attention!!!

In MySQL InnoDB, choosing the right PK is very important
Should not insert PK values randomly => need to reorder data, increase Disk I/O, disk fragmentation

VI. Covering Index

Indexes need to be designed for the whole query, not just the WHERE clause.

A Covering Index is:

An index that covers all data needed for a query (only B-tree can be used to cover indexes)
An index reduces Disk I/O and latency because no need to look up into disk.

So, please query only what you need, should not query redundant data (SELECT * FROM).

E.g.: In MySQL InnoDB, when you issue a query that is covered by an index (an index-covered query), youll see Using index in the Extra column in EXPLAIN:

VII. Redundant and Duplicate Index

You should analyze your database indexes and remove duplicated and unused indexes. It helps our database to not need to maintain redundant indexes and reduce the impact for INSERT, UPDATE, and DELETE operations.

E.g.: In MySQL InnoDB, The best way to identify unused indexes is with performance_schema and sys:

Notice that if there is an index on (A, B), another index on (A) would be redundant because it is a prefix of the first index.

In most cases, you dont want redundant indexes, and to avoid them you should extend existing indexes rather than add new ones. However, there are times when youll need redundant indexes for performance reasons. Extending an existing index might make it much larger and reduce performance for some queries.

VIII. Partial Index

So far we have only discussed which columns to add to an index. With partial (PostgreSQL) or filtered (SQL Server) indexes you can also specify the rows that are indexed.

A partial index is useful for commonly used where conditions that use constant values - like the status code in the following example:

Queries like this are very common in queuing systems to fetch all unprocessed tasks.

Benefits

Specify the rows that are indexed
Reduce disk space and index size

Pay Attention!!!

Only Oracle, MongoDB, and PostgreSQL support
Very common in queuing systems

IX. Update Index Statistic

Over time, our data and indexes can become fragmented, which might reduce performance. So, we can consider running:

ANALYZE TABLE: calculates statistics for indexes
In MySQL, ANALYZE TABLE returns a result set with the columns shown in the following table.

E.g.: ANALYZE TABLE users

OPTIMIZE TABLE: reorganizes the physical storage of table data and associated index data, to reduce storage space and improve I/O efficiency when accessing the table.
In MySQL, OPTIMIZE TABLE returns a result set with the columns shown in the following table.

For InnoDB tables, OPTIMIZE TABLE is mapped to ALTER TABLE ... FORCE, which rebuilds the table to update index statistics and free unused space in the clustered index.

Pay Attention!!!
- Think twice before acting, especially on your Prod databases.
- Need to be reviewed by database experts because it can cause downtime.

X. No Index

Last but not least, indexing is not a silver bullet. Sometimes, you get stuck improving your index performance. Please stay calm and think about alternative solutions rather than trying hard with indexing, such as:

Give up :D
Use suitable databases for your requirements
E.g.: To support aggregate queries, please use OLAP databases instead of OLTPs.
Build temp tables for heavy queries
Partition your tables, and archive data frequently (based on data retention)
Combine multiple databases
Apply CDC (Change Data Capture) to replicate data and scale your read performance.

XI. Conclusion

We just covered 10 strategies for effective indexing and achieving high query performance. I hope this article is useful for anyone working with indexes.

Bear in mind that each database and its storage engines have different index implementations and support different index types. Therefore, be mindful of the database type, storage engine, and version you are using before creating/optimizing your indexes.

XII. References

Database Indexing - Data Structures & Supported Queries

Hiếu Phạm Duy — Mon, 25 Dec 2023 14:10:29 GMT

I will tell you a fact that

Most of the hard issues come from databases when your system grows.

The bigger your system, the harder your database issues. Index, schema design, partition, and replication,... are important techniques to work with databases.

In particular, mastering indexing is a MUST for backend engineers.

I. Introduction

Indexes are data structures that storage engines use to find rows quickly. This is the most powerful way to improve query performance.

Store in a separate space with source data
Index Creation does not change the source data
Databases create a copy of the column that we create index and link to the original data
Index is implemented at storage engine layer
Each database, storage engine supports different index types

A backend engineer needs to master database indexing, or someday they headaches with database issues when their system grows x5 x10.

Benefits

Improve the search performance of SELECT
Reduce servers workload and I/O of disk

Notes: Index also works for UPDATE queries with WHERE when using indexes to look up the record and update

E.g. UPDATE WHERE id = 1

Limitation

More space for index (disk, memory)
INSERT, UPDATE, DELETE queries are slower due to index maintenance
Too many indexes cause a slowdown in the database server
Confuses when the optimizer selects an execution strategy

II. Data Structure of Index

There are a lot of index types. However, in terms of structure, we have 2 most common: B-tree and Hash. So I will focus on these two types in this post.

1. B-tree index

This is a Balanced Tree (not a binary tree). Each node contains N values and pointers to the below nodes. Leaf nodes values are sorted and each leaf node contains a pointer to the original table.

However, in modern databases such as MySQL and PostgreSQL, when we talk about B-tree, it actually is a B+ tree.

In B+ tree, leaf nodes are connected by a DOUBLE LINKED LIST. So it works more efficiently than B-tree in the case of range queries.

Each leaf node is stored in a block (page) the smallest unit of the database. Each storage engine has a different block size.

E.g. MySQL (InnoDB) has a default block size of index = 16kB.

a. Searching on B+-tree

The below picture shows an index fragment to illustrate a search for the key "57". The tree traversal starts at the root node on the left-hand side. Each entry is processed in ascending order until a value is greater than or equal to (>=) the search term (57). Then the database follows the reference to the corresponding branch node and repeats the procedure until the tree traversal reaches a leaf node.

From leaf nodes, we have connections to the table data. Unlike the index, the table data is not sorted at all.

b. Supported Query Types

As I said, each database or storage engine supports different index types. So within the scope of this post, let's assume that we are working with MySQL + InnoDB engine.

we have a composite index of 2 columns employee_id and subsidiary_id.

Match full value
Your database will use your index if you query by full values (=)
E.g.
- WHERE employee_id = 1
- WHERE employee_id = 123 and subsidiary_id = 20
Match leftmost prefix
Your database can use a composite index when searching with the leading (leftmost) columns. An index with three columns can be used when searching for the first column, when searching with the first two columns together, and when searching using all columns.
E.g.
- OK: WHERE employee_id = 1
- NOT OK: WHERE subsidiary_id = 20
Match range condition
Your database will use your index if you query by a range condition on the first column.
E.g.
- OK: WHERE employee_id >= 123 and employee_id < 125
- NOT OK: WHERE subsidiary_id > 20
Match equal in the left and range in the right column
E.g.
- OK : WHERE employee_id = 123 and subsidiary_id > 20
- NOT OK : WHERE employee_id > 123 and subsidiary_id = 20
  \==> Only use index for employee_id search.
LIKE Operator (leftmost)
LIKE filters can only use the characters before the first wildcard during tree traversal. The remaining characters are just filter predicates that do not narrow the scanned index range.
The more selective the prefix before the first wildcard is, the smaller the scanned index range becomes.

E.g.
- OK: WHERE name LIKE WI%ND
- NOT OK: WHERE name LIKE %WIND%
Covering Indexes
The index covers the entire query so it is also called a covering index. It prevents table access and runs pretty fast.
E.g. SELECT subsidiary_id FROM Employee WHERE employee_id > 123
Index Merge
There are queries where a single index cannot do a perfect job.
E.g. queries with two or more independent range conditions

You can of course accept the filter predicate and use a multi-column index nevertheless. That is the best solution in many cases anyway.
But what I want to say here is we have another option: two separate indexes, one for each column. Then the database must scan both indexes first and then combine the results.
Sorting/Group By
Your database will use your index in case of ordering or grouping because index data is sorted.
E.g. Select id, name FROM users ORDER BY id DESC limit 10
Function Indexes
You must use the same function of your query to create the index.
E.g.
Your query is:
SELECT id, fullname FROM users WHERE UPPER(fullname) = UPPER(Pham Duy Hieu)
\=> You need to create an index with the UPPER function:
CREATE INDEX idx_name on users (UPPER(fullname))

c. Index with NULL values

When we create the index for Nullable columns, MySQL needs 1 byte to detect null/non-null values. Besides, optimizing queries (internal) in MySQL can be more challenging.

E.g. I have a users table

You can check the key length in Explain result:

explain select * from users where departure_id = 1;

You can see that departure_id is interger 4 bytes, right? However, the real key length is 5 bytes due to 1 byte for its length.

So you should set NOT NULL for your columns whenever you can. Even when you do need to store a no value in a table, you might not need to use NULL. Perhaps you can use zero, a special value, or an empty string instead.

However, dont be too afraid of using NULL when you need to represent an unknown value. In some cases, its better to use NULL than a magical constant.

2. Hash Index

The index type is based on the Hashing technique to build a Hash Table and store indexed data. However, it is only supported by MySQL (Memory), Oracle and PostgreSQL.

If you are using MySQL InnoDB and you create a HASH index, InnoDB silently changes HASH to Btree.

Here are some points of a hash index:

Good performance with "="
Don't support range queries (>, <, >=, <=) and sorting, group by
Hash Collision: should not be used for columns with many duplicated values
More space for hashing values (than B-tree)

We have just walked through the most common data structures for indexing and the supported query types accordingly. In the next post, I will share common indexing strategies allowing you to start practicing on your own. See youuuuu!

(continued)

IV. References

Spring Boot 3 And Java 17 Migration Guide

Hiếu Phạm Duy — Thu, 16 Nov 2023 17:24:53 GMT

Spring Boot 3.0 is a new major release that offers new features and improvements. However, it requires Java 17 as a minimum version and comes with numerous compatibility issues if you intend to upgrade.

I. Pros and Cons

You need to analyze the pros and cons of our migration. In my experience, there are some points you should review:

1. Pros

Java 17 Baseline

Youll need to upgrade to JDK 17 before you can develop Spring Boot 3.0 applications. This means you can take advantage of the latest features and performance improvements that Java 17 offers.

GraalVM Native Image Support

GraalVM Native Images provide a new way to deploy and run Java applications. It provides various advantages, like an instant startup and reduced memory consumption (pain points of Spring Boot apps).

Improved observability with Micrometer and Micrometer Tracing

You can check more details here.

2. Cons

Time/Resouces constraints

Migrating to a new major release takes time and resources, especially for testing. This migration affects all your flows so needs to be tested carefully. While you can update your code within a few days, please plan for testing to span more than a week (the duration depends on the size of your project).

Risk of new bugs

As mentioned earlier, the migration affects all your flows. Therefore, if your test coverage doesn't cover all your code, please be careful. Test your end-to-end flows and scrutinize the logs for any new exceptions or discrepancies compared to the previous state.

II. Before we start

If youre currently running with an earlier version of Spring Boot, I recommend that you upgrade to Spring Boot 2.7 before migrating to Spring Boot 3.0. It minimizes compatibility issues as much as possible.

Review Dependencies

You can review your dependencies and dependency management for 3.x to assess how your project is affected.

For dependencies that are not managed by Spring Boot, you can identify the compatible version before upgrading.

Review Deprecations

Classes, methods and properties that were deprecated in Spring Boot 2.x have been removed in this release. Prior to upgrading, please ensure that you are not calling any deprecated methods.

III. Migrate to Spring Boot 3 and Java 17

1. Spring Boot Template Project

I will use my project as an example for you guys to share how I migrated from Spring Boot 2.7 (Java 11) to Spring Boot 3 and Java 17.

Github: https://github.com/hieubz/spring-boot-template-project

This project includes the implementation of common backend features, designed to assist both myself and other Spring Boot developers in coding more efficiently. For further details, you can read more here.

2 . Migration Steps

Configuration Properties Migration

Let's add the migrator by adding the following to your Maven pom.xml:

<dependency>    <groupId>org.springframework.bootgroupId>    <artifactId>spring-boot-properties-migratorartifactId>    <scope>runtimescope>dependency>

This will analyze your applications environment and print diagnostics at startup console logs. Then you can based on that update your properties accordingly.

Update Dependencies

We start with the parent pom spring-boot-starter-parent and Java version

    <parent>        <groupId>org.springframework.bootgroupId>        <artifactId>spring-boot-starter-parentartifactId>        <version>3.1.5version>        <relativePath/>     parent>    <properties>        <java.version>17java.version>    properties>

Tips: We shouldn't specify the version of Spring Data JPA, Spring Web, Spring Data Redis,... because their compatible versions are already declared in the parent POM.

        <dependency>            <groupId>org.springframework.bootgroupId>            <artifactId>spring-boot-starter-data-jpaartifactId>        dependency>        <dependency>            <groupId>org.springframework.bootgroupId>            <artifactId>spring-boot-starter-webartifactId>        dependency>

If you are working with MySQL, let's replace your mysql-connector-java by:

        <dependency>            <groupId>com.mysqlgroupId>            <artifactId>mysql-connector-jartifactId>            <version>8.1.0version>        dependency>

If you are using logback for logging, let's update it to v1.4.11

        <dependency>            <groupId>ch.qos.logbackgroupId>            <artifactId>logback-classicartifactId>            <version>1.4.11version>        dependency>

If you are using Openfeign for integrations, let's update it to v4.x

        <dependency>            <groupId>org.springframework.cloudgroupId>            <artifactId>spring-cloud-starter-openfeignartifactId>            <version>4.0.4version>        dependency>

If you are using Spring JDBC, let's update it to v6.x

        <dependency>            <groupId>org.springframeworkgroupId>            <artifactId>spring-jdbcartifactId>            <version>6.0.12version>        dependency>

If you are using Jackson for data binding, let's update it to v2.15.x

        <dependency>            <groupId>com.fasterxml.jackson.coregroupId>            <artifactId>jackson-databindartifactId>            <version>2.15.3version>        dependency>                <dependency>            <groupId>com.fasterxml.jackson.datatypegroupId>            <artifactId>jackson-datatype-jsr310artifactId>            <version>2.15.3version>        dependency>

If you are using Redisson as a Redis client, let's update it to v3.24.x

        <dependency>            <groupId>org.redissongroupId>            <artifactId>redisson-spring-boot-starterartifactId>            <version>3.24.3version>        dependency>        <dependency>            <groupId>org.redissongroupId>            <artifactId>redisson-spring-data-31artifactId>            <version>3.24.3version>        dependency>

You should use the default spring-boot-starter-test for unit testing. However, If you are customizing mockito-core, let's update it to v5.3.x

        <dependency>            <groupId>org.mockitogroupId>            <artifactId>mockito-coreartifactId>            <version>5.3.1version>            <scope>testscope>        dependency>

If you are using jjwt for authentication, let's update it to 0.12.x

        <dependency>            <groupId>io.jsonwebtokengroupId>            <artifactId>jjwtartifactId>            <version>0.12.3version>        dependency>

If you are using Springdoc for API documentation, let's replace your springdoc-openapi-ui by:

        <dependency>            <groupId>org.springdocgroupId>            <artifactId>springdoc-openapi-starter-webmvc-uiartifactId>            <version>2.2.0version>        dependency>

If you are using spring-kafka as your Kafka client, let's update it to v3.0.x

        <dependency>            <groupId>org.springframework.kafkagroupId>            <artifactId>spring-kafkaartifactId>            <version>3.0.10version>        dependency>

Rebuild and change the code

After updating your dependencies, let's rebuild your code first and check for any issues
```
  mvn clean package
```
Spring Boot 3.0 has migrated from Java EE to Jakarta EE APIs for all dependencies. So you should face the javax issue in your first build:

Then you just need to replace all javax in your imports by jakarta (should use Ctrl+Shift+R to replace on IntelliJ)
If the Spring migrator is working, you can see a WARNING log in the startup console:

Simply update as suggested.
In Hibernate 6, you can use MySQLDialect for all MySQL versions (MySQL5Dialect, MySQL8Dialect have been deprecated)

spring.jpa.properties.hibernate.dialect=org.hibernate.dialect.MySQLDialect

JPA SpringPhysicalNamingStrategy is replaced by CamelCaseToUnderscoresNamingStrategy

org.springframework.boot.orm.jpa.hibernate.SpringPhysicalNamingStrategy==> org.hibernate.boot.model.naming.CamelCaseToUnderscoresNamingStrategy

Spring Security is not working if you are using WebSecurityConfigurerAdapter (deprecated). Besides, in v3.x, you have to use lambda to configure filterChain.

So you have to remove the extension of WebSecurityConfigurerAdapter class.

// Now@Configuration@EnableWebSecurity@EnableGlobalMethodSecurity(prePostEnabled = true)public class WebSecurityConfig extends WebSecurityConfigurerAdapter {}// Then without extension@Configuration@EnableWebSecurity@EnableMethodSecuritypublic class WebSecurityConfig {}

Then do several updates for AuthenticationManager bean creation:

// Now with the class extends WebSecurityConfigurerAdapter  @Bean  @Override  public AuthenticationManager authenticationManagerBean() throws Exception {    // Get AuthenticationManager bean    return super.authenticationManagerBean();  }// Then without extension  @Bean  public AuthenticationManager authenticationManager(AuthenticationConfiguration authConfig) throws Exception {    return authConfig.getAuthenticationManager();  }

And SecurityFilterChain:

/** We have to use Lambda for SecurityFilterChain configuration **/// Now with the class extends WebSecurityConfigurerAdapter  @Override  protected void configure(HttpSecurity http) throws Exception {      http.cors().and().csrf().disable();      http.authorizeRequests().antMatchers(AUTH_WHITELIST).permitAll().anyRequest().authenticated();      http.sessionManagement().sessionCreationPolicy(SessionCreationPolicy.STATELESS);      http.exceptionHandling().authenticationEntryPoint(authEntryPointJwt);      http.addFilterBefore(jwtAuthenticationFilter(), UsernamePasswordAuthenticationFilter.class);  }// Then with lambda  @Bean  protected SecurityFilterChain configure(HttpSecurity http) throws Exception {      http.cors(AbstractHttpConfigurer::disable).csrf(AbstractHttpConfigurer::disable);      http.authorizeHttpRequests(auth -> auth.requestMatchers(AUTH_WHITELIST).permitAll().anyRequest().authenticated());      http.sessionManagement(s -> s.sessionCreationPolicy(SessionCreationPolicy.STATELESS));      http.exceptionHandling(ex -> ex.authenticationEntryPoint(authEntryPointJwt));      http.addFilterBefore(jwtAuthenticationFilter(), UsernamePasswordAuthenticationFilter.class);      return http.build();  }

Powermock is not working with Spring Boot 3 and JDK 17. Its latest update is in Feb 2022. So you need to move to Mockito if you are using Powermock.
The jjwt library updates its API, so we update our code:

// Now  public String generateJwtToken(Long userId) {    return Jwts.builder()        .setSubject(userId.toString())        .setIssuedAt(new Date())        .setExpiration(new Date(new Date().getTime() + jwtExpirationMs))        .signWith(SignatureAlgorithm.HS256, jwtSecret.getBytes(StandardCharsets.UTF_8))        .compact();  }// Then  public String generateJwtToken(Long userId) {    return Jwts.builder()        .subject(userId.toString())        .issuedAt(new Date())        .expiration(new Date(new Date().getTime() + jwtExpirationMs))        .signWith(Keys.hmacShaKeyFor(jwtSecret.getBytes(StandardCharsets.UTF_8)), Jwts.SIG.HS256)        .compact();  }

And the JWT parser:

// Nowpublic Claims getJwtTokenClaim(String jwt) {   return Jwts.parser()        .setSigningKey(jwtSecret.getBytes(StandardCharsets.UTF_8))        .parseClaimsJws(jwt)        .getBody();}// Thenpublic Claims getJwtTokenClaim(String jwt) {    return Jwts.parser()        .verifyWith(Keys.hmacShaKeyFor(jwtSecret.getBytes(StandardCharsets.UTF_8)))        .build()        .parseSignedClaims(jwt)        .getPayload();}

Finally, rebuild and check for any issues. In my project, there are no issues left, so I stop updating the code here.

Testing

Since this is a major upgrade, carefully test all APIs for discrepancies or exceptions. Review them one by one and monitor logs.

IV. Conclusion

We just completed the migration to Spring Boot 3 and JDK 17. There are many issues during this process, so stay calm and get things done :))). I hope this is helpful for anyone planning the migration.

Hieu Pham.

Câu Chuyện Về Con Cá Chình

Hiếu Phạm Duy — Sun, 06 Aug 2023 02:56:49 GMT

Thi c i, ng dn Nht Bn ra bin bt c chnh, v thuyn nh, khi tr v n b c chnh cht ht.

Nhng c mt ng dn, mi ln anh ch c v chng u cn sng. V th c ca anh bn c gi cao gp i ngi khc. My nm sau, ngi ng dn ny tr thnh mt ph ng giu c vang danh gn xa. n khi bnh nng khng th ra bin c na, ngi ng dn mi em b mt ca mnh ni li vi con trai v b quyt gip c chnh khng cht:

Trong khoang thuyn cha y c chnh, ng b mt con c nheo vo . Trong t nhin, c chnh lun nh nhau vi nheo.

chng li nhng t cng kch ca c nheo, c chnh buc phi c gng nghnh chin. Trong tnh trng u tranh nh vy, bn nng sng ca c chnh s c huy ng ti a, cho nn n vn cn sng khi vo n b.

Ngi ng dn cn ni vi con trai, nguyn nhn khin c chnh cht l v chng bit chng b bt, trc mt chng ch c ci cht, hy vng sng b dp tt, cho nn trong khoang khng c bao lu th chng u cht ht.

Cui cng bc ng dn khuyn cc con, phi dng cm u tranh, ch c u tranh, cuc sng mi trn y nim tin v hy vng.

Nht Bn, khi tr con va mi hiu chuyn, cu chuyn u tin m cha m k cho chng chnh l cu chuyn v c chnh.

Tt c nhng a tr Nht Bn t nh c truyn nim tin: Ch c dng cm chin u, mi c c thnh cng v hy vng!

(Su tm)

Best Practices in Java

Hiếu Phạm Duy — Sun, 19 Mar 2023 08:04:58 GMT

I'd like to share some best practices to help you effectively use the Java programming language and its fundamental libraries.

This article has 4 parts:

General programming: how to use variables, control structures, libraries, data types,... effectively.
Lambdas and Streams: how to make the best use of functional interfaces, lambdas, and method references.
Exceptions: guidelines for using exceptions effectively.
Concurrency: write clear and correct concurrent programs.

Ok, let's go!!!

I. General Programming

a. Prefer for-each loops to traditional for loops

for-each loops get rid of the clutter and the opportunity for error by hiding the iterator or index variable. There is no performance penalty for using for-each

However, there are some common situations where you cant use for-each:

need the array index in order to do something
need to traverse multiple collections in parallel, then you need explicit control over the iterator or index variable

b. Prefer primitive types to boxed primitives

Primitives are more time and space efficient than boxed primitives
Applying the == operator to boxed primitives is almost always wrong => be careful when comparing boxed primitives.
When you mix primitives and boxed primitives in an operation, the boxed primitive is auto-unboxed. If a null object reference is auto-unboxed, you get a NullPointerException
Repeatedly boxed and unboxed causing performance degradation.

c. Avoid Strings where other types are more appropriate

If its numeric, it should be translated into the appropriate numeric type, such as int, long, float, or BigInteger
If its the answer to a yes-or-no question, it should be translated into an appropriate enum type or a boolean

d. Beware the performance of string concatenation

Strings are immutable, so when two strings are concatenated, the contents of both are copied and Java creates a new String

To achieve acceptable performance, use a StringBuilder in place of a String

e. Refer to objects by their interfaces

If appropriate interface types exist, then parameters, return values, variables, and fields should all be declared using interface types

Your program will be much more flexible to switch implementations. However, it is entirely appropriate to refer to an object by a class rather than an interface if no appropriate interface exists. If there is no appropriate interface, just use the least specific class in the class hierarchy that provides the required functionality**.**

II. Lambdas and Streams

a. Prefer lambdas to anonymous classes

In Java 8, the language formalized the notion that interfaces with a single abstract method are special and deserve special treatment. These interfaces are now known as functional interfaces, and the language allows you to create instances of these interfaces using lambda expressions, or lambdas for short.

However, lambdas lack names and documentation. If a computation isnt self-explanatory or exceeds a few lines, dont put it in a lambda. One line is ideal for lambda, and three lines are a reasonable maximum.

b. Prefer method references to lambdas

Java provides a way to generate function objects even more succinct than lambdas: method references

should become:

However, where method references are shorter and clearer, use them; where they arent, stick with lambdas.

c. Favor the use of standard functional interfaces

If one of the standard functional interfaces does the job, you should generally use it in preference to a purpose-built functional interface.

Of course you need to write your own if none of the standard ones does what you need, for example if you require a predicate that takes three parameters.

d. Use streams judiciously

Overusing streams makes programs hard to read and maintain.
In the absence of explicit types, careful naming of lambda parameters is essential to the readability of stream pipelines.
Using helper methods is important for readability in stream pipelines.

f. Prefer Collection to Stream as a return type

When writing a method that returns a sequence of elements, remember that some of your users may want to process them as a stream while others may want to iterate over them

\=> Collection or an appropriate subtype is generally the best return type

g. Use caution when making streams parallel

Java 8 introduced streams, which can be parallelized with a single call to the parallel method.

Parallelizing a pipeline is unlikely to increase its performance if the source is from Stream.iterate, or the intermediate operation limit is used
Performance gains from parallelism are best on streams over ArrayList, HashMap, HashSet, and ConcurrentHashMap instances; arrays; int ranges; and long ranges. What these data structures have in common is that they can all be accurately and cheaply split into subranges of any desired sizes, which makes it easy to divide work among parallel threads
Do not even attempt to parallelize a stream pipeline unless you have good reason to believe that it will preserve the correctness of the computation and increase its speed

III. Exceptions

a. Use checked exceptions for recoverable conditions and runtime exceptions for programming errors

Use checked exceptions for conditions from which the caller can reasonably be expected to recover.
Ex: For example, suppose a checked exception is thrown when an attempt to make a purchase with a gift card fails due to insufficient funds. The exception should provide an accessor method to query the amount of the shortfall. This will enable the caller to relay the amount to the shopper.
If a program throws an unchecked exception or an error, it is generally the case that recovery is impossible and continued execution would do more harm than good.
Ex: ArrayIndexOutOfBoundsException, NullPointerException
If it isnt clear whether recovery is possible, youre probably better off using an unchecked exception

b. Avoid unnecessary use of checked exceptions

Overuse of checked exceptions places a burden on the user of the API
If callers wont be able to recover from failures, throw unchecked exceptions. If recovery may be possible and you want to force callers to handle exceptional conditions, first consider returning an optional.

However, if optional provides insufficient information in the case of failure, you should throw a checked exception.

c. Favor the use of standard exceptions

The Java libraries provide a set of exceptions that cover most of the exception-throwing needs of most APIs.

This table summarizes the most commonly reused exceptions:

Do not reuse Exception, RuntimeException, Throwable, or Error directly.

d. Throw exceptions appropriate to the abstraction

Higher layers should catch lower-level exceptions and, in their place, throw exceptions that can be explained in terms of the higher-level abstraction (exception translation)

e. Include failure-capture information in detail messages

To capture a failure, the detail message of an exception should contain the values of all parameters and fields that contributed to the exception.

Ex: The detail message of an IndexOutOfBoundsException should contain the lower bound*, the* upper bound*, and the* index value that failed to lie between the bounds

Do not include passwords, encryption keys,... in detail messages

f. Strive for failure atomicity

A failed method invocation should leave the object in the state that it was in prior to the invocation. There are several ways to achieve this effect:

Design immutable objects
For methods that operate on mutable objects ==> check parameters for validity before performing the operation (exceptions will be thrown before object modification commences)

g. Don't ignore exceptions

When the designers of an API declare a method to throw an exception, they are trying to tell you something. Dont ignore it!

If you choose to ignore an exception, the catch block should contain a comment explaining why it is appropriate to do so, and the variable should be named ignored

IV. Concurrency

a. Synchronize access to shared mutable data

The synchronized keyword ensures that only a single thread can execute a method or block at one time.

when multiple threads share mutable data, each thread that reads or writes the data must perform synchronization

without synchronization, there is no guarantee that one threads changes will be visible to another thread
volatile modifier performs no mutual exclusion, but guarantees that any thread that reads the field will see the most recently written value

However, you have to be careful when using volatile. For example, in the following method, the problem is that the increment operator (++) is not atomic. It performs two operations (volatile still works if performs one operation) on the nextSerialNumber field: first it reads the value, and then it writes back a new value:

If a second thread reads the field between the time a thread reads the old value and writes back a new one, the second thread will see the same value as the first and return the same serial number ==> computes the wrong results.

One way to fix it is to use AtomicLong.

The best way to avoid the problems discussed is not to share mutable data.

b. Avoid excessive synchronization

Excessive synchronization can cause reduced performance, deadlock,...
Inside a synchronized region, do not invoke a method that is designed to be overridden, or one provided by a client in the form of a function object
\=> has no knowledge of what the method does and has no control over it.

There are some ways to move the alien method invocations out of the synchronized block:

taking a snapshot of shared mutable data => safely traversed without a lock
use concurrent collections*: CopyOnWriteArrayList*, ConcurrentHashMap,...

As a rule, you should do as little work as possible inside synchronized regions! If you must perform some time-consuming activity, find a way to move it out of the synchronized region.

If you are writing a mutable class, you have two options: you can omit all synchronization and allow the client to synchronize externally if concurrent use is desired, or you can synchronize internally, making the class thread-safe

c. Prefer executors, tasks, and streams to threads

Instead of creating and managing threads manually, you can create a thread pool with a fixed or variable number of threads. I prefer ThreadPoolExecutor or ThreadPoolTaskExecutor (in SpringBoot) for a heavily loaded production server.

d. Prefer concurrency utilities to wait and notify

Instead of using wait and notify, you should use the higher-level concurrency utilities in java.util.concurrent package: Executor, concurrent collections and synchronizers.

use ConcurrentHashMap in preference to Collections.synchronizedMap
For interval timing, use System.nanoTime rather than System.currentTimeMillis
(more accurate and more precise and is unaffected by adjustments to the systems real-time clock)

If you have to maintain legacy code that uses wait and notify, always use the wait loop idiom (inside a synchronized region) to invoke the wait method; never invoke it outside of a loop:

e. Use lazy initialization judiciously

The best advice for lazy initialization is dont do it unless you need to. It decreases the cost of initializing a class or creating an instance, at the expense of increasing the cost of accessing the lazily initialized field.

Under most circumstances, normal initialization is preferable to lazy initialization.

If you must initialize a field lazily in order to achieve your performance goals or break a harmful initialization circularity:

For instance fields, use double-check idiom (singleton pattern)
For static fields, use lazy initialization holder class idiom

f. Don't depend on the thread scheduler

When many threads are runnable, the thread scheduler determines which ones get to run and for how long. Any reasonable operating system will try to make this determination fairly, but the policy can vary. Therefore, well-written programs shouldnt depend on the details of this policy.

The best way to write a robust, responsive, portable program is to ensure that the average number of runnable threads is not significantly greater than the number of processors. This leaves the thread scheduler with little choice: it simply runs the runnable threads till theyre no longer runnable. Note that the number of runnable threads isnt the same as the total number of threads, which can be much higher. Threads that are waiting are not runnable.

In terms of the Executor Framework, this means sizing thread pools appropriately and keeping tasks short, but not too short, or dispatching overhead will harm performance.

Phewww!!! I've just shared with you guys some important tips and tricks to work with Java efficiently. If you find it helpful, please share it with everyone to join hands in building a stronger tech community in Vietnam.

References

Effective Java 3rd Edition

Spring Boot Template Project Introduction

Hiếu Phạm Duy — Sun, 12 Feb 2023 11:20:52 GMT

Whenever I need to implement a new feature, I typically start by researching available solutions on Google. Then I will experiment with one or more solutions to select the best one.

However, after several months, I might forget how to implement that feature. Consequently, I end up having to revisit internet resources, conduct research once again, and reattempt the implementation.

The above situation is very common, not just for me but also among my friends and colleagues. We waste a lot of time recalling and researching solutions - things could be synthesized in a template project for the next implementation.

So I have developed this project which includes implementations designed to assist both myself and other Spring Boot developers in seamlessly diving into coding. Each commit within the project represents the implementation of a specific technique.

Link: https://github.com/hieubz/spring-boot-template-project

Tech stack: Java 11, Spring Boot 2.7.x

Every time I need to implement a feature in this list, I can search by feature keywords (such as Redis, retry, email,...) to find the corresponding commit, then apply the code of that commit for my feature. No need to search and remember boilerplate code anymore!

We will have basic features here:

or advanced things like:

There are also modules related to Unit Test, Monitoring:

I hope this project helps Spring Boot developers increase their productivity, allowing them to allocate more time to other important activities.

I plan to upgrade to Spring Boot 3 and Java 17 in the near future.

See yaaa!

Review sách Dev UP

Hiếu Phạm Duy — Sun, 12 Feb 2023 11:17:34 GMT

y l mt cun sch kh hay, ton din v nhiu kha cnh trong i sng lp trnh vin. Tc gi l anh Nguyn Hin - mt lp trnh vin, nh qun l, lnh o, t vn lu nm. Ni dung c trnh by kh gin d nhng gn gi, l c kt kinh nghim nhiu nm ca tc gi.

Cun sch c chia lm 5 phn: Nhng th lng nan, th nghim, nh gi, hc tp, thc thi. Nm phn cng chnh l 5 bc trong m hnh vng lp DevUP m tc gi xut. Ni nm na n l vic bn la chn mt vic g , th nghim v thc thi ri rt ra bi hc, sau c tip tc vng lp hng ngy ngy mt tin b.

Experiment - Valuation - Unlearn - Performance Vng lp 4 bc ny s gip bn lin tc phi vn ng, hc hi ci mi, rn luyn k nng, m rng cc mi quan h vi cng ng trong v ngoi ngnh IT. Mi ln i ht mt cung trn, chnh l mt ln LTV nng cp c trnh , k nng ca mnh. N c th tn 1 thng, cng c th ch 1 tun.

Bn cng s bit cch nh gi ROI - Return Of Investment mi khi a ra quyt nh.

VD: bn c nn tng Java 8 ri th gia vic hc thm Java 11 v hc mi Golang, ci no c ROI cao hn? Ci ty thuc vo hon cnh ca bn. Nu cng ty bn chung Java v sp ti c xu hng s dng Java 11, th ng nhin vic hc Java 11 c ROI cao hn. Nhng nu bn c mong mun chuyn sang mi trng mi v Go l ngn ng c chung cc cng ty , th u t vo Go thi im hin ti l mt la chn ng n. Cn nu bn thy Return t 2 vic trn l nh nhau, bn nn hc Java 11 v n gin Investment cho vic hc Java 11 t hn, do ROI tt hn.

V mt iu na l mt chuyn mn gii phi i km vi kh nng nhn ton cnh th mi mang li kt qu. Nu khng, c th mt phn mm hon ho c sinh ra nhng khng c ngi dng.

Trn y l mt vi nhn xt v cun sch. Gi l ti chi tit tng phn m mnh tm tt li theo hiu ca mnh, sau cn c li.

I. Hiu nhng th lng nan

Cuc sng lun i km vi nhng hon cnh o le, khng r trng en, i hi bn phi ra quyt nh. ng hay sai ? cn ty vo gc nhn.

Bn thn mnh cng tng ng trc nhiu ng ba ng: i lm Dev hay BA, i ht mi ti kim tin, vui nhng bp bnh hay theo ngnh IT chn.

y l i vi nh hng ngh nghip. Cn ch ring trong ngnh IT cng c hng chc th lng nan c tc gi a ra nh gi.

Chung quy li, ng trc nhng la chn, tc gi khuyn cc LTV nn cn nhc c-mt trong di hn thay v nhn vo nhng th trc mt nh lng bng hay v name ca mt cng ty. Lm vic nc ngoi v lu di cha hn phi l tt, hay li trong nc cng khng hn l tt hu. Tc gi ch ra u/nhc im ca tng la chn, cho chng ta ci nhn tng tri ca mt ngi lu nm trong ngh.

i hay , tp on ln hay startup, i su hay i rng, lng tng bao nhiu l ,... ty vic bn mun g, mun t c g trong 5-10 nm ti. Mc tiu di hn s nh hng ti nhng quyt nh ca bn trong hin ti.

An "easy" short-term decision may introduce long-term problems- Noam Wasserman -

II. Th Nghim

Khi trong th lng nan, LTV ch cn cch la chn theo nhng g linh tnh mch bo (ci ny l ti thm vo :D). V nu c phn vn th cng vn c nm . Chi bng ta c chn v th nghim.

Hy th nghim mt thit k, cng c, hay cng ngh trong nhng pet project ca ring mnh. Khi tm nh hng ca bn trong t chc tng ln, hy mnh dn xut nhng kt qu t nhng th nghim c nhn.

Theo kinh nghim ca ti, bn s khng th tri nghim nhng cng ngh, d n, domain m bn mun nu ch lm nhng g c giao. Hy mnh dn xin chuyn team nu thc s mun tri nghim v tr khc. V chm ch th nghim trong nhng pet project ca bn.

III. nh gi

Sau khi th nghim th cng phi nh gi kt qu ch nh: ng hay sai, mnh ang ng u, hc c iu g? V sau nhng th nghim v nh gi , con ng ca bn ngy cng r rng, hoc ch t bn cng c thm d kin cho nhng quyt nh sp ti.

nh gi xem mnh ang Junior hay Senior, la chn no c ROI tt hn, gc nhn ca mnh ton cnh cha, phn hi t ng nghip hay cp trn nh th no?

IV. Hc tp

Sau khi bit mnh ang u, cn thiu g, th l lc bn cn b sung kin thc, k nng cn thiu hoc cha hon chnh. Hc tp cng c ngha l sp t, cu trc li h thng kin thc c nhm tip thu nhng kin thc mi. Nm na l unlearn, xem li nhng g mnh hc v t b nu chng khng cn ng n.

Tc gi a ra kin v nhng iu tng chng nh l chn l, thng nh cm ba i vi LTV. T chuyn t ra lp d, hay tr hn, tm l kiu g cng c bug, t tng i hi yu cu hay quy trnh phi r rng,... cho ti t tng LTV ch lm ti 40 tui. Nhng li suy ngh ny tng nh l bnh thng nhng li tr nn bt thng hoc sai trong thi im hin ti.

Ngay bn thn mnh cng tng ngh nh vy khi cn i hc. Cho ti khi i lm, gp nhiu ngi anh/ch rt gii trong ngh, mnh mi thy nhng iu nhiu LTV hay chp ming cho qua hay coi l hin nhin thc cht ang khin h tr nn xu xa, d di, thiu chuyn nghip, v xa hn l b tt li pha sau.

ng c bm vo nhng iu xa c. Hy hc cch th thch mi th, t phong cch code, cng ngh, quy trnh, thm ch, n cch thc xy dng h thng phn mm.

V. Thc thi

C lm th mi c n, khng lm m i c n th ...

y, hc chn r th phi thc hnh, hnh ng hng ngy mt cch ng n. chnh l lc LTV chuyn ha nhng th nghim thnh mt t duy, k nng bn vng ca bn thn, thm ch nh mt thi quen hay phn x v iu kin.

V gi tr ca mt LTV sau cng s c o m bng gi tr kt qu m anh ta mang li trong cng vic, cho t chc, cho cng ng.

Do vy, ngoi vic thc thi cng vic, LTV cn phi ci thin nhng k nng c bn nhng cc k quan trng khi lm vic trong mt t chc nh problem solving, qun l cng vic, giao tip/cng tc vi ng nghip, hay ch n gin vic bit say no.

Cui cng, khng ai pht trin c chuyn mn nu tch mnh ra khi cng ng chuyn mn. Tham gia v ng gp vo cng ng chuyn mn nn l u tin quan trng ca LTV. Hy theo di v hc hi nhng LTV giu kinh nghim. Hy ch ng chia s kin thc, kinh nghim ca mnh vi cng ng, qua gip ch cho nhng ngi khc v bn cng s nhn c nhng phn bin ca nhng LTV khc hon thin mnh.

l i dng nhng g mnh nhn xt v tm tt li t cun Dev UP. Nhiu kin thc lm, khng nh ht c, nn chc thnh thong phi c li. Ae no cha c th ln Tiki m t v nh. B ch lm :D

Thi nay th thi nh, ti i chy b =)))

-- Hanoi 12/11/2022 18h23 --

High Performance in MySQL - Part 2

Hiếu Phạm Duy — Sun, 12 Feb 2023 11:04:52 GMT

Today I will continue to share my note about 3 topics: Replication, Table partitioning, and Scaling with MySQL.

You could read Part 1 here: High Performance in MySQL - Part 1

Notes: if you have read High Performance in MySQL book, you could realize that my series is only a note I rewrite from this book. My main goal is that I won't forget what I've learned, and can look it up online whenever I need it.

I. Replication

Replication lets you configure one or more servers as replicas of a master server, keeping their data synchronized with the source copy.

How Replication works

At a high level, replication is a simple three-part process:

The source records changes to its data in its binary log as binary log events.
The replica copies the sources binary log events to its local relay log.
The replica replays the events in the relay log, applying the changes to its own data.

Benefits

Data distribution: useful for maintaining a copy of your data in a geographically distant location, such as a different data center or cloud region
Scaling read traffic: distribute read queries across several servers, which works very well for read-intensive applications (need to setup LB)
Backups: a valuable technique for helping with backups. However, a replica is neither a backup nor a substitute for backups.
Analytics and Reporting: Using a dedicated replica for reporting/analytics (online analytical processing, or OLAP) queries
High availability and failover: avoid making MySQL a single point of failure in your application
Testing MySQL upgrades: common practice to set up a replica with an upgraded MySQL version and use it to ensure that your queries work as expected before upgrading every instance.

Replication Problems and Solutions

Binary Logs Corrupted on the Source: rebuild your replicas
Non-unique Server IDs: accidentally configure two replicas with the same server ID => be careful when setting up your replicas
Undefined Server IDs: will not let you start the replica
Missing Temporary Tables: use row-based replication | name your temporary tables consistently (prefix with temporary_, for example) and use replication rules to skip replicating them entirely
Not Replicating All Updates: misuse SET SQL_LOG_BIN = 0 or dont understand the replication filtering rules, your replica might not execute some updates that have taken place on the source
Replication Lag:
- Multithreaded replication
- Use sharding: scale reads with replicas, scale writes with sharding
- Turn off sync bin log on replicas (when sharding is not a viable option because of effort or design issues)

Keep it simple. Dont do anything fancy, such as using replication rings or replication filters, unless you really need to.

II. Table Partitioning

Table Partitioning is the way a MySQL database splits its actual data down into separate tables, but still gets treated as a single table by the SQL layer

Partitioning Rules

must add the partition key into the primary key

Ex: the primary key must include created column

CREATE TABLE userslogs (    username VARCHAR(20) NOT NULL,    logdata BLOB NOT NULL,    created DATETIME NOT NULL,    PRIMARY KEY(username, created))PARTITION BY RANGE( YEAR(created) )(    PARTITION from_2013_or_less VALUES LESS THAN (2014),    PARTITION from_2014 VALUES LESS THAN (2015),    PARTITION from_2015 VALUES LESS THAN (2016),    PARTITION from_2016_and_up VALUES LESS THAN MAXVALUE

the table itself becomes a virtual concept. The partitions hold the data and any indexes are built on the data in the partitions.
MySQL supports horizontal partitioning but not vertical
Partition types:
- Range: it is great because you have groups of known IDs in each table, and it helps range queries.
- Hash: load balances the table, and allows you to write to partitions more concurrently. This makes range queries on the partition key a bad idea.

Benefits

Deletion: quickly delete data that is no longer needed
Storage: possible to store more data in one table than can be held on a single disk or file system partition
Performance: Query data faster when only accessing a smaller volume of data (could apply partition pruning)

III. Scaling MySQL

Scaling is the systems ability to support growing traffic

Read-Bound Workloads

When adding more application nodes to scale the clients serving requests leads to some database issues:

High CPU: means the server is spending all of its time processing queries. The higher CPU utilization gets, the more latency you will see in queries.
Heavy disk read IOPS or throughput: indicating that you are going to disk very often or for large numbers of rows read from disk

\==> adding indexes, optimizing queries, and caching data you can cache

Write-Bound Workloads

There are some examples of write-bound workloads:

Peak e-commerce season and sales are growing, along with the number of orders to track.
Signups are growing exponentially (Ex: ChatGPT reaches 100 million users 2 months after launch)

All of these are business use cases that lead to exponentially more database writes that you now have to scale.

\==> scale up (add more RAM, CPU and disk) or scale out (functional sharding)

1.Scaling Read

Use Replica Read Pools
A very common way to manage these read pools is to use a load balancer (HAProxy, Nginx) to run a virtual IP that acts as an intermediary for all traffic meant to go to the read replicas

2.Scaling Write with Sharding

Sharding means splitting your data into different, smaller database clusters so that you can execute more writes on more source hosts at the same time.
Do not split based on the structure of the engineering team. That will always change at some point. Do split tables based on business function
Most applications shard only the data that needs shardingtypically, the parts of the data set that will grow very large. And not just the data that is growing rapidly but also the data that logically belongs with it and will regularly be queried at the same time (partitioning)
Do not shy away from tackling spots where separate business concerns have been intermingled in the data and you need to advocate for not just data separation but also application refactoring and introducing API access across those boundaries.

Choosing a Partitioning Scheme

A good partitioning key is usually the primary key of a very important entity in the database. These keys determine the unit of sharding. For example, if you partition your data by a user ID or a client ID, the unit of sharding is the user or client.
Diagram your data model with an entity-relationship diagram or an equivalent tool that shows all the entities and their relationships. Try to lay out the diagram so that the related entities are close together.
Consider your applications queries as well. Even if two entities are related in some way, if you seldom or never join on the relationship, you can break the relationship to implement the sharding.
Choosing a partitioning key that lets you avoid cross-shard queries as much as possible but also makes shards small enough that you wont have problems with disproportionately large chunks of data

Querying across Shards

Most sharded applications have at least some queries that need to aggregate or join data from multiple shards (for reports)
Strive to make your queries as simple as possible and contained within one shard.
For those cases where some cross-shard aggregation is needed, we recommend you make that part of the application logic.
Cross-shard queries can also benefit from summary tables. You can build them by traversing all the shards and storing the results redundantly on each shard when theyre complete. If duplicating the data on each shard is too wasteful, you can consolidate the summary tables onto another data store so theyre stored only once.

IV. Summary

Optimizing and scaling MySQL is a journey. Before you dive into scalability bottlenecks, make sure youve optimized your queries, checked your indexes, and have a solid configuration for MySQL.

Once optimized, focus on determining whether you are read-bound or write-bound, and then consider what strategies work best to solve any immediate issues.

For read-bound workloads, our recommendation is to move to read pools unless replication lag is an impossible problem to overcome. If lag is an issue or if your problem is write-bound, you need to consider sharding as your next step.

That's all about MySQL I know so far. I hope it could help you to consolidate your knowledge and give you some ideas to optimize your MySQL.

High Performance in MySQL - Part 1

Hiếu Phạm Duy — Thu, 03 Nov 2022 07:37:25 GMT

MySQL is an open-source relational database management system and is one of the most common databases. Everyone uses MySQL and me too. But whether we are using it correctly and optimally.

Today I will share my experience and what I learned in 3 main topics: Schema Design, Indexing and Query Optimization.

But before we dig dive into them, we should understand MySQL's logical architecture. In other words, we should understand how MySQL processes our queries.

When we send a query to MySQL:

MySQL will check and open connection (reject if too many connections)
Parser will parse the query to check the syntax
Optimizer will optimize the query (based on Performance Schema)
Run the optimized query

And you see, MySQL will optimize our queries depending on metrics in the Performance Schema before running them. So which data types we use, how we design our tables, how we do the indexing and how we write the queries, will determine the performance of our database. It's hard to read and remember all knowledge so I write it as a quick note with multiple points. Anw, let's start!

I. Schema Design

1. Data Types

Using optimal data types not only reduces the storage space but also improves the query performance (because query data will be loaded onto RAM => if RAM is full, data will be flushed to disk => poor performance)

Smaller is usually better
Simple is good (e.g, use decimal rather than string for lat/long values)
Avoid null if possible: harder for MySQL to optimize queries

Let's check the data types MySQL has and discuss when we should use them:

Varchar: variable length
- use 1 or 2 extra bytes to store length, 1 byte if the value requires no more than 255 bytes, and 2 bytes if its more
  Ex: varchar(10) will use up to 11 bytes, varchar(500) will use up to 502 bytes
- Should use varchar(255) to maximize the use of a column because it needs only 1 extra byte for length.
Char: fixed length of characters => good choice for storing hashed user password
Text: string data types designed to store large amounts of data, cannot index full length. It will be stored off the table when the value is greater than 8kb and needs to be read from the disk when querying.
Blob: store binary data (images, videos,...) => in practice, we should save images and videos in other storage space and store image paths in MySQL (avoid full RAM)
Integer: tinyint 1 byte, small int 2 bytes, int 4 bytes, bigint 8 bytes=> the width for integer types such as int(11) which is meaningless for most applications because it does not restrict the range of values
Real number: Float 4 bytes, Double 8 bytes. Double has greater precision than Float
Datetime: 1000 - 9999, 8 bytes
Timestamp: 4 bytes, range 1970-2038, timestamp values will be converted from the current time zone to UTC for storage, and converted back from UTC to the current time zone for retrieval
Decimal: store numbers with fractional part (e.g, financial data)
Enum: cannot update value orders in enum, will be stored as int (should use tinyint instead so that you do not need to alter)

Notes: with varchar, text type, we should create a prefixed index if needed.

2. Schema Design Mistakes

Too many columns: heavy bin log could lead to replication lag, poor performance when pulling data onto memory for read/join queries,...
Too many joins: should join on 3 tables at most (harder for query optimization)
Should not use null

II. Indexing

The index is a data structure that storage engines use to find rows quickly. Index performance can drop very quickly when our dataset grows.

1. Types of Indexes

a. B-tree indexes

A data structure that store data in its node in sorted order
Leaf pages contain a link to the next for fast range traversals through nodes (has order)
Leaf pages have pointers to the indexed data
Storage engine does not have to scan the whole table to find the desired data

Advantages

B-trees store indexed columns in order => useful for range searching
Work well for lookups by the full key value, a key range, a key prefix
Store replated values close together
Index stores a copy of values => some queries can be satisfied from the index alone (covering index)

=> Index reduces the amount of data the server has to examine

=> Index helps the server avoid sorting and temporary tables

Limitation

Cant skip columns in the index

Cant optimize accesses with any columns to the right of the first range condition

VD: where last_name = Smith and first_name like J% and dob = . Because the LIKE is a range condition so MySQL cannot apply index for **dob** column searching.

Btree works with the following kinds of queries:

Match on the full key value specifies values for all columns in the index
Match a leftmost prefix: uses only the first column in the index
Match a column prefix: uses only the first column in the index.
Match one part exactly and match a range on another part
Index-only queries: access only the index, not the row storage

b. Hash indexes

Hash indexes use hash tables to store data and have somewhat different characteristics from those just discussed:

They are used only for equality comparisons that use the = or != operators (but are very fast). They are not used for comparison operators such as < that find a range of values. Systems that rely on this type of single-value lookup are known as key-value stores; to use MySQL for such applications, use hash indexes wherever possible.
The optimizer cannot use a hash index to speed up ORDER BY operations. (This type of index cannot be used to search for the next entry in order.)
MySQL cannot determine approximately how many rows there are between two values (this is used by the range optimizer to decide which index to use).
Only whole keys can be used to search for a row. (With a B-tree index, any leftmost prefix of the key can be used to find rows.)

c. Adaptive Hash indexes

The InnoDB storage engine has a special feature called adaptive hash indexes. When InnoDB notices that some index values are being accessed very frequently, it builds a hash index for them in memory on top of B-tree indexes. This gives its B-tree indexes some properties of hash indexes, such as very fast hashed lookups. This process is completely automatic, and you cant control or configure it, although you can disable the adaptive hash index altogether.

2. Indexing Strategies

a. Prefix

Selectivity: ratio of the number of distinctly indexed values to the total number of rows in the table => higher is better
With varchar /Text column: must define prefix indexes => long enough to give good selectivity and short enough to save space
Cannot use prefix indexes for ORDER BY or GROUP BY queries, nor can it use them as covering indexes

b. MultiColumn indexes

You need a single index with all relevant columns (AND), not multiple indexes that have to be combined
OR condition: sometimes use lots of CPU and memory resources to merge
Choose the most selective columns first in the index (examine the distribution of values in the table, count distinct / count *)

c. Clustered Indexes

Store the Btree index and the rows together in the same structure
When a table has a clustered index, its rows are actually stored in the indexs leaf pages

d. Secondary Indexes

Secondary index accesses require two index lookups instead of one
A leaf node doesnt store a pointer to the referenced rows physical location; rather, it stores the rows primary key values
To find a row from a secondary index, the storage engine first finds the leaf node in the secondary index and then uses the primary key values stored there to navigate the primary key and find the row (traverse two Btrees)

e. Covering Indexes

An index that contains all data needed for a query is called a covering index
Databases do not need to access the filesystem

III. Query Optimization

1. Optimize Data Access

a. Are You Asking the Database for Data You Dont Need?

You should find out whether your application is retrieving more data than you need

Fetching more rows than needed => need limit
Fetching all columns from a multi-table join => define what columns you need
Fetching all columns=> cannot use covering indexes, add more IO, memory
- Avoid using select * from
- But in some cases, we should query full objects, cache them, and use them many times could increase performance
Fetching the same data repeatedly => caching

b. Is MySQL Examining Too Much Data?

In MySQL, the simplest query cost metrics are:

Response time: Service time + Queue time
Number of rows examined
Number of rows returned

Ideally, the number of rows examined would be the same as the number returned (100%). To reduce the number of examined rows:

Use covering indexes: no need to retrieve rows from tables
Change the schema, using summary tables (prepare summary/report tables in advance)
Rewrite a complicated query so the MySQL optimizer is able to execute it optimally

2. Ways to Restructure Queries

a. Complex Queries Versus Many Queries

The traditional approach to database design emphasizes doing as much work as possible with as few queries as possible. This approach was historically better because of the cost of network communication and the overhead of the query parsing and optimization stages.

However, this advice doesnt apply as much to MySQL because it was designed tohandle connecting and disconnecting very efficiently and to respond to small, simple queries very quickly. Modern networks are also significantly faster than they used tobe, reducing network latency. So running multiple queries isnt necessarily such a bad thing.

Its still a good idea to use as few queries as possible, but sometimes you can make a query more efficient by decomposing it and executing a few simple queries instead of one complex one.

b. Chopping up a query

Need to delete old data => chop up a DELETE statement and run sequentially

 E.g: DELETE FROM messages WHERE created < DATE_SUB(NOW(),INTERVAL 3 MONTH);    => we will limit the number of affected rows: DELETE FROM messages WHERE created < DATE_SUB(NOW(),INTERVAL 3    MONTH) LIMIT 10000

=> minimize the impact on the server (smaller transactions), reduce replication lag

=> it might be a good idea to add some sleep time between DELETE statements to reduce the load on servers.

c. Join Decomposition

MySQL can run well over 100000 simple queries per second
Moderns networks are also significantly faster than they used to be

=> sometimes you can make a query more efficient by decomposing it and executing a few simple queries instead of one complex one.

Advantages

Caching can be more efficient
Reduce lock contention
Doing joins in the application makes it easier to scale the database by placing tables on different servers (because we do not need to join)
The queries themselves can be more efficient (such as using an IN() instead of a join lets MySQL retrieve rows more optimally)
You can reduce redundant row accesses. Doing a join in the application means you retrieve each row only once, whereas a join in the query is essentially a denormalization that might repeatedly access the same data

Disadvantages

Need to perform the join in the application
Require a certain level of your team

IV. Conclusion

Phewww, we've just discussed about 3 main and most important topics in MySQL. I can not show all my experience and knowledge about MySQL in one post, so this post is like a note for me or anyone else to review and recall tips and strategies to improve MySQL performance.

In the next post, we will discuss Replication and Scaling techniques in MySQL. I will share how I scaled my database and improved read/write performance.

See you next time!

References

LFU cache and Java implementation

Hiếu Phạm Duy — Sat, 01 Oct 2022 09:37:46 GMT

I found this one on VOZ forum when a member shared about Shopee coding interview. And this is one of the most interesting problems on Leetcode in my opinion. I need to combine both fancy data structures HashMap and LinkedList in my solution. Today, to enjoy my weekend, I will explain how I implement the LFU cache.

I. Preparation

A cache always has 2 main functions: get and put. And the requirements here are get and put method must run in O(1) average time complexity.

=> we will store data in a Hashmap.

we invalidate and remove the least frequently used key. When there are 2 or more keys with the same frequency, we will remove the least recently used key.

=> we will store the frequency of keys in a Hashmap <Frequency, LinkedList of Key> and will maintain the least recently used by the LinkedList of keys in the value.

=> we also need to maintain what is the min frequency of cache, so when you update the freqMap, you also need to update the min frequency.

And this is my note before I implement:

class LFUCache:
- capacity
- min freq
- freqMap: map of (freq, double linked list of nodes)
- cache: map of (key, node)
get:
- if not exist: return -1
- if exist:
  - check if the node has freq = min_freq and list size == 0 => min_freq++ (because after that we will increase cur freq by 1)
  - increase node frequency by 1
  - remove the node from the current LinkedList in freqMap and insert it to the head of the list at freq + 1 key
  - return value

put:
- if exist: update the value of the key in the cache, increase the freq of the node by 1, and update freqMap with the new freq.
- else:
  - full capacity: get linked list from freqMap by min freq of cache, remove the last node in freqMap and cache.
  - create a new node with freq = 1, reset min freq to 1
  - insert to freqMap: get the cur list or create a new list if not exist, add the new one, and put the list again.
  - insert into cache.

II. Implementation

Phewwww, here is my implementation:

import java.util.HashMap;import java.util.Map;class LFUCache {  private int capacity;  private int minFreq;  private Map freqMap = new HashMap<>();  private Map cache = new HashMap<>();  public LFUCache(int capacity) {    this.capacity = capacity;  }  public int get(int key) {    if (!cache.containsKey(key)) return -1;    Node node = cache.get(key);    updateFreqList(node);    return node.val;  }  public void put(int key, int value) {    if (capacity == 0) return;    // update value and frequency when exist    if (cache.containsKey(key)) {      Node node = cache.get(key);      node.val = value;      updateFreqList(node);    } else {      // remove when cache is full      if (cache.size() == capacity) {        DoubleLinkedList minFreqList = freqMap.get(minFreq);        Node minNode = minFreqList.removeLast();        cache.remove(minNode.key);      }      Node newNode = new Node(key, value);      minFreq = 1;      DoubleLinkedList curList = freqMap.getOrDefault(1, new DoubleLinkedList());      curList.add(newNode);      freqMap.put(1, curList);      cache.put(key, newNode);    }  }  private void updateFreqList(Node node) {    // remove node from cur list    DoubleLinkedList curList = freqMap.get(node.frequency);    curList.remove(node);    if (node.frequency == minFreq && curList.size == 0) minFreq++;    node.frequency++;    // insert into new list with new freq    DoubleLinkedList newList = freqMap.getOrDefault(node.frequency, new DoubleLinkedList());    newList.add(node);    freqMap.put(node.frequency, newList);  }  class Node {    int key;    int val;    Node next;    Node prev;    int frequency;    public Node(int k, int v) {      key = k;      val = v;      frequency = 1;    }  }  class DoubleLinkedList {    Node head;    Node tail;    int size;    public DoubleLinkedList() {      this.size = 0;      head = new Node(0, 0);      tail = new Node(0, 0);      head.next = tail;      tail.prev = head;    }    public void add(Node node) {      node.next = head.next;      head.next.prev = node;      node.prev = head;      head.next = node;      size++;    }    public void remove(Node node) {      node.prev.next = node.next;      node.next.prev = node.prev;      size--;    }    public Node removeLast() {      if (size > 0) {        Node tailNode = tail.prev;        remove(tailNode);        return tailNode;      }      return null;    }  }}

III. Result

Phewwwww, it's done. I implemented my own doubly linked list, and compare it with when I use LinkedList of Java Core. Surprisingly, my own linked list had better runtime (149ms) while Java core's LinkedList triple (509ms). That may be because the core LinkedList needs to do many other actions (when we add and remove) rather than focusing on only this problem =)))

Okay, that's all for today's post.

IV. Enjoy your weekend

By the way, let's see =))). Autumn has already come. Go out and enjoy now =)))

Enjoy your weekend!

Web Security Notes

Hiếu Phạm Duy — Tue, 13 Sep 2022 01:07:20 GMT

There are some security concepts in web development that a developer needs to understand. I wrote this note because nobody can remember all those things they read. Taking notes is a good way to store them instead of trying to remember them - impossible =)))

I. CORS

What: Technique that allows a server to indicate any other domains which can make requests to that server in the browser
Why we need: the same origin policy of browser which restricts JavaScript code from making requests from one domain to another domain
Ex: you open a Facebook tab and a hacker website tab on your browser. The tab Facebook uses JS to request to the server, if there is no same origin policy, JS of hacker website could also make requests to the Facebook server. Thats why the browser needs to have a policy to detect JS of a resource could access other resources or not
How to work:
- a client send a request to a server with Origin header which contains the domain of the current website
- The server will validate the Origin, and if valid, return a response with the header Access-Control-Allow-Origin (often have the same value as the Origin header)
- If there is no Access-Control-Allow-Origin header in a response or an invalid value there, the browser will return an error
Config CORS in Spring Boot:
- Use @CrossOrigin annotation to enable cross-origin calls from a list of other domains
- Define maxAge to cache the preflight response
- Config at method level, class level, or globally
- To config globally, you need to config a configuration bean that implement webMvcConfiguer to add CorsMappings

II. Attack Basics

1. SQL Injection

What: Attack that uses malicious SQL statement to access information that is not intended to display.
Results:
- Extract sensitive data
- Delete data or drop tables
How to prevent:
- do not use string concat to build a query
- use parameterized queries with spring JPA or JDBC to prevent that.
- Principle of least privilege: reduce the permissions of the application/users at runtime
  so it can at most edit data, but not change table structures
- Password hashing: use one-way hash algorithms such as Bcrypt, can use with salt

2. XSS - Cross-Site Scripting

Attackers can inject malicious JavaScript into your website.

a. Persistent XSS

How: user can add content such as new comments, and those comments which contain malicious JavaScript will be stored in database. So any other users who see that comment will be attacked, and their browser will run this javascript
Results:
- Spreading malicious js on social media sites, auto download something to your computer,...
- Steal your session
- Steal your sensitive data
How to prevent:
- Escape dynamic content: replace significant characters with HTML entity encoding => the script will never be treated as executable code by the browser
- Whitelist values: users have to select from a list rather than provide any input
- Implement Content-Security Policy that defines script source and tells the browser to never execute inline JavaScript
- HTTP-only cookies: mark cookies as HTTP-only so cookies will be received, stored, and sent by the browser only, but cannot be modified or read by JavaScript
Set HTTP-only on Spring:
- Create a ResponseCookie object
- Set HTTP only = true, set maxAge
- Add cookie to the response

b. Reflected XSS

How:
- Hacker injects malicious JavaScript into the query string and sends it to users
- Users click to that URL, the server will respond the parameter back to the user (such as on the search results page)
- Then the browser will render the script (send requests to the hacker website with the param is the cookie of users).
- Hacker server has session or cookie, then can call to that website as that user.
Types of pages will be attacked:
- Search results: search criteria get displayed back to the users
- Error pages: have an error message which contains invalid input, does the input get escaped properly when it is displayed back to the user?
- Form submissions: if a page post data, does any part of the data being submitted by the form get displayed back to the user?

3. CSRF

What: If hackers can forge HTTP requests to your site, they may be able to trick your users into triggering unintended actions
How to attack:
- User login to their website A
- Hacker create a malicious website and trick users to click on their link
- In the hackers website, there is a form or img to forge a request to website A with users cookie
- Website A has no way to distinguish between a forged request and an actual request
- That request could do some intended action via the users account such as posting a new post, spreading worms on social media, or transferring funds from the visitors account to others,...
How to prevent:
- Use REST standard: GET requests are used only to view resources => limit the harm that can be done by malicious URLs - an attacker will have to work much harder to generate a harmful POST request
- Client-side generated CSRF-tokens: client code generates and sends the same unique secret value in both Cookie and a custom HTTP header.
  => considering a website is only allowed to read/write a Cookie for its own domain, so only a real website can send the same value in both headers
  => The server will compare the token attached to the request with the value stored in the cookie.
- Check the HTTP Referer and Origin header: check the expected domain which triggered the request and reject any requests with abnormal domains

4. DDOS

What: Denial of Service, when attackers attempt to make your website unavailable to others
How: flooding with requests to exhaust all the available resources (server resources, network bandwidth)=> real users are unable to get access
Types:
- SYN flood: exploits 3-way handshake of TCP when client does not send ACK message to server so may connections do not close so that the server cannot open new connections with real users
- HTTP Flood: hackers will exploit legitimate GET or POST requests=> exhaust the servers connection pool
- Others: ...
How to prevent:
- Block IP address => hackers will use distributed DOS
- Apply rate limit for IP address, users
- Autoscaling
- Caching commonly accessed resources to reduce database access
- Serve your images, videos and other resources from CDN so that you are offloading accessed resources to a third-party service designed to withstand large amounts of traffic

III. Other Concepts

Hash vs Encrypt vs Encode

Hashing is the act of transforming data from arbitrary size to a fixed size value. => use for checking the integrity of data or verifying the password (Bcrypt hashing)
Encode is a way to transform data into another form to preserve usability. Ex: we transform text, audio, image into bits 1 and 0 so computers can understand, store and process them.
Encryption is the act of transforming data into another form to preserve the confidentiality
Ex: RSA encryption uses a public key to encrypt and a private key to decrypt data

Is there any way to crack Hash?

Yes of course. We can try to guess the original string by calculating the hash of every possible input and then comparing the results.

Encryption Algorithms

Symmetric encryption: uses the same key for both encrypting and decrypting data (such as AES,...)
Asymmetric encryption: uses a public key to encrypt and a private key to decrypt data (such as RSA,...)

To be continued...

Học gì và học để làm gì?

Hiếu Phạm Duy — Tue, 02 Aug 2022 16:19:07 GMT

Mi ngi c 1 mc tiu cuc i, tm gi l mission. C ngi mun thnh chuyn gia trong lnh vc ca h, c ngi mun pht trin ln lm qun l. Cng c ngi mun i kinh doanh, pht trin doanh nghip ca ring mnh. C ngi li ch cn chill chill sng cp i ti cp v, chng mng phn u.

Ty vo mission th no, t khc bn s bit mnh cn lm g. V bao gi cng th, khi mc tiu ca bn r, in ra dn c ln trn, th cng l lc bn nhn thc c vic g quan trng hay khng quan trng, hc ci ny hay ci kia, v hc dng cho vic g.

t mc tiu cng sm, bn cng rt ngn c thi gian hon thnh mc tiu, khng sa vo nhng th khng quan trng. ng nhin l phi c k lut na.

Ti xc nh mc tiu kh mun so vi bn b ng trang la. Ti cng khng hi hn v i li l tui tr nhiu sc mu cung bc. l qung thi gian sng vi m nhc, nhng chuyn i di rong rui, gp nhng con ngi th v v lng nghe nhng cu chuyn hay. n mt ngy, ti dng li v t hi mnh s lm g tip y. V cho n khi ra trng khong hn 1 nm, ti mi c cu tr li.

V t n nay, ti lun n lc cho nhng vic quan trng, lin quan trc tip n mc tiu. c bit l ti lun cn nhc vic c nn hc 1 th g khng, hay ti c thc s cn n trong cng vic hay cuc sng khng. Bi v hc m khng dng th qu lng ph thi gian. Cng ging nh hng ng kin thc bn c nhi nht trn trng, ti gi bn cn nh hay p dng c nhng g???

Ti gi i lm ri, c ai quan tm vng Benzen ra rng? hay cc hm tch phn tng tng lp lp dng cho vic chi? (Tr nhng bn nghin cu ng chuyn ngnh th mnh k ni).

Cho nn khi mt ai khen mt th g hay v bo ti hc i, ti lun ngh ngay ti vic s p dng n vo u, mc u tin, trc khi quyt nh dnh thi gian cho n.

Ngnh IT kin thc l v bin, nu khng bit chn lc th thc s bn d b "ci g cng bit m ch bit ci g". Nh c nhn ti xc nh trong 3 nm ti ch tp trung lm Backend, th ti phi dnh ti 95% ngun lc cho n. Hm trc, trong bui phng vn v tr Backend Dev, ti c hi nu khch hng yu cu th em c sn sng chuyn qua code Front-end khng? Ngay lp tc ti chia s lun nh hng ca bn thn v by t khng sn sng lm nh th. V ti bit nu chy theo nhng g khch hng yu cu th ti s b xoay nh chong chng, lu dn chng bit mnh mnh ci g.

X hi ngy cng phn cng r rng. Cc cng ty ang c xu hng tuyn ngi chuyn v mt lnh vc nht nh ch khng i hi mt ngi bit mi th nhng chng th g su sc. Tt nht bn nn chia s nh hng r rng vi nh tuyn dng tm c mt mi trng ph hp.

Quay li chuyn hc g v hc lm g, ch c bn mi tr li c, sau khi bit mission ca mnh. Tuy nhin hy nh: phn ln kin thc m bn hc trn trng chng p dng c cho cuc sng hin ti ca bn. Do vy la m hc th cn thit thi :))) Cn ti t th quan trng ang ch bn khm ph, vi iu kin bn phi nhn ra chng trc. Khi nhn ra ri th cm u m chy ti ch ch cn ng nghing lm g na :)))

02-08-2022 23:05

Duy Hieu

Enhance Your Experience With Xfast – Super Fast Super Cheap

Hiếu Phạm Duy — Wed, 01 Jun 2022 17:31:38 GMT

To bring our customers better experiences every day, Giao Hang Tiet Kiem officially launched the XFAST service with extremely outstanding features.

Where: Applicable for orders with pickup and delivery addresses within the urban area in Hanoi and Ho Chi Minh City.
Who: shops and personal customers with fruits, food, clothes, cosmetics, books, components,...
Time: 7h-21h30

XFAST commits that the maximum delivery time of orders will be specifically applied to each kilometer as follows:

0 - 3km delivered in 30 minutes
3 - 6km delivered in 60 minutes
6 - 9km delivered in 90 minutes
More than 9km delivery only from 2-3 hours

Cost:

0.9 USD for a package with distance <= 2km
plus 0.15USD/km if distance > 2km
order from 18h-6h: additional cost 0.2USD

For the best support, Shop can contact the following channels to answer questions from GHTK:

Email: cskh@ghtk.vn

Fanpage: Giaohangtietkiem.vn

Tips for Using Exceptions in Java

Hiếu Phạm Duy — Sun, 20 Mar 2022 08:39:56 GMT

There is a certain amount of controversy about the proper use of exceptions. Some programmers believe that all checked exceptions are a nuisance, others cant seem to throw enough of them. We think that exceptions (even checked exceptions) have their place, and offer you these tips for their proper use.

I. Exception handling is not supposed to replace a simple test.

As an example of this, we wrote some code that tries 10,000,000 times to pop an empty stack. It first does this by finding out whether the stack is empty.

if (!stack.isEmpty()) s.pop();

Next, we force it to pop the stack no matter what and then catch the EmptyStackException.

try {   stack.pop();}  catch (EmptyStackException e) {}

On my test laptop, the version that calls isEmpty ran in 20 milliseconds. The version that catches the EmptyStackException ran in 3640 milliseconds.

As you can see, it took far longer to catch an exception than to perform a simple test. The moral is: Use exceptions for exceptional circumstances only.

II. Do not micromanage exceptions.

Many programmers wrap every statement in a separate try block.

for (i = 0; i < 100; i++) {      try {        this.method1()      } catch (EmptyStackException e) {        // problem 1      }      try {        this.method2()      } catch (IOException e) {        // problem 2      }}

This approach blows up your code dramatically. Just wrap the entire task in a try block. If any one operation fails, you can then abandon the task.

for (i = 0; i < 100; i++) {      try {        this.method1()        this.method2()      } catch (EmptyStackException e) {        // problem 1      } catch (IOException e) {        // problem 2      }}

This code looks much cleaner. It fulfills one of the promises of exception handling: to separate normal processing from error handling.

III. Make good use of the exception hierarchy.

Dont just throw a RuntimeException. Find an appropriate subclass or create your own.Dont just catch Throwable. It makes your code hard to read and maintain.

Do not hesitate to turn an exception into another exception that is more appropriate. For example, when you parse an integer in a file, catch the NumberFormatException and turn it into a subclass of IOException or FileInputFormatException that you declared.

IV. Throw early, catch late

Some programmers worry about throwing exceptions when they detect errors. Maybe it would be better to return a dummy value rather than throw an exception when a method is called with invalid parameters?

For example, should Stack.pop return null, or throw an exception when a stack is empty? We think it is better to throw an EmptyStackException at the point of failure than to have a NullPointerException occur at later time.

Many programmers feel compelled to catch all exceptions that are thrown. Often, it is actually better to propagate the exception instead of catching it:

Higher-level methods are often better equipped to inform the user of errors or to abandon unsuccessful commands.

(Source: Core Java Volumn I - Fundamentals)

Design Hints for Inheritance in Java

Hiếu Phạm Duy — Sun, 13 Mar 2022 08:36:43 GMT

Some hints that we have found useful when using inheritance.

I. Place common operations and fields in the superclass.

Instead of replicating each field in subclasses, we should put common fields and methods on the superclass.

II. Dont use protected fields

Some programmers think it is a good idea to define most instance fields as protected, just in case so that subclasses can access these fields if they need to. However, the protected mechanism doesnt give much protection, for two reasons:

The set of subclasses is unbounded - anyone can form a subclass of your classes and then write code that directly accesses protected instance fields instead of using getter methods, thereby breaking encapsulation.
In Java, all classes in the same package have access to protected fields, whether or not they are subclasses.

III. Use inheritance to model the isa relationship.

Inheritance is a handy code-saver, but sometimes people overuse it. For example, suppose we need a Contractor class. Contractors have names and hire dates, but they do not have salaries. Instead, they are paid by the hour, and they do not stay around long enough to get a raise.

There is the temptation to form a subclass Contractor from Employee and add anhourlyWage field.

public class Employee {   private double salary;   private LocalDate hiredDate;}public class Contractor extends Employee {   private double hourlyWage;   . . .}

Look good, right?

But this is not a good idea, because now each contractor object has both a salary and hourly wage field.

The contractor-employee relationship fails the isa test. A contractor isnot a special case of an employee.

IV. Dont use inheritance unless all inherited methods make sense.

If you find any method in superclass which is not appropriate or doesn't make sense in the subclass, inheritance is not appropriate.

V. Dont change the expected behavior when you override a method.

The substitution principle applies not just to the syntax but, more importantly, to behavior. When you override a method, you should not unreasonably change its behavior. The compiler cant help you - it cannot check whether your redefinitions make sense.

Technical debt and bugs come from here :)))

VI. Use polymSourceorphism, not type information.

Whenever you find code of the form

if (x is of type 1)  action1(x);else if (x is of type 2)  action2(x);

think about polymorphism.

Do action1 and action2 represent a common concept? If so, make the concept a method of a common superclass or interface of both types. Then, you can simply call:

x.action();

Code that uses polymorphic methods or interface implementations is mucheasier to maintain and extend than code using multiple type tests.

VII. Dont overuse reflection.

The reflection mechanism lets you write programs with amazing generality, by detecting fields and methods at runtime. This capability can be extremely useful for systems programming, but it is usually not appropriate in applications. Reflection is fragilewith it, the compiler cannot help you find programming errors. Any errors are found at runtime and result in exceptions.

(Source : Core Java Volumn I - Fundamentals)

Class Design Hints in Java

Hiếu Phạm Duy — Wed, 09 Mar 2022 16:23:33 GMT

Some hints that will make your classes more acceptable in well-mannered OOP circles:

1. Always keep data private.

This is first and foremost; doing anything else violates encapsulation. Youmay need to write an accessor or mutator method occasionally, but you arestill better off keeping the instance fields private.

2. Always initialize data.

Java wont initialize local variables for you, but it will initialize instancefields of objects. Dont rely on the defaults, but initialize all variablesexplicitly, either by supplying a default or by setting defaults in allconstructors.

3. Dont use too many basic types in a class.

The idea is to replace multiple related uses of basic types with otherclasses. This keeps your classes easier to understand and to change.

For example, replace the following instance fields in a Customer class:

private String street;private String city;private String state;private int zip;

with a new class called Address. This way, you can easily cope withchanges to addresses, such as the need to deal with international addresses.

4. Not all fields need individual field accessors and mutators.

You may need to get and set an employees salary. But you certainly wontneed to change the hiring date once the object is constructed. And, quiteoften, objects have instance fields that you dont want others to get or set,such as an array of state abbreviations in an Address class.

5. Break up classes that have too many responsibilities.

This hint is, of course, vague: too many is obviously in the eye of the beholder. However, if there is an obvious way to break one complicated class into two classes that are conceptually simpler.(On the other hand, dont go overboard; ten classes, each with only onemethod, are usually an overkill.)

Ex: Here is an example of a bad design

public class CardDeck // bad design{private int[] value;private int[] suit;public CardDeck() { . . . }public void shuffle() { . . . }public int getTopValue() { . . . }public int getTopSuit() { . . . }public void draw() { . . . }}

This class really implements two separate concepts: a deck of cards, with itsshuffle and draw methods, and a card, with the methods to inspect itsvalue and suit. It makes sense to introduce a Card class that represents anindividual card. Now you have two classes, each with its own responsibilities:

public class CardDeck{private Card[] cards;public CardDeck() { . . . }public void shuffle() { . . . }public Card getTop() { . . . }public void draw() { . . . }}public class Card{private int value;private int suit;public Card(int aValue, int aSuit) { . . . }public int getValue() { . . . }public int getSuit() { . . . }}

6. Make the names of your classes and methods reflect their responsibilities.

Variables, methods and classes should have meaningful names that reflect what they represent
Class name should be a noun (Ex: Order, Package,...)
Accessor methods begin with a lowercase get (Ex: getSalary)
Mutator methods use a lowercase set (Ex: setSalary)

7. Prefer immutable classes

Problem with mutation is that it can happen concurrently when multiple threads try to update an object at the same time
When classes are immutable, it is safe to share their objects among multiple threads
Instead of mutating objects, we create methods to return new objects with the modified state
Make class immutable whenever you can
Of course, not all classes should be immutable. It would be strange to have the raiseSalary method return a new Employee object when an employee gets a raise.

(Source : Core Java Volumn I - Fundamentals).

Clean Architecture (Part 4)

Hiếu Phạm Duy — Sun, 19 Dec 2021 17:41:22 GMT

You could follow the previous posts here:

IV. Architecture

Before we go inside the Clean Architecture, we should answer that what is software architecture? and What does a software architect do?, andwhen does he or she do it?

1. What is Architecture?

First of all, a software architect must be a programmer, and continues to be aprogrammer. Vietnamese developers usually try to avoid coding and focus on higher level issues after several coding years. It's quite sad that the way of thinking has been creating a big gap between Vietnamese and US/Indian programmers. And in fact, there are not many quality programmers in Vietnam.

I could say that Software architects are the best programmers. And they continue to take programming tasks, while they also guide the rest of the team toward a design thatmaximizes productivity.

The architecture of a software system is the shape of a system given by those who build it. The form of that shape is in the division of that system into components, the arrangement of those components, and the ways in which those components communicate with each other.

All software systems can be decomposed into two major elements: policy and details

The policy element embodies all the business rules and procedures. The policy is where the true value of the system lives.
The details are those things that are necessary to enable humans, other systems, and programmers to communicate with the policy, but that do not impact the behavior of the policy at all. They include IO devices, databases, web systems, servers, frameworks, communication protocols, and so forth.

Good architects design the policy so that decisions about the details can be delayed and deferred for as long as possible.

I mean a good architecture is one in which decisions about frameworks, databases, web servers, libraries,... are deferrable. A good architecture does not depend on those decisions.

And a Good Architecture must support

The use cases and operation of the system.
A shopping cart application with a good architecture will look like ashopping cart application. The use cases of that system will be plainly visiblewithin the structure of that system. If the system must handle 100,000 customers persecond, the architecture must support that kind of throughput and responsetime for each use case that demands it.
The maintenance of the system.
Recall that the goal of an architect is to minimize the human resourcesrequired to build and maintain the required system
The development of the system.
Any organization that designs a system will produce a design whose structure isa copy of the organizations communication structure. A Good Architecture facilitates independent actions by those teams, so that the teams do not interfere with each other during development. This is accomplished by properly partitioning the system into well-isolated, independently developable components.
The deployment of the system.
The goal is immediate deployment. A good architecture does not rely on dozens of little configuration scripts and property file tweaks. A Good Architecture helps the system to be immediately deployable after build. And again, this is achieved through the proper partitioning and isolation of the components of the system.

2. Boundaries - Drawing lines

Software architecture is the art of drawing lines

You draw lines between things that matter and things that dont. The GUIdoesnt matter to the business rules, so there should be a line between them.The database doesnt matter to the GUI, so there should be a line between them. The database doesnt matter to the business rules, so there should be aline between them.

Specifically, you should put an interface between your design and data repository, which provides all the functionality we need to use when working with databases, but we do not implement those methods at first.

=> you can focus on getting the business rules written and tested before you have to make the database decision.

The same thing happens with the GUI. The core business rules should be kept separate from, and independent of, those components that are either optional or that can be implemented in many different forms.

Plugin architecture creates firewalls across which changes cannot propagate

To be honest, when you put those lines between components, you actually minimize the changes to propagate.

Arrange the code in those components such that the arrows between them point in one direction - toward the core business

Dependency arrows should be arranged to point from lower-level details to higher-level abstractions.

3. Screaming Architecture

If you are building a health care system, then when new programmers look at the source repository, their first impression should be, Oh, this is a heath care system.

Software architectures are structures that support the use cases of the system and scream about them.
If your architecture is based on frameworks, then it cannot be based on your use cases.
Good architectures are centered on use cases so that architects can safely describe the structures that support those use cases without committing to frameworks, tools, and environments.
If your system architecture is all about the use cases, you should be able to unit-test all those use cases without any of frameworks, databases, web servers
Those new programmers should be able to learn all the use cases of the system, yet still not know how the system is delivered.

4. The Clean Architecture

After some parts to warm up, we will talk about the "Clean". You could see some of architectures such as Hexagon, BCE,... but they all have the same objective, which is the separation of concerns. So Uncle Bob give us some characteristics of a clean architecture:

Divide the software into layers
Independent of frameworks, UI, database, external services
Testable without UI, database, web server, or any other external services

The outer circles are mechanisms (details), the inner circles are policies (business rules)
Source code dependencies must point only inward, toward higher-level policies

5. Details

as I said in previous parts, we must keep our business rules separate from those things:

Database
Web
Framework

Those things are significant, but they are details. You can use the framework - just dont couple to it. Framework authors know their own problems, and the problems of their coworkers and friends. And they write their frameworks to solve those problems - not yours.

The architecture of the framework is often not very clean and tends to violate the Dependency Rule

When your product matures, it may outgrow the facilities of the framework. Or the framework may evolve in a direction that you dont find helpful. You may be stuck upgrading to new versions that dont help you. Or a new and better framework may come along that you wish you could switch to.

I could see coupling issues in my current company. Even by some experienced architects, framework/database coupling still exists.

So:

Keep the framework behind an architectural boundary for as long as possible

V. Conclusion

We went through 4 part of my notes about Clean Architecture. We could see some principles which are really precious in software development. Applying those principles will prevent you from some common traps which could pull you down to the hell.

Remember to separate your software into layers and use interfaces between them to decouple business rules and details. And your decisions of details should be delayed as long as possible so that you could have more information to choose them properly.

Clean Architecture - Component Principles (Part 3)

Hiếu Phạm Duy — Mon, 06 Dec 2021 17:02:34 GMT

You could follow the previous posts here:

If the SOLID principles tell us how to arrange the bricks into walls androoms, then the component principles tell us how to arrange the rooms intobuildings.

III. Component Principles

Components are the units of deployment. They are the smallest entities thatcan be deployed as part of a system. In Java, they are jar files. In Ruby, theyare gem files. In .Net, they are DLLs.

1. Component Cohesion

Focus on the granularity of components and help the developer partition classes into components.

Uncle Bob gave us three principles of component cohesion:

REP: The Reuse/Release Equivalence Principle

Classes and modules that are grouped together into a component should be releasable together, share the same version number, are included under the same release documentation
Classes and modules that are formed into a component must belong to a cohesive group
If a component should be considered reusable it must be a releasable unit

CCP: The Common Closure Principle

Gather into components those classes that change for the same reasons and at the same times.
Separate into different components those classes that change at different times and for different reasons.
this is Single Responsibility Principle for components
A class should not contain multiple reasons to change
Drive components to be larger

CRP: The Common Reuse Principle

Don't force users of a component to depend on things they dont need
Drive components to be smaller

2. Component Coupling

Focus on the stability and relationship between the components

Acyclic Dependencies Principle

Allow no cycles in the component dependency graph

Stable Dependencies Principle - SDP

Depend in the direction of stability
I : Instability : I = Fan-out / (Fan-in + Fan-out)

with Fan-in : Incoming dependencies, Fan-out : Outgoing depenencies

I metric (Instability) of a component should be larger than the I metrics of the components that is depends on=> I metrics should decrease in the direction of dependency
Not all components should be stable

Stable Abstraction Principle - SAP

A component should be as abstract as it is stable
The software that encapsulates the high-level policies of the system should be placed into stable components (I = 0), Unstable components (I = 1) should contain only the software that is volatile (quickly and easily change)
A stable component should also be abstract so that its stability does not prevent it from being extended (Ex: interface, abstract class)
Unstable component should be concrete since its instability allows the concrete code within it to be easily changed

with Abstractness A = Number of abstract classes and interfaces / number of classes

Dependencies run in the direction of abstraction

Uncle Bob also give us the concept Zones of Exclusion

with:

Zone of Pain : highly stable and concrete component

=> cannot be extended because it is not abstract, very difficult to change because of its stability.

Zone of Useless : maximally abstract, yet has no dependents

=> such components are useless :)))

Good architects strive to position the majority of their components at endpoints on the Main Sequence.

=> But in reality, those components have the best characteristics if they are on, or close, to the Main Sequence.

(to be continued

Clean Architecture Notes - SOLID (Part 2)

Hiếu Phạm Duy — Sun, 05 Dec 2021 11:32:15 GMT

You could follow the previous post here

To build a building, we need bricks. On the one hand, if the bricks arent well made, the architecture of the building doesnt matter much. On the other hand, you can make a substantial mess with well-made bricks.

II. Design Principles

Good software systems begin with clean code. The SOLID principles tell us how to arrange our functions and data structures into classes, and how those classes should be interconnected

Or you could say:

The SOLID principles tell us how to arrange the bricks into walls and rooms

The Goal

The goal of the principles is the creation of mid-level software structures (module level) that:

Tolerate change

Are easy to understand

Are the basis of components that can be used in many software systems

1. Single Responsibility Principle

A module should be responsible to one, and only one actor

=> So that each one has only one reason to change

SRP is about functions and classes
At the level of components, it becomes Common Closure Principle
At the architectural level, it becomes the Axis of Change

2. Open-Closed Principle

Software classes, modules should be open for extension, but closed for modification

Higher-level components in that hierarchy are protected from the changes made to lower-level components

The goal is to make the system easy to extend without incurring a high impact of change. This goal is accomplished by partitioning the system into components, and arranging those components into a dependency hierarchy that protects higher-level components from changes in lower-level components.

3. Liskov Substitution Principle

Objects of a superclass shall be replaceable with objects of its subclasses without breaking the application.

The LSP can, and should, be extended to the level of architecture.

4. Interface Segregation Principle

It is harmful to depend on modules that contain more than you need.

=> if there is any change on the module that you depend on, could lead to recompiled and redeployed, even though nothing that it cared about has actually changed.

Ex: In Java, we have 2 classes which implement 1 interface. And each class need only one method from that interface.

The design violated the ISP so the source code of User 1 will inadvertently depend on op2 andop3, even though it doesnt call them.

=> segregating your big interface into smaller and more specific ones.

5. Dependency Inversion Principle

The Dependency Inversion Principle (DIP) tells us that the most flexiblesystems are those in which source code dependencies refer only toabstractions, not to concretions.

Stable Abstractions: interfaces are less volatile than implementations
Good architects work hard to reduce the volatility of interfaces, try to find ways to add functionality to implementations without making changes to the interfaces.
Dont refer to volatile concrete classes, but refer to abstract interfaces instead.
Dont derive from volatile concrete classes
Dont override concrete functions:

Concrete functions often require source code dependencies. When you override those functions, you do not eliminate those dependenciesindeed, you inherit them. To manage those dependencies, you should make the function abstract and create multiple implementations.

=> should create an interface and implement it.

Never mention the name of anything concrete and volatile

And how to deal with that? We could use Abstract Factory pattern to deal with the creation of volatile concrete objects.

Ex: The Application uses the ConcreteImpl through the Service interface. However, the Application must somehow create instances of the ConcreteImpl.

To achieve this without creating a source code dependency on the ConcreteImpl, the Application calls the makeSvc method of the ServiceFactory interface. This method is implemented by the ServiceFactoryImpl class, whichderives from ServiceFactory. That implementation instantiates the ConcreteImpl and returns it as a Service.

You could see that the curve line separates the abstract from the concrete. And the important thing is:

All source code dependencies cross that curved line pointing in the same direction, toward the abstract side

And the source code dependencies are inverted against the flow of control - which is why we refer to this principle as Dependency Inversion.

(to be continued)

Clean Architecture Notes - Do It Right ( Part 1)

Hiếu Phạm Duy — Sun, 05 Dec 2021 09:09:55 GMT

It doesnt take a huge amount of knowledge and skill to get a programworking. Kids in high school do it all the time.

Uncle Bob said that in Clean Architecture book. I spent 2 weeks for this book. So much knowledge was wrapped in it so I realized that I need to make a note before they come out of my head :)))

I. Do It Right

Getting software right is hard. It takes knowledge and skills that most young programmers havent yetacquired. It requires thought and insight that most programmers dont take the time to develop. It requires a level of discipline and dedication that most programmers never dreamed theyd need. Mostly, it takes a passion for the craft and the desire to be a professional.

And when you get software right, something magical happens: You dont need hordes of programmers to keep it working. You dont need massive requirements documents and huge issue tracking systems.

According to my previous experience, I've worked in a project where the design and architecture of the system made it easy to write and easy to maintain. But the previous version of that project was a bad thing. Code duplication, no design patterns, no OOP made it difficult to change and contained huge risks.

Goal

The goal of software architecture is to minimize the human resources required to build and maintain the required system.

The ultimate goal is always simple and easy to understand. But it requires huge effort to achieve.

Case Study

Uncle Bob gave us a case study from market-leading software products:

First, lets look at the growth of the engineering staff

Now lets look at the companys productivity over the same time period, asmeasured by simple lines of code

You could see that every release is supported by an ever-increasing number of developers, but the their performance has decreased dramatically. And the code was 40 times more expensive to produce in release 8 as opposed to release 1.

Do you think that is bad? Let's imagine what this picture looks like to the executives :)))

Needless to say, that is a nightmare. There is a trade-off between making software right and making it quickly.

A familiar lie

Developers have a familiar lie: We can clean it up later; we just have to get to market first! Of course, things never do get cleaned up later, because market pressures never abate.

Getting to market first simply means that youve now got a horde of competitors on your tail, and you have to stay ahead of them by running as fast as you can. They cant go back and clean things up because theyve got to get the next feature done, and the next, and the next, and the next :)))

I could see it in reality, in any software companies I joined. Products always evolve and you have no time to come back and clean your shit :)))

The fact is that making messes is always slower than staying clean, no matter which time scale you are using.

So:

The only way to go fast, is to go well.

The developers may think that the solution is to start over from scratch and redesign the whole system. Some of the companies I have worked with or know of, they all had to redesign their systems after a long period of rapid development.

The same overconfidence that led to the mess is now telling them that they can build it better if only they can start the race over :)))

Their overconfidence will drive the redesign into the same mess as the original project.

Conclusion

In every case, the best option is for the development organization to recognize and avoid its own overconfidence and to start taking the quality of its software architecture seriously. So you need to know what good software architecture is, to minimize effort and maximize productivity.

(to be continued)

Some notes when writing English emails

Hiếu Phạm Duy — Sat, 13 Nov 2021 13:47:22 GMT

Trong qu trnh i lm, vic phi c v vit email bng ting Anh l khng trnh khi, c bit nu bn lm trong cc cng ty nc ngoi. Hm nay mnh c ngi tng hp li cng nh tham kho mt s ti liu v rt ra mt s lu v vic vit mail ting Anh:

1. Hnh thc cho hi

Dear Sir/Madam: dng khi gi cho mt ai m ta khng bit r
=> kt thc bng Your faithfully
Dear Mr Jones: dng khi ta bit mt ngi no tn Jones
=> kt thc bng Your sincerely
Khi bit r nhau th c th kt thc bng With best wishes hoc With kind regards
hoc n gin hn, khng quan trng lm th c Best

2. Mt s lu khi vit body

Phn ni dung cn thng ip r rng, trnh cc cu qu di v s gy hiu lm
Tr khi gi cho ngi thn, bn nn trnh cc cu hi trc tip, qu thng thng

VD: Would you let me know when we may expect your visit?

thay v When will you visit us?

Trnh ghi con s ngy thng d gy hiu nhm

VD: 4-5-2021 = ngy 4 thng 5 trong ting Anh, nhng trong ting M l ngy 5 thng 4

=> 4th May 2021

Trnh vit tt (I'm, I've seen, I don't) m nn dng I am, I have seen, I do not
Trnh dng ting lng trong vn ni (gonna, wanna, gotta,...)
Trnh dng want cho ngi th nht (I) => dng would like
Trnh dng should cho ngi th 2 (You) => nghe kiu dy bo i tc

3. Mt s cu trc thng gp

a. Cm n ngi nhn v l do g

Thank you for your email about...
Thank you for contacting us...
Thank you for your prompt reply...

b. Trnh by l do vit email

I regret to inform you that...
We are pleased to announce that...
I am writing this email to inform you that...
I am pleased to inform you that...

c. Trnh by mong mun, yu cu

I would be grateful if you would let me know of...
Could you please send us...
If you have any question, please do not hesitate to contact us
Please let us know ...

d. Th hin s bit n/ li cm n

Thank you so much; I really appreciate that!
I appreciate your consideration/assistance/guidance/support
Thank you for sharing your expertise
Thank you for spending time with me
I appreciate your consideration and look forward to hearing from you
I appreciate having the opportunity to speak with you today about the Software Engineer position at GHTK.
I appreciate the time you and the GHTK team spent interviewing me
Thank you so much for referring me for the SE position at GHTK

e. Confirm an interview

Hello Ms. XXX,
Thank you very much for the invitation to interview for the Software Engineer position. I appreciate the opportunity, and I look forward to meeting with you on Mar 2nd 2022 at 9AM in your office.
If I can provide you with further information prior to the interview, please let me know.
Best Regards,

f. Interview Thank You Email

Hello Ms. XXX,
Thank you for taking the time to interview me this morning. I enjoyed our conversation about the SE position and appreciated learning more about how the role works.
I feel confident my experience at XXX, as well as my computer science degree, makes me a good candidate for this position.
I am excited about the possibility of joining GHTK and contributing to its future successes.
I look forward to hearing from you about our potential next steps. If there is any additional information you need, please contact me at phamduyhieuit@gmail.com
Thanks,

IELTS Writing Part 2 - Opinion Ex3

Hiếu Phạm Duy — Sun, 03 Oct 2021 10:12:24 GMT

2021/10/03 Sunday afternoon, at home, alone - because of Covid19

But as I said, we should take advantage of the time you have and practice your skills, such as English skills. And when the pandemic go away, we get a strong English skillset, be confident to go international.

This is the third IELTS writing part 2 I wrote on my blog, a Discuss-Opinion type.

Some people say that the main environmental problem of our time is the loss of particular species of plants and animals. Others say that there are more important environmental problems.

Discuss both views and give your own opinion.

Some people believe that the primary environmental issue of today's society is the loss of particular of species of plants and animals while opponents of this idea believe that humans are facing many other environmental problems which are more important. In my opinion, I agree with the latter idea.

On the one hand, it is undeniable that the extinction of some species of plants and creatures is one of the main environmental issues. Firstly, such a trend may lead to an imbalance of the ecosystem. If the ecosystem loses its balance, there are many negative consequences for the next generations. For example, if birds or bats, which are predators of harmful insects such as grasshoppers, are extinct, those insects will attack crops and humans' lives. Secondly, plants play a vital role in creating and reserving oxygen in the atmosphere, as well as maintaining underground water. In fact, forests are being cut down to make way for farms, which cause erosion in the mountainous areas and flooding during monsoon seasons every year.

On the other hand, the world has to face many other environment-related problems other than the aforementioned issue. One of the main issues which needs to be tackled is environmental pollution. In fact, air, water and waste pollution are the issues that are drawing a great deal of attention from the public. Another issue is global warming, which is posing such a great threat to human life. Climate change has accelerated the rate of ice melting in Antarctica, which causes the sea levels to rise. Higher sea levels are coinciding with more dangerous hurricanes and typhoons that move more slowly and dump more rain, contributing to more powerful storm surges that can strip away everything in their path.

In conclusion, although the disappearance of some species of plants and animals is an important environmental issue, it seems to me that humans still have other problems which are more vital.

320 words

Hope that I can keep practicing my English every day, write some posts each week, and take a higher level after the Covid disaster.

See you soon!!!

Six Things you can do every day to benefit your brain

Hiếu Phạm Duy — Sun, 05 Sep 2021 02:54:42 GMT

I've read this article in a bilingual book and found that it's quite useful for everyone. So I type that on this post, also I practice my English writing.

A mind is a valuable thing to waste. You've heard the saying many times, but it's true. Your mind is your most valuable asset. You need to take care of it. So here's a list of 6 things you can do every day to benefit your brain:

1. Take a nap

Refreshing your body can also help you improve brain function, increase memory, and improve your mood. Even just 15 minutes can make a huge difference in your day-to-day life. Naps improve your brain performance, so why are you still awake??

2. Do something creative just before going to bed

When you're tired, your brain can be more creative. Take advantage! Whether you're writing the next great American Novel or dusting off the old paint brush and canvas, finding your creative just before going to bed can yield great results. So tap your inner Picasso and create something beautiful. Just don't fall asleep with the brush in your hand.

3. Focus on one task at a time

Did you know that it's literally impossible for your brain to multitask? By focusing on one task at a time, you can keep your brain working at maximum capability and accomplish more than you imagined. Find a task you need to finish and focus solely on it. Leave the phone in the other room, turn the TV off, and focus. Your brain will thank you.

4. Do Exercise

You've heard that cardio leads to a healthier and better body. But it also helps the mind. Find 15-30 minutes a day and get moving! You don't need a gym membership or any fancy equipment. Just a walk around the neighborhood can do wonders and benefit.

5. Write. Like on a real piece of paper

Computers, iPads, tablets, smartphones and the connection to the internet everywhere means it's becoming less and less likely that you will pull out a piece of paper and write. But research suggests handwriting makes you smarter. So leave your computer on your desk during the next meeting and write your notes.

6. Take a multi-vitamin daily

Your car needs oil, your smartphone needs a battery, and your brain needs nutrients. A daily multi-vitamin will ensure that you get your body what it needs. And it will help your brain according to research from the British Journal of nutrition. Pro-tip: take your multi-vitamin with a healthy smoothie to get your day off to a great start.

Java 8 Notes

Hiếu Phạm Duy — Sat, 04 Sep 2021 08:32:26 GMT

Java, originally evolved from the Oak language, was born in early 1996 with its major version as Java 1 or JDK 1.0. Java was initially designed and developed by James Gosling at Sun Microsystems. Java 8 or JDK 8.0 is one of the major releases of the Java programming language in 2014.

This article would walk you through new features were added in Java 8:

- Lambda Expressions: a new language feature allowing us to treat actions as objects.

- Method References: enable us to define Lambda Expressions by referring to methods directly using their names.

- Optional: special wrapper class used for expressing optionally

- Functional Interface: an interface with maximum one abstract method; implementation can be provided using a Lambda Expression

- Default methods: give us the ability to add full implementations in interfaces besides abstract methods

- Stream API: a special iterator class that allows us to process collections of objects in a functional manner

1. Lambda Expression

    parameter -> expression    (param_1, param_2) -> expression    (param_1, param_2) -> {code block}

VD:

import java.util.ArrayList;import java.util.List;public class Main {  public static void main(String[] args) {    List<Integer> numbers = new ArrayList<>();    numbers.add(5);    numbers.add(9);    numbers.add(8);    numbers.add(1);    numbers.forEach( (n) -> System.out.println(n));  }}

2. Method References

reference a method, reduce the verbosity of some lambdas.
using double colons:
there are two reference types:

Refer to static method

List<String> messages = Arrays.asList("hello", "phamduyhieu.com", "readers!")// lambda:messages.forEach(word -> StringUtils.capitalize(word));// method referencemessages.forEach(StringUtils::capitalize);

Refer to instance method

public class BicycleComparator implements Comparator<Bicycle> {    @Override    public int compare(Bicycle o1, Bicycle o2) {        return o1.getFrameSize().compareTo(o2.getFrameSize());    }}ArrayList myList = new ArrayList<>();myList.add(new Bicycle("hieu", 20));myList.add(new Bicycle("hieu", 30));myList.add(new Bicycle("hieu", 40));myList.add(new Bicycle("hieu", 10));BicycleComparator myComparator = new BicycleComparator();// lambdaList newList = myList.stream().                sorted((a, b) -> myComparator.compare(a, b)).collect(Collectors.toList());// with method referenceList newList2 = myList.stream().sorted(myComparator::compare).collect(Collectors.toList());

3. Optional

was created to avoid any runtime NullPointerExceptions, eliminate many null checks. Also, we can develop clean and neat APIs.

// before Optionalif (name != null) {    System.out.println(name.length());}// with Optional// deal with nullable values explicitly with a shorter wayopt.ifPresent(n -> System.out.println(n.length()));// orElse() is used to return the wrapped value if it is present, and its argument otherwiseString nullName = null;String nameTest = Optional.ofNullable(nullName).orElse("Hieu");System.out.println(nameTest);

4. Functional Interfaces

is an interface with one single abstract method, no more, no less
support the lambda expression in java 8
Before java 8, we would usually create a class for every case where we needed to encapsulate a single piece of functionality => a lot of unnecessary boilerplate code.

Thread thread = new Thread(new Runnable() {    @Override    public void run() {        System.out.println("my runnable");    }});

If you look at the above code, the actual part that is of use is the code inside run() method. Rest all of the code is because of the way java programs are structured.

Java 8 Functional interfaces and Lambda Expressions help us in writing smaller and cleaner code by removing a lot of boilerplate code.

Thread thread2 = new Thread(() -> System.out.println("my runnable"));

5. Default Method

is a method with an implementation which can be found in an interface.
to add a new functionality to an interface, while maintaining backward compatibility with classes that are already implementing the interface:

public interface Vehicle {    public void move();    default void hoot() {        System.out.println("peep!");    }}

For example, the Collection interface can have a default implementation of the forEach method without requiring the classes implementing this interface to implement the same.

6. Streams

represents a sequence of objects from a source such as a collection, which supports aggregate operations.

int sum = Arrays.stream(new int[]{1, 2, 3, 4, 5})        .filter(i -> i >= 3)        .map(i -> i * 3)        .sum();

in simple terms, a stream is an iterator whose role is to accept a set of actions to apply on each of the elements it contains.
Difference between Map and flatMap:
Both map and flatMap are intermediate stream operations that receive a function and apply this function to all the elements of a stream. But the difference is that for the map, this function returns a value, but for flatMap, this function returns a stream.

IELTS Writing task 2 - Opinion - Ex2

Hiếu Phạm Duy — Sun, 29 Aug 2021 10:41:31 GMT

Sunday flow, I wrote a 2nd writing post, but in Opinion type.

Whether or not someone achieves their aims is mostly by a question of luck. To what extent do you agree or disagree?

It is argued that people's success is mostly attributable to luck instead of their own hard work. While I accept that luck plays a role in helping people reach their targets, I do believe that hard work is a much more crucial factor that contributes to the success of any individual.

On the one hand, I believe that only determined and industrious people will gain success in whatever they do. Firstly, hard-working people usually obtain their goals. For instance, Bill Gates had spent thousands of hours on coding and experimenting before he rolled out the World's most popular operating system - Windows - that we use nowadays. Cristiano Ronaldo, the best football player in the world at the moment, is another clear example of success through hard-working. Despite the modest starting point, he spent many hours of hard training and practicing, worked harder than anyone else in his football team before becoming a football superstar.

On the other hand, luck also plays a role in determining one's achievement. it is not commonly known that Bill Gates was born in a family which had a strong financial background, which gave him a precious opportunity to use and practice with computers at a very early stage of his life. If he had had no chance to meet and become familiar with computers so early, perhaps there would be no Microsoft today. In a book named Outliers, the author Glad Well has strengthened this view that behind the success of great people is always a little luck, success is also partially dependent on many other factors such as the social context, age, ... besides personal efforts.

In conclusion, it is certainly true that in order to achieve success in life, we must be extremely hard working determined till the end, but I believe that luck also plays a part in success.

305 words

Writing may be the hardest part of the IELTS exam because it requires strong formal grammar knowledge besides plentiful vocab resources.

So let's keep calm and practice every day, no pain no gain :)))

See yaaaa!!!!

IELTS Writing Part 2 - Problem Solution - Ex1

Hiếu Phạm Duy — Sun, 29 Aug 2021 09:26:45 GMT

In the 2nd year of Covid, I feel so lucky when I'm still here with the first shot of vaccine, have good health, and a job that I like. I hope that the pandemic will go away as soon as possible.

But in the meantime, we should spend time practicing our skills such as English skills. Let's take advantage of this duration so that when the pandemic disappears, we get a strong English skillset, be confident to go international.

This is my first IELTS writing part 2 I wrote on my blog, which is a Problem-Solution type. Enjoy it :)))

The older people who need employment have to compete with younger ones. What problems can this create? What are some solution?

It is widely argued that elder people have to compete with young people when it comes to applying for a job. Several problems have resulted from this tendency, and they should be tackled by a number of effective solutions from the governments, policy-makers and companies.

There are two main issues that older people have to face with in a competitive job market. Firstly, it is very difficult for them to find jobs due to their ages. In fact, many companies and factories usually employ people in a certain range of ages, which prevents the older generation from applying for these positions. Secondly, younger adapt more easily to changes in workplaces which is a huge disadvantage for older workers because they are not as flexible as younger ones. This may lead to both physical and mental problems, including depression and anxiety.

In response to the problem, Some measures should be taken by policy-makers and firms to help elder people in society. The first solution is that the government should have policies to assist such individuals in accessing more job opportunities. In particular, it is essential that the authorities raise the working-age in some appropriate sectors, which gives the older generation more chances to earn their living. Another solution is that some companies should recruit a larger proportion of older employees for some positions. If age discrimination is eradicated, those old experienced applicants will be given greater opportunities to land suitable jobs.

In conclusion, there are some negative consequences from this trend and appropriate solutions should be implemented by the governments and firms to tackle these issues.

265 words

Hope that I can keep practicing my English every day, write some posts, and take a higher level after the Covid disaster.

See you soon!!!

Why do we need Design Patterns? - Part 2

Hiếu Phạm Duy — Sun, 15 Aug 2021 17:45:56 GMT

Part 1: Why do we need Design Patterns? Inheritance isn't enough!!!

So we know using inheritance has not worked out very well, since the duck behavior keeps changing across the subclasses, and it's not appropriate for all subclasses to have those behaviors.

The Flyable and Quackable interface sounded promising at first - except Java interfaces typically have no implementation code, so no code reuse. And whenever you need to modify a behavior, you're often forced to track down and change it in all the different subclasses where that behavior is defined,probably introducing new bugs along the way :)))

After a long time, some Godfathers from US or somewhere I don't know, they invented the Design Principle:

In other words, take the parts that vary and encapsulate them, so that later you can alter or extend the parts that vary without affecting those that don't.

As simple as this concept is, it forms the basis for almost every design patterns. All patterns provide a way to let some part of a system vary independently of all other parts.

Okay, time to pull the duck behavior out of the Duck classes!

Separating what changes from what stays the same

Different from the problems with fly() and quack(), the Duck class is working well and there are no other parts of it that appear to vary or change frequently. So we're going to leave the Duck class alone.

Separate the parts that change from those that stay the same

So we create two sets of classes (totally apart from Duck), one for fly and one for quack. Each set of classes will hold all the implementations of the respective behavior.

So how are we going to design the set of classes that implement the fly and quack behaviors?

We know that we want to assign behaviors to the instances of Duck, instantiate a new Duck instance with a specific type of flying behavior and then we want to change the behavior dynamically.

Let's look at the second Design Principle

Ex:

Programming to an implementation:

Programming to an interface/superclass:

We'll use an interface to represent each behavior - for instance, FlyBehavior and QuackBehavior - each implementation of a behavior will implement one of those interfaces.

So this time it won't be the Duck classes that will implement the flying and quacking interfaces. Instead, we create a set of behavior classes.

And a behavior does not come either from a concrete implementation in the superclass Duck or by providing a specialized implementation in the subclass ifself. So we do not rely on an implementation.

With the new design, the actual implementation of the behavior won't be locked into the Duck subclass.

With this design, other types of objects can reuse our fly and quack behaviors because these behaviors are no longer hidden away in our Duck classes!

And we can add new behaviors without modifying any of our existing behavior classes or touching any of the Duck classes that use flying behaviors.

Integrating the Duck Behaviors

1. We'll add two instance variables of type FlyBehavior and QuackBehavior

remove the fly() and quack() methods from the Duck class
replace them with two similar methods, called performFly() and performQuack()

2. Implement performQuack()

public abstract class Duck {    FlyBehavior flyBehavior;    QuackBehavior quackBehavior;    // rather than handling the quack behavior itself,    // the Duck object delegates that behavior to the object    // referenced by quackBehavior    public void performQuack() {        quackBehavior.quack();    }}

In this part of code, we don't care what kind of object the concrete Duck is, all we care about is that it knows how to quack().

public class VietnamDuck extends Duck {    // VietnamDuck inherits the quackBehavior and flyBehavior instance variables    // from class Duck    public MallardDuck() {        // use Quack class to handle its quack        // so when performQuack is called, the responsibility for the quack        // is delegated to the Quack object        quackBehavior = new Quack();        // similar with flyBehavior        flyBehavior = new FlyWithWings();    }    public void display() {        System.out.println("I'm a real Mallard duck");    }}

It looks good :)) But I said that we should NOT program to an implementation, right? But in my constructor, I am making a new instance of a concrete Quack implementation class!

Yeah, you can set the duck's behavior type through a setter method on the Duck class, rather than by instantiating it in the duck's constructor.

So I add two new methods to the Duck class

    public void setFlyBehavior(FlyBehavior fb) {        flyBehavior = fb;    }    public void setQuackBehavior(QuackBehavior qb) {        quackBehavior = qb;    }

So just call the duck's setter method.

Take a look at a big picture

So you can see that HAS-A relationship can be better than IS-A. Instead of inheriting and implementing their behavior, the ducks get their behavior by being composed with the right behavior object.

This is an important technique, is the basis of our third design principle:

As you've seen, creating systems using composition gives you a lot more flexibility, lets you change behavior at runtime as long as the object you're composing with implements the correct behavior interface.

Summary

Phewww!!! I just applied the Strategy Pattern to solve code reuse and changing requirement problems. And I showed you that Inheritance is not enough to deal with it.

We need design patterns to create a software which is ready to scale and easy to maintain, adapt with changes.

See you in my next post!!

Why do we need Design Patterns? Inheritance isn't enough!!! - Part 1

Hiếu Phạm Duy — Sun, 15 Aug 2021 12:25:34 GMT

I will tell you a story :))

I had a game which can show a large variety of duck species swimming and making quacking sounds. The initial designers of the system used standard OO techniques and created one Duck super class from which all other duck types inherit.

It looks good, right??

But in the last year, the company has been under increasing pressure from competitors. After a week-long brainstorming session, the company executives think it's time for a big innovation :vvv. They need something really impressive to show at the upcoming shareholders meeting in Hanoi next week.

Now we need the ducks to FLY. And of course my manager told them it'll be no problem for us to just whip something up in a week :))

I just need to add a fly() method in the Duck superclass and then all the ducks will inherit it. Genius!!!

But something went horribly wrong...

A shareholder told that they just saw a demo and there were rubber duckies flying around the screen :(( wtf !!!!

Because when I added new behavior to the Duck superclass, I was also adding behavior that was not appropriate for some Duck subclasses.

Okay, so there's a slight flaw in my design. What I thought was a great use of inheritance for the purpose of reuse has not turned out so well when it comes to maintenance.

I realized that inheritance probably was not the answer, because I just got a memo that says that the executives now want to update the product every six months (in ways they have not yet decided on). I knows the spec will keep changing and he'll be forced to look at and possibly override fly() and quack() for every new Duck subclass that's ever added to the program... forever!

So I need a cleaner way to have only some (not all) of the duck types fly or quack. I decided that take the fly() out of the Duck superclass and make a flyable() interface.

Thay way, only the ducks that are supposed to fly will implement that interface and have a fly() method... I also make a Quackable() too, since not all ducks can quack.

It's good, right??

Could you think about that? the dumbest idea I've come up with. If I thought having to override a few methods was bad, when I need to make a little change to the flying behavior... in all 50 of the flying Duck subclasses???

It solved part of the problem, but it completely destroys code reuse for those behaviors, and just creates a different maintenance nightmare!!!

And of course there might be more than one kind of fly behavior even among the ducks that do fly... Because I've been making a game, anything could happen.

And now I felt disappointed with those above solutions. And Design Patterns comes and saves the day.

Wouldn't it be dreamy if there were a way to build software so that when we need to change it, we could do so with the least possible impact on the existing code?

Yeaa we could spend less time reworking code and more making the program do cooler things...

Ohh! the post was quite long :))) Let's me show it in another post, haa :))

See you!!!

Adapter Pattern and Java example

Hiếu Phạm Duy — Sun, 15 Aug 2021 09:58:12 GMT

This is a pattern which helps you put a square peg in a round hole. Sound impossible? Not when we have Adapter.

The Adapter Pattern converts the interface of a class into another interface the clients expect.

In fact, Adapter all around us!!! You'll have no trouble understanding what an OO adapter is because the real world is full of them. This is an example:

So if it walks like a duck and quacks like a duck, then it might not be a duck but a turkey wrapped with a duck adapter.

Coding

It's time to see an adapter in action. I will create a simple interface of the Duck.

public interface Duck {    public void quack();    public void fly();}

And here's a subclass of Duck, the VietnamDuck.

public class VietnamDuck implements Duck {    public void quack() {        System.out.println("Quack");    }    public void fly() {        System.out.println("I'm flying");    }}

I create a new animal: WildTurkey

public interface Turkey {    public void gobble();    public void fly();}public class WildTurkey implements Turkey {    public void gobble() {        System.out.println("Gobble");    }    public void fly() {        System.out.println("I'm flying a short distance");    }}

Now we have some Duck objects and you'd like to use some Turkey objects in their places. Obviously we can't use the turkeys outright because they have a different interface.

So let's write an Adapter

// First, you need to implement the interface of the type you're adapting to.// This is the interface your client expects to see.public class TurkeyAdapter implements Duck {    Turkey turkey;    // we need to get a reference to the object that we are adapting    // here we do that through the constructor    public TurkeyAdapter(Turkey turkey) {        this.turkey = turkey;    }    // now we need to implement all the methods in the interface;    // turkey can't do long-distance flying like ducks, so we call 5 times    @Override    public void fly() {        for (int i = 0; i < 5; i++) {            turkey.fly();        }    }    // translate from quack to gobble method    @Override    public void quack() {        turkey.gobble();    }}

Test the adapter

Now we just need some code to test drive our adapter:

public class DuckTest {    // this is a client is implemented against the target interface    static void testDuck(Duck duck) {        duck.quack();        duck.fly();    }    public static void main(String[] args) {        // let's create a Duck and a Turkey        VietnamDuck duck = new VietnamDuck();        WildTurkey turkey = new WildTurkey();        // And then wrap the turkey in a TurkeyAdapter, which makes it look like a Duck        Duck turkeyAdapter = new TurkeyAdapter(turkey);        System.out.println("The Turkey says ...");        turkey.gobble();        turkey.fly();        // using testDuck method which expects a Duck object input        System.out.println("\nThe Duck says ...");        testDuck(duck);        // now we try to pass off the turkey as a duck...        System.out.println("\nThe TurkeyAdapter says ...");        testDuck(turkeyAdapter);    }}

Let's run it and see...

Explain

I could explain about how the Client uses the Adapter

The client makes a request to the adapter by calling a method on it using the target interface.
The adapter translates the request into one or more calls on the adaptee using the adaptee interface.
- The Adapter implements the target interface and holds an instance of the Adaptee.
  Eg: TurkeyAdapter implemented the target interface, Duck. And Turkey was the adaptee interface.
The client receives the results of the call and never knows there is an adapter doing the translation.

Summary

Adapter makes your software more flexible.

Now despite having defined the pattern, we haven't told you the whole story yet. There are actually two kinds of adapters: object adapters and class adapters. But I will cover it in another post :)))

See yahh!!!

Slice in Go

Hiếu Phạm Duy — Fri, 06 Aug 2021 16:26:36 GMT

Compare to an array with a fixed size, the slice is a dynamically-sized, flexible view into the elements of an backing array.

Just points to other values that it stores.
Can only contain the same type of elements.
When a slice is created by slicing an array, that array becomes the backing array of that slice
Pointer: point to the position of the first element of its backing array
A nil slice does not have a backing array but it has a slice header
Capacity: all about the length of a backing array and where a slice starts
Empty slices usually do not allocate a new backing array, they use the same array
Go allocates different backing arrays for each slice
The length describes the length of a slice but a capacity describes the length of the backing array beginning from the first element of the slice
APPEND: when the capacity is full => append() allocates a new and a larger array (double), copy old values to the new array (costly)
=> return a new slice header that points to the newly allocated array
=> reduces the number of allocations by thinking about the future growth of the slice
When append: append a new element to the given slice and return a new slice, does not change the given slice unless you overwrite the result of the append function back to the original slice
Keyed slice: works like the same as a keyed array
Use full slice expressions (using capacity when slicing) to prevent other code to append more elements to a slice's backing array.
Make: initializes and returns a slice with the given length and capacity
=> using to prevent reallocating backing array by allocating a large enough back array
Using make to create a new slice and then using append: will append after the length of the slice
How to clean up memory with large array: making them lose reference to its backing array (assign it to a new empty slice)

Thơ con cóc

Hiếu Phạm Duy — Thu, 05 Aug 2021 02:11:40 GMT

Thng t v gi chnh vnh u ng..Mo cau bun vng v nng khng sangChiu vng hoe ch qu cng vi vng..V ta li bun mnh mang nh M..H vn sang..vn nng nn nh th..Ma vn ging hay tc M bc mu.?Con bit ri - M chng ni ra u..C trng ngng  mi chiu c qunh..Cn nh c gi leo vo tng vch..Mi hin bun t tch ht sng m..L Chui kh m xo xc bn thm..V con bit - M  thm mt tui .. KD - su tm

E.Musk

Hiếu Phạm Duy — Mon, 19 Jul 2021 18:10:33 GMT

i vi Gracias, nh u t ca Tesla, SpaceX v l bn ca Musk, giai on nm 2008 cho anh bit tt c nhng g cn bit v con ngi Musk.

Anh nhn thy mt ngi n ng n nc M vi hai bn tay trng, mt mt a con, b bo ch v v c bu riu v sut na mt nt s nghip.

Anh y c kh nng lm vic chm ch hn v chu ng p lc tt hn bt c ai ti tng gp, Gracias nhn xt. Nhng g anh y phi tri qua vo nm 2008 c th nh gc bt k ai khc. Anh y khng ch sng st . Anh y vn lm vic v kin nh. Kh nng kin nh trong khng hong l mt trong nhng li th ch yu ca Musk so vi cc gim c v i th khc.

Hu ht mi ngi u tr nn nng ny di nhng p lc , Gracias cho bit. H s ra nhng quyt nh sai lm. Nhng Elon li cc l tr. Anh y vn c kh nng a ra nhng quyt nh rt r rng cho di hn. Cng kh khn, anh y cng lm tt hn. Bt k ai chng kin anh y vt qua mi kh khn s lp tc phi khm phc con ngi ny. Ti cha bao gi thy ai c kh nng chu c gian kh nh anh.

Ký ức về những tháng 6 khó khăn

Hiếu Phạm Duy — Tue, 06 Jul 2021 15:27:38 GMT

Khi Thn nng nga mt v hng TyL Cha v nhng ng cy xiu voBn cn ti m chn th c ho...Trn cnh ng, ai ht cho than hoa...Kia ng lng ngy y nh bng Cha..Chiu tt nng, con Tru gi mm mmng kht nc nghe bn si lp bp trong lng, khi bp v nng rm...t n du soi sng cnh mm cm...Bt canh nng thm thm mi rau NgiTrng gia h bng trn ri xung mi.. ngng ngng c gi tm ao qu...- Su tm -

4 nguyên lý cơ bản trong OOP và ví dụ dễ hiểu bằng Python

Hiếu Phạm Duy — Sun, 04 Jul 2021 08:37:02 GMT

OOP sinh ra nhm t chc m ngun tt hn, v lm cho vic lp trnh ging nh vic t chc qun l cc i tng trong th gii thc. Trong OOP, ngi ta c 4 nguyn l c bn: tru tng, ng gi, k tha v a hnh.

Hm nay mnh s gii thch chi tit theo hiu ca mnh v 4 nguyn l ny cng vi code Python minh ha.

1. Abstraction tnh tru tng

Thit lp mc phc tp m 1 ngi tng tc vi h thng, giu i cc chi tit phc tp hn.

y c ng no dng my pha c ph ri th bit, ng n vi nt l c cc c ph ngon, nhng bn trong n lm rt nhiu cng on m chng ta khng cn quan tm n nh no. y l 1 v d tru tng thc t. V khi c 1 update g bn trong phn mm ca my cng him khi nh hng ti cch s dng my pha bn ngoi.

Vic pha c ph cng ging nh vic s dng object coffee_machine to 1 cc c ph - a_cup_of_coffee. Tt c ch cn gi hm make_coffee, vic bn trong hm hot ng nh th no ngi gi khng cn quan tm. V khi c thay i hot ng bn trong hm make_coffee th cng khng nh hng ti vic bn khc gi hm ny.

2. Encapsulation tnh ng gi

Nm na l vic ng gi data li v kim sot vic truy cp v thay i data t bn ngoi. V d trong Java c setter, getter kim sot vic truy cp vo mt bin. Mt bin l private th khng th truy cp hay chnh sa trc tip t pha ngoi class hay object.

Nu khng th d hay b m h gia 2 khi nim ng gi v tru tng. u l n y nhng ng gi th l information hiding, cn tru tng th l implementation hiding.

Python khng c keyword kiu private hay protected nh bn Java m quy c theo cch t tn bin: 1 du gch di set 1 bin thnh internal use (vn c th public access nhng b cnh bo), v 2 du gch di set thnh private (khng th public access)

3. Inheritance tnh k tha

Vic tha hng li nhng g ngi khc li =))

Trong lp trnh, k tha l cch 1 lp c th tha hng li nhng thuc tnh, method t 1 lp khc, s dng hoc override chng.

In other words, I can say that =))) k tha dng biu din mi quan h c bit ha tng qut ha gia cc lp.

VD: c nhiu loi t, nhng u c 1 s c im chung: 4 bnh, cc phng thc khi ng, run, phanh, tng tc, nn s k tha chung 1 class cha l Car cha cc thuc tnh v phng thc chung thay v phi vit i vit li nhiu class. V khi cn sa th cng i tng class sa, rt mt cng v thi gian, li cn d li.

=> Gia tng vic ti s dng code, gip ta d nng cp v bo tr

Trong code trn, cc class con khi k tha class Person s khng phi implement li cc method chung na m c th dng ngay.

4. Polymorphism tnh a hnh

Hai hay nhiu lp s c chung phng thc nhng li c implement theo cc cch khc nhau thc hin 1 hnh ng theo nhiu cch khc nhau.VD: cng 1 method c k tha lp cha, mi lp con c th override theo cch ring, hot ng khc nhau. Cng l con vt c th ku nhng ch, mo, chut s ku theo cc cch khc nhau.

a hnh cn gip cho vic s dng 1 class con v cha l nh nhau. N gip cho chng trnh chng ta vit tr nn linh hot hn.

Method bn trn ch cn quan tm kiu object n nhn vo l Animal, nn khi chng ta truyn vo Cat object hay Dog object th n u chy bnh thng.

Trn y l s qua v 4 nguyn l c bn ca OOP. Hn cc ae trong cc bi vit sau nha!!

Ti i ng y ch 1h ri 😣😣😣 Nh th like + th tim nu thy hu ch nha 😡😡😡

Cách xây dựng một mạng Nơ-ron đơn giản chỉ bằng Python

Hiếu Phạm Duy — Sun, 04 Jul 2021 08:33:01 GMT

Bi vit c dch li t bi How to build your own Neural Network from scratch in Python link. Bi vit ny hin c 44k up vote t cng ng 😘😘😘

Let's Go!

Motivation: L mt phn trong hnh trnh c nhn ca ti hiu r hn v Deep Learning, ti quyt nh xy dng Mng li thn kinh t u m khng cn th vin hc su nh TensorFlow. Ti tin rng vic hiu c hot ng bn trong ca Mng thn kinh l iu quan trng i vi bt k Nh khoa hc d liu no.

Bi vit ny cha nhng g ti hc v hy vng n cng s hu ch cho bn!

Mng N-ron l g?

Hu ht cc vn bn gii thiu v Mng n-ron a ra cc m t ging b no ca con ngi. Khng i su vo cc cch ny, ti thy d dng hn khi m t Mng thn kinh l mt hm ton hc nh x mt u vo nht nh n mt u ra mong mun.

Mng n-ron bao gm cc thnh phn sau:

Mt tng u vo (input layer), x
Mt s lng ty cc tng n (hidden layer)
Mt lp u ra (output layer),
Mt tp hp cc trng s v lch gia mi lp, W v b
Hm kch hot cho mi tng n (activation function), . Trong hng dn ny, chng ti s s dng hm kch hot Sigmoid.

S bn di hin th kin trc ca Mng n-ron 2 lp (lu rng lp u vo thng b loi tr khi m s lp trong Mng n-ron)

To mt lp Mng N-ron vi Python tht d dng !!!

class NeuralNetwork:    def __init__(self, x, y):        self.input      = x        self.weights1   = np.random.rand(self.input.shape[1],4)         self.weights2   = np.random.rand(4,1)                         self.y          = y        self.output     = np.zeros(y.shape)

Hun luyn mt mng N-ron

u ra ca Mng n ron 2 lp n gin l:

Bn c th nhn thy rng trong phng trnh trn, trng s W v lch b l cc bin duy nht nh hng n u ra .

ng nhin, cc gi tr ph hp cho cc trng s v lch quyt nh chnh xc ca cc d on. Qu trnh tinh chnh cc trng s v lch t d liu u vo c gi l hun luyn mng N-ron.

Mi ln lp ca qu trnh o to bao gm cc bc sau:

Tnh ton u ra d on , c gi l feedforward
Cp nht cc trng s v lch, c gi l backpropagation

Biu tun t di y minh ha qu trnh.

Feedforward (lan truyn tin)

Nh chng ta thy trong biu tun t trn, feedforward ch l php tnh n gin v i vi mng thn kinh 2 lp c bn, u ra ca Mng thn kinh l:

Hy thm mt hm feedforward trong m python lm iu . Lu rng n gin, chng ta gi s cc lch l 0.

class NeuralNetwork:    def __init__(self, x, y):        self.input      = x        self.weights1   = np.random.rand(self.input.shape[1],4)         self.weights2   = np.random.rand(4,1)                         self.y          = y        self.output     = np.zeros(self.y.shape)    def feedforward(self):        self.layer1 = sigmoid(np.dot(self.input, self.weights1))        self.output = sigmoid(np.dot(self.layer1, self.weights2))

Tuy nhin, chng ta vn cn mt th g nh gi "mc tt" ca cc d on ca chng ta (tc l d on c ging vi u ra mong mun khng)? Hm mt mt cho php chng ta lm chnh xc iu .

Loss Function

C rt nhiu loss function c sn, v bn cht vn ca chng ta nn quyt nh la chn loi hm no. Trong hng dn ny, chng ta s s dng hm sum-of-squares error l hm mt mt ca chng ti.

Ngha l, li tng bnh phng ch n gin l tng ca s khc bit gia mi gi tr d on v gi tr thc t. S khc bit l bnh phng chng ta o gi tr tuyt i ca s khc bit.

Mc tiu ca chng ta trong hun luyn l tm ra tp hp trng s v lch tt nht gip gim thiu loss function.

Backpropagation

By gi chng ti o c li d on (mt), chng ti cn tm cch truyn li li v cp nht cc trng s v sai lch ca chng ti.

c th iu chnh trng s v lch mt cch thch hp, chng ta cn bit o hm ca hm mt mt i vi cc trng s v lch.

Nh li t php tnh rng o hm ca hm n gin l dc ca hm.

Nu chng ta c o hm, chng ta ch cn cp nht cc trng s v lch bng cch tng/gim vi n (tham kho s trn). iu ny c gi l dc gc.

Tuy nhin, chng ta c th trc tip tnh ton o hm ca hm mt mt i vi trng s v lch v phng trnh ca hm mt mt khng cha trng s v lch. Do , chng ta cn quy tc chui gip chng ta tnh ton n.

i ch! iu tht xu x nhng n cho php chng ta c c nhng g chng ta cn - o hm ( dc) ca hm mt mt i vi cc trng s, chng ta c th iu chnh cc trng s cho ph hp.

By gi chng ta hy thm chc nng backpropagation vo m python ca chng ta.

class NeuralNetwork:    def __init__(self, x, y):        self.input      = x        self.weights1   = np.random.rand(self.input.shape[1],4)         self.weights2   = np.random.rand(4,1)                         self.y          = y        self.output     = np.zeros(self.y.shape)    def feedforward(self):        self.layer1 = sigmoid(np.dot(self.input, self.weights1))        self.output = sigmoid(np.dot(self.layer1, self.weights2))    def backprop(self):        # application of the chain rule to find derivative of the loss function with respect to weights2 and weights1        d_weights2 = np.dot(self.layer1.T, (2*(self.y - self.output) * sigmoid_derivative(self.output)))        d_weights1 = np.dot(self.input.T,  (np.dot(2*(self.y - self.output) * sigmoid_derivative(self.output), self.weights2.T) * sigmoid_derivative(self.layer1)))        # update the weights with the derivative (slope) of the loss function        self.weights1 += d_weights1        self.weights2 += d_weights2

hiu su hn v ng dng tnh ton v quy tc chui trong backpropagation, ti thc s khuyn bn nn xem hng dn ny ca 3Blue1Brown.

{@embed: https://www.youtube.com/watch?v=tIeHLnjs5U8}

Thc Nghim

By gi chng ta c m python hon chnh thc hin feedforward v backpropagation, hy ng dng Mng n-ron ca chng ta vo mt v d v xem n hot ng tt nh th no. Bn di l d liu hun luyn n gin:

Mng n-ron ca chng ta s hc mt tp hp trng s l tng biu din c hm ny. Thng th chng ta s cho dng vic hc khi ta xp x c hm trnh vn overfit.

Hy hun luyn Mng n-ron vi 1500 ln lp v xem iu g s xy ra. Nhn vo biu biu din hm mt mt trn mi ln lp bn di, chng ta c th thy r s mt mt n iu gim dn v mc ti thiu. iu ny ph hp vi thut ton gim dc m chng ta tho lun trc .

Chng ta hy xem d on cui cng (u ra) t Mng n-ron sau 1500 ln lp.

Done =)) Thut ton feedforward v backpropagation ca chng ti o to mt mng n-ron thnh cng v cc d on c hi t trn cc gi tr thc.

Lu rng c mt s khc bit nh gia d on v gi tr thc t. iu ny l cn thit, v n trnh vic m hnh b overfitting v cho php Mng n-ron d on tt hn vi d liu mi.

Hnh trnh ca chng ta vn cha kt thc. Vn cn nhiu iu tm hiu v Mng n-ron v Hc su (Deep Learning). V d:

Chng ta c th s dng hm kch hot no khc ngoi Sigmoid?
S dng tc hc (learning rate) nh th no khi o to Mng n-ron?
S dng nhn tch chp (convolution) cho cc tc v phn loi hnh nh

Tm Vy Li

Ti hc c rt nhiu iu khi vit Mng n-ron t u ch vi Python m khng dng bt c th vin c sn no.

Mc d cc th vin Deep Learning nh TensorFlow v Keras gip ta d dng xy dng cc mng li su m khng hiu y hot ng bn trong ca Mng n-ron, nhng ti thy rng vic xy dng li c li cho cc nh khoa hc d liu ang khao kht hiu su hn v Mng n-ron.

This exercise has been a great investment of my time, and I hope that itll be useful for you as well!

Tính toán bất đồng bộ quy mô lớn ở Facebook

Hiếu Phạm Duy — Sun, 04 Jul 2021 08:27:40 GMT

Chng ta ln Face mi ngy, tuy nhin khng phi ai cng ch ti rng Facebook x l cc tng tc ca chng ta nh th no ng khng ^^ Trn thc t, h thng ca Facebook phi x l hng t request mi ngy. Do , nhng request ny phi c x l bt ng b trnh vic h thng b qu ti, b chm v UX cng v th m gim.

1. Thi s khai

Ban u Facebook s dng h thng bt ng b n gin: tt c request bt ng b c x l v lu tr vo database tp trung, mt b phn iu phi s truy vn, la chn (n trc x l trc) v gi request ti worker. Trng n gin vl =)))

Khi lng request tng, h chy thm nhiu woker hn.

Tuy nhin i nh m 😂😂😂. H thng b nghn lc cao im nhng li ngi chi xi nc nhng khung gi khc nh ban m chng hn, gy lng ph ti nguyn. Cc k s phy bc cng c gng maintain nhng khng gii quyt trit vn ny.

C hng t thch thc trong vic x l request ca Facebook, c th k n nh:

S u tin: x l request quan trng trc (nhng lm sao h thng xc nh c request no quan trng hn???)
Ti u kh nng x l: lm sao ti nguyn khng b nhn ri khi khng phi lc cao im?
Quy nh s dng ti nguyn: lm sao m bo cc request khng s dng qu nh mc ti nguyn thng thng?

2. Cch m Phy Bc x l

Trc ht Facebook h chia request thnh 3 loi:

Daily traffic: n t vic bn lt fb hng ngy, comment, th phn n nh crush, cc nh th livestream mi cui tun => tng i d on
Cc s kin ln: tt, world cup hay live stream ca nhng ngi ni ting s c nhiu ngi xem v tng tc (v d nh 300k ca c Phng Hng =)))). Traffic ca n c th d on c phn no khi chng ta bit trc v s kin v c th chun b cho traffic tng t bin.
Cc s kin bt thng: cc s kin ny ging cc s kin ln nhng chng ta k bit trc n, thng xy ra trong 1 thi gian ngn vi lng truy cp/xem tng vt v tr li bnh thng trong vng vi pht/gi. (v d nh livestream 1 v tn cng khng b chng hn 😅😅)

Ngoi ra, do nhu cu kinh doanh, cc request cng level cn c th c u tin khc nhau.

V cch m Facebook x l theo mnh nh gi l kh hay 😁😁😁

u tin x l ty theo tr chp nhn c ca request
Cc request quan trng cn x l cng nhanh cng tt s c u tin (thng bo livestream, pht hin ng nhp bt thng, ), cn li nhng request c th delay vi giy s x l sau (like, comment). Nm na y ng Facebook duy tr tr c th chp nhn c t ngi dng. V d khng ai chp nhn chuyn thng bo livestream n tr c 10 pht c, lc nhn noti c khi ngi ta stream xong m ri 😆😆😆. Nhng thng bo like ca 1 bc nh crush post hm qua th c th n tr 1 2 pht cng khng vn g.
X l request linh hot theo thi gian
D don trc d liu ngi dng s s dng, x l trc v lu vo cache. Khi ngi dng cn, h thng s c t cache lun m khng cn x l. Cng nhn l fb khn 😌😌😌. Nh vy l h chuyn vic tnh ton t lc cao im sang lc nhn ri, gip gim tr, ti u ti nguyn v chp nhn c th x l trc b tha.

Hoc hon vic x l request li trong giai on cao im v x l sau. V d khi bn ti 1 video nng ln fb, n s to thng bo video ca bn ang c x l v s c thng bo khi hon tt.
X l theo batch
Cch x l theo batch th cng khng cn xa l g. Tuy nhin vn y khng ch worker b qu ti, m b phn x l bt ng b (a job vo queue, ly ra, la chn v gi ti worker) cng b qu ti. Vic fb gp cc request li thnh 1 job ln, sau mi gi cho pha worker gip gim ti cho b phn trung chuyn ny. ( vic ny khng ci thin cho pha worker do sau vn phi tch thnh cc request nh ban u ri x l).
VD thc t lun cho nng =)) ng no m HN th khng l g cnh xe ch thc n cho trim chy y ng, phng nhanh lng lch khip vcl 😣😣😣 Th y i xe iu phi ny cng ging nh b phn x l bt ng b trong h thng karaoke, nhn request t 1 qun trong h thng, sau iu o ti ni c nhu cu. V trong gi cao im tm 9 10h ti ht tng 1 sang tng 2 th vic iu o cng hay b qu ti 😅😅😅. V ng nhin h cng thng minh p dng x l request theo batch (1 xe thng ch 4, 5 o ch ti nm nay 70 tui cha thy xe no chy n l bao gi c 😘😘😘
Quy nh vic phn b ti nguyn
Khi request s dng vt qu lng ti nguyn c phn b, cnh bo s c gi ti cc k s v request s b gii hn v ti nguyn theo hn ngch phn b ban u. Tuy nhin, nh ni trn, fb c tnh n cc s kin bt thng, vy nn h vn phi m bo hn ngch ti nguyn linh hot khng gii hn ti nguyn ca cc s kin ny.
Ngoi ra, khng ch c cc worker hay b phn x l bt ng b ca h thng b qu ti m ngay c b phn tip nhn request cng c th b qu ti => cn phi gii hn ngay t giai on tip nhn request u vo m bo khng b tc nghn giai on u.

3. Cc thch thc mi

Cc gii php trn gii quyt phn no cc vn hin ti ca fb. Tuy nhin, vn cn nhiu thch thc ang ch fb vo vic =)))

H thng ngy cng tr nn phc tp, kh khn hn khi x l s c => cn c cc cng c x l s c trc quan hn, d dng hn
Cn c quy trnh qun l v gii trnh ti nguyn tt hn cho c ngi dng v ngi duy tr h thng. N s gip chng ta d on v nhu cu trong tng lai tt hn v ci thin vic phn b ti nguyn 1 cch cng bng cho cc request, m bo tnh linh hot cho 1 s trng hp c traffic bt thng. Ci ny chn chn ng fb phi dng AI h tr.

Chng ta hy cng lt dp hng fb h gii quyt cc vn ca h d lo ri hc hi nha.

Bi ny n y hi di ri, ti chim ct y 😜😜😜 See yaaa!!!

thy hay th cho 1 like + th tim nha 👌👈 khng th gi a ch y 😡😡😡

Ti liu tham kho

https://engineering.fb.com/2020/08/17/production-engineering/async/

SOLID trong OOP và ví dụ dễ hiểu bằng Python

Hiếu Phạm Duy — Sun, 04 Jul 2021 06:35:37 GMT

Th SOLID l g? SOLID l cng 😜😜😜

a t 🤣🤣🤣 y l cc nguyn l thit k trong OOP, c ghp li t cc ch ci u ca Single Responsibility, Open Close Principle, Liskov Substitution Principle, Interface Segregation v Dependency Inversion.

Hm nay mnh s i vo tng quan khi nim, sau ly v d bng Python cho cc con v d hiu nh =))

1. Single Responsibility

Mi class ch nn c 1 trch nhim duy nht
V lu di, nu khng p dng S, class s phnh to ra, kh kim sot v maintain.

# violate the Single Responsibility Principleclass Animal:    def __init__(self, name):        self.name = name    def get_name(self):        pass    def save_to_db(self, animal):        # save to MySQL        pass# comply with Single Responsibility Principleclass Animal:    def __init__(self, name):        self.name = name    def get_name(self):        passclass AnimalDB:    def get_animal(self, a_id):        pass    def save_to_db(self, animal):        pass

V d ta c class Animal, trong gm c method lu object vo database - save_to_db. Vi thit k ny, khi chng trnh thay i database, ta phi m vo sa class gc ny. Thay vo ta to thm 1 class nh dnh ring cho vic lu tr database l AnimalDB. Vic ny gip cho vic sa cha n gin, r rng hn, t bug hn.

2. Open Close Principle

Nn m rng class thay v sa i class gc
Khi thm chc nng, ta nn m rng class c (k tha, s hu) m trnh vic sa n
=> d gy li tim n khi cc module khc ang s dng class c.

class Discount:    def __init__(self, customer, price):        self.customer = customer        self.price = price    def get_discount(self):        return self.price * 0.2# when we need to add discount for VIP customers => change Discount classclass Discount:    def __init__(self, customer, price):        self.customer = customer        self.price = price    def give_discount(self):        if self.customer == 'fav':            return self.price * 0.2        if self.customer == 'vip':            return self.price * 0.4

Thit k trn vi phm nguyn l OCP, gi d mi tun c thm 1 case khch hng mi, chng ta li phi sa class Discount, logic hm give_discount s di ra v tn 😖😖😖

Thay vo , ta nn to 1 class mi k tha class c 😘😘😘

class Discount:    def __init__(self, customer, price):        self.customer = customer        self.price = price    def get_discount(self):        return self.price * 0.2class VIPDiscount(Discount):    def get_discount(self):        return super().get_discount() * 1.2class SuperVIPDiscount(Discount):    def get_discount(self):        return super().get_discount() * 1.5class DiamondDiscount(Discount):    def get_discount(self):        return super().get_discount() * 2

3. Liskov Substitution Principle

Cc class con c th thay th class cha m khng lm thay i tnh ng n ca chng trnh
m bo tnh a hnh trong OOP

VD: vit chng trnh m t cc loi chim bay

C class chimcanhcut cng l chim nn cho k tha class Bird

=> Khi gi hm bay ca object chim cnh ct s b Exception

=> thit k ny vi phm nguyn l LSP

Nm na khi thit k class phi ch , trnh b nguyn cc mi quan h ca cc object ngoi i sng vo code.A l B khng c ngha l A nn k tha B (nu class A khng th thay th c class B)

class Animal:    def leg_count(self):        passclass Lion(Animal):    def leg_count(self):        passdef animal_leg_count(animal: Animal):    print(animal.leg_count())animal = Animal()lion = Lion()animal_leg_count(animal)animal_leg_count(lion)

Vi thit k bn trn, object lion c th thay th object animal t class Animal m chng trnh vn chy ng.

4. Interface segregation Principle

Nn tch interface thnh cc interface nh hn phc v cho nhng mc ch c th

# violate Interface segregationclass MyInterface:    def connect_to_db(self):        raise NotImplementedError    def write(self):        raise NotImplementedError    def read(self):        raise NotImplementedError    def close_connect(self):        raise NotImplementedError    def show_info(self):        raise NotImplementedError    def update_info(self):        raise NotImplementedError# comply with Interface segregationclass DBInterface:    def connect_to_db(self):        raise NotImplementedError    def write(self):        raise NotImplementedError    def read(self):        raise NotImplementedError    def close_connect(self):        raise NotImplementedErrorclass DisplayInterface:    def show_info(self):        raise NotImplementedError    def update_info(self):        raise NotImplementedError

Vi Interface MyInterface, cc class khi implement MyInterface s phi implement tt c cc method trong n. iu ny thnh ra bt hp l, i khi gy d tha v 1 class i khi khng dng ht tt c cc method. V vy ta nn chia thnh cc interface nh (DBInterface, DisplayInterface) gm cc method lin quan n nhau, d qun l, d implement hn.

5. Dependency Inversion Principle

Cc module cp cao khng nn ph thuc vo cc module cp thp, c hai nn ph thuc vo abstraction
Ta c th thoi mi sa i implement ca module cp thp m khng lm nh hng ti module cp cao
Trong code thc t, cc module nn lin kt vi nhau thng qua interface

class IFood:    def bake(self):        raise NotImplemented    def eat(self):        raise NotImplementedclass Pizza(IFood):    def bake(self):        print("pizza was baked")    def eat(self):        print("pizza was ate")class Bread(IFood):    def bake(self):        print("bread was baked")    def eat(self):        print("bread was ate")class Production:    def __init__(self, food: IFood):        self.food = food    def produce(self):        self.food.bake()    def consume(self):        self.food.eat()if __name__ == '__main__':    pizza = Pizza()    bread = Bread()    p = Production(pizza)    p.produce()    p.consume()    b = Production(bread)    b.produce()    b.consume()

y ta c cc module cp thp l Bread v Pizza, module cp cao l Production. 2 module ny giao tip vi nhau bng interface IFood, gip cho chng trnh tr ln linh hot hn. Module Production ch cn s dng cc method trong IFood m khng b rng buc hay cn quan tm object no s c truyn vo. Ta c th truyn vo pizza hoc bread.

Phewww!!! Th l ti va chm xong t l thuyt cng nh code thm t Python minh ha cho cc con v ri nh 😡😡😡

Thy hay th up vote nha cc ty 😘😘😘

Nay ch nht ti i dn nh y khng v n ch i mt ra y 😖😖😖

See yaaaa!!!!